RankModelProviderScore (0-100)SamplesContextPrice / 1M tokens
1
G
gemini-3-pro Google
100.0
1K
1.05M
¥14.4 / ¥86.4Input/Output
2
A
claude-opus-4-6-thinking Anthropic
98.4
467
1M
¥36 / ¥180Input/Output
3
M
muse-spark Meta
96.9
297
-
-
4
A
claude-opus-4-7 Anthropic
95.3
413
1M
¥36 / ¥180Input/Output
5
A
claude-opus-4-7-thinking Anthropic
93.8
440
1M
¥36 / ¥180Input/Output
6
G
gemini-3.1-pro-preview Google
92.2
1.1K
1.05M
¥14.4 / ¥86.4Input/Output
7
A
claude-opus-4-6 Anthropic
90.6
598
1M
¥36 / ¥180Input/Output
8
G
gemini-3-flash Google
89.1
1.5K
1.05M
¥3.6 / ¥21.6Input/Output
9
G
gemini-3-flash (thinking-minimal) Google
87.5
1.3K
1.05M
¥3.6 / ¥21.6Input/Output
10
X
grok-4.20-beta-0309-reasoning Xai
85.9
603
2M
¥14.4 / ¥43.2Input/Output
11
X
grok-4.20-multi-agent-beta-0309 Xai
84.4
604
2M
¥14.4 / ¥43.2Input/Output
12
O
gpt-5.5-high Openai
82.8
254
1.05M
¥36 / ¥216Input/Output
13
O
gpt-5.4-high Openai
81.3
317
1.05M
¥18 / ¥108Input/Output
14
A
claude-sonnet-4-6 Anthropic
79.7
562
1M
¥21.6 / ¥108Input/Output
15
G
gemma-4-31b Google
78.1
1.1K
262K
¥3.24 / ¥7.2Input/Output
16
G
gemini-2.5-pro Google
76.6
3.4K
1.05M
¥9 / ¥72Input/Output
17
A
qwen3.5-397b-a17b Alibaba
75.0
726
262K
¥3.1 / ¥18.6Input/Output
18
O
gpt-5.5 Openai
73.4
278
1.05M
¥36 / ¥216Input/Output
19
M
kimi-k2.5-thinking Moonshot
71.9
857
262K
¥4.32 / ¥21.6Input/Output
20
O
chatgpt-4o-latest-20250326 Openai
70.3
1.2K
128K
¥18 / ¥72Input/Output
21
M
kimi-k2.6 Moonshot
68.8
299
262K
¥6.84 / ¥28.8Input/Output
22
G
gemini-2.5-flash-preview-09-2025 Google
67.2
294
1M
¥2.16 / ¥18Input/Output
23
Z
glm-5v-turbo Zai
65.6
466
200K
¥0 / ¥0Input/Output
24
O
gpt-5.4 Openai
64.1
340
1.05M
¥18 / ¥108Input/Output
25
B
dola-seed-2.0-pro Bytedance
62.5
430
-
-
26
O
gpt-5.1 Openai
60.9
696
400K
¥9 / ¥72Input/Output
27
O
gpt-5.1-high Openai
59.4
635
400K
¥9 / ¥72Input/Output
28
X
grok-4.3 Xai
57.8
244
1M
¥9 / ¥18Input/Output
29
G
gemma-4-26b-a4b Google
56.3
697
262K
¥0.94 / ¥2.88Input/Output
30
G
gemini-2.5-flash Google
54.7
2.6K
1.05M
¥2.16 / ¥18Input/Output
31
M
kimi-k2.5-instant Moonshot
53.1
247
262K
¥4.32 / ¥21.6Input/Output
32
A
claude-opus-4-20250514 Anthropic
51.6
106
200K
¥108 / ¥540Input/Output
33
G
gemini-3.1-flash-lite-preview Google
50.0
976
1.05M
¥1.8 / ¥10.8Input/Output
34
MI
mimo-v2.5 Xiaomi
48.4
391
1.05M
¥2.88 / ¥14.4Input/Output
35
O
gpt-5.2-chat-latest-20260210 Openai
46.9
761
400K
¥12.6 / ¥101Input/Output
36
X
grok-4-1-fast-reasoning Xai
45.3
675
2M
¥1.44 / ¥3.6Input/Output
37
X
grok-4-0709 Xai
43.8
1.3K
256K
¥21.6 / ¥108Input/Output
38
O
gpt-5.4-mini-high Openai
42.2
589
400K
¥5.4 / ¥32.4Input/Output
39
B
ernie-5.0-preview-1220 Baidu
40.6
381
128K
¥7.92 / ¥14.4Input/Output
40
A
qwen3.5-27b Alibaba
39.1
579
262K
¥2.16 / ¥17.3Input/Output
41
A
qwen3-vl-235b-a22b-instruct Alibaba
37.5
819
128K
¥2.16 / ¥8.64Input/Output
42
A
qwen3.5-122b-a10b Alibaba
35.9
632
262K
¥2.88 / ¥23Input/Output
43
O
gpt-5.2 Openai
34.4
947
400K
¥12.6 / ¥101Input/Output
44
O
gpt-5-chat Openai
32.8
1.3K
400K
¥9 / ¥72Input/Output
45
O
gpt-5.2-high Openai
31.3
958
400K
¥12.6 / ¥101Input/Output
46
O
gpt-4.1-2025-04-14 Openai
29.7
1.3K
1.05M
¥14.4 / ¥57.6Input/Output
47
O
o3-2025-04-16 Openai
28.1
1.6K
200K
¥14.4 / ¥57.6Input/Output
48
MA
mistral-medium-2508 Mistral
26.6
1.5K
262K
¥2.88 / ¥14.4Input/Output
49
O
gpt-5-high Openai
25.0
1.4K
400K
¥9 / ¥72Input/Output
50
G
gemini-2.5-flash-lite-preview-09-2025-no-thinking Google
23.4
316
1.05M
¥0.72 / ¥2.88Input/Output
51
MA
mistral-small-2506 Mistral
21.9
461
262K
¥2.88 / ¥14.4Input/Output
52
O
gpt-5-mini-high Openai
20.3
1K
400K
¥1.8 / ¥14.4Input/Output
53
O
gpt-4.1-mini-2025-04-14 Openai
18.8
1.2K
1.05M
¥2.88 / ¥11.5Input/Output
54
G
gemini-2.5-flash-lite-preview-06-17-thinking Google
17.2
1.2K
65.5K
¥0.72 / ¥2.88Input/Output
55
G
gemma-3-27b-it Google
15.6
630
128K
¥2.15 / ¥2.15Input/Output
56
MI
mimo-v2-omni Xiaomi
14.1
473
262K
¥2.88 / ¥14.4Input/Output
57
G
gemini-2.0-flash-001 Google
12.5
247
1.05M
¥1.08 / ¥4.32Input/Output
58
O
o4-mini-2025-04-16 Openai
10.9
1.3K
200K
¥7.92 / ¥31.7Input/Output
59
O
gpt-5.4-nano-high Openai
9.4
553
400K
¥1.44 / ¥9Input/Output
60
TE
hunyuan-vision-1.5-thinking Tencent
7.8
191
-
-
61
MA
mistral-medium-2505 Mistral
6.3
487
262K
¥2.88 / ¥14.4Input/Output
62
Z
glm-4.6v Zai
4.7
195
128K
¥2.16 / ¥6.48Input/Output
63
M
llama-4-scout-17b-16e-instruct Meta
3.1
234
128K
¥1.44 / ¥5.62Input/Output
64
MA
mistral-small-3.1-24b-instruct-2503 Mistral
1.6
782
262K
¥2.88 / ¥14.4Input/Output
65
M
llama-4-maverick-17b-128e-instruct Meta
0.0
212
1M
¥1.8 / ¥6.26Input/Output
Top model analysisgemini-3-pro why it ranks first
gemini-3-pro ranks first with a percent score of 100.0 and 1K samples. Use it as the first option for this leaderboard, then compare price, context and availability.
How to chooseDo not only look at rank #1
Start with the leaderboard closest to your task. Compare the top models by score and sample size, then check price, context length, open or closed access, and provider availability.
Related leaderboardsCompare adjacent capabilities