RankModelProviderScore (0-100)SamplesContextPrice / 1M tokens
1
O
GPT 5.5 (High) Openai
100.0
24.6K
1.05M
¥36 / ¥216Input/Output
2
A
Claude Opus 4.7 (Thinking) Anthropic
94.1
24.5K
1M
¥36 / ¥180Input/Output
3
O
GPT 5.4 (High) Openai
88.2
24.4K
1.05M
¥18 / ¥108Input/Output
4
A
Claude Opus 4.6 Anthropic
82.4
24.7K
1M
¥36 / ¥180Input/Output
5
O
GPT 5.5 Openai
76.5
24.9K
1.05M
¥36 / ¥216Input/Output
6
A
Claude Opus 4.7 Anthropic
70.6
24.7K
1M
¥36 / ¥180Input/Output
7
A
Claude Sonnet 4.6 Anthropic
64.7
24.6K
1M
¥21.6 / ¥108Input/Output
8
Z
GLM 5.1 Zai
58.8
19.8K
200K
¥0 / ¥0Input/Output
9
G
Gemini 3.1 Pro Preview Google
52.9
24.5K
1.05M
¥14.4 / ¥86.4Input/Output
10
G
Gemini 3.5 Flash Google
47.1
17.7K
1.05M
¥10.8 / ¥64.8Input/Output
11
M
Kimi K2.6 Moonshot
41.2
21.3K
262K
¥6.84 / ¥28.8Input/Output
12
D
DeepSeek V4 Pro Deepseek
35.3
20K
1M
¥3.13 / ¥6.26Input/Output
13
A
Qwen 3.6 Plus Alibaba
29.4
19.5K
1M
¥3.6 / ¥21.6Input/Output
14
D
DeepSeek V4 Flash Deepseek
23.5
19.9K
1M
¥1.01 / ¥2.02Input/Output
15
M
Minimax M2.7 Minimax
17.6
20K
205K
¥0 / ¥0Input/Output
16
G
Gemini 3 Flash Google
11.8
24.5K
1.05M
¥3.6 / ¥21.6Input/Output
17
G
Gemma 4 31B Google
5.9
13.7K
262K
¥3.24 / ¥7.2Input/Output
18
X
Grok 4.3 Xai
0.0
23.7K
1M
¥9 / ¥18Input/Output
Top model analysisGPT 5.5 (High) why it ranks first
GPT 5.5 (High) ranks first with a percent score of 100.0 and 24.6K samples. Use it as the first option for this leaderboard, then compare price, context and availability.
How to chooseDo not only look at rank #1
Start with the leaderboard closest to your task. Compare the top models by score and sample size, then check price, context length, open or closed access, and provider availability.
Related leaderboardsCompare adjacent capabilities