Chat · Text · Entertainment, Sports & Media Leaderboard

Ranking for Text / Entertainment, Sports & Media, based on public preference data.

Selection guide

Entertainment, Sports & Media model ranking guide

Models: 358
Published: 2026/05/27
Source: Arena public preference evaluation
Original leaderboard: Text / Industry Entertainment And Sports And Media
Leaderboard dataset: LMArena latest parquet
Each entry below spans up to seven lines, in this order: rank, model, provider, percent score, sample count, context length, and price in ¥ (input / output). A dash marks a value the page does not list.
1
claude-opus-4-6-thinking
Anthropic
100.0
6.7K
1M
¥36 / ¥180Input/Output
2
claude-opus-4-6
Anthropic
99.7
7K
1M
¥36 / ¥180Input/Output
3
claude-opus-4-7-thinking
Anthropic
99.4
4.1K
1M
¥36 / ¥180Input/Output
4
claude-opus-4-7
Anthropic
99.2
4.4K
1M
¥36 / ¥180Input/Output
5
gemini-3.1-pro-preview
Google
98.9
8.5K
1.05M
¥14.4 / ¥86.4Input/Output
6
gemini-3-pro
Google
98.6
7.7K
1.05M
¥14.4 / ¥86.4Input/Output
7
gemini-3.5-flash
Google
98.3
1.8K
1.05M
¥10.8 / ¥64.8Input/Output
8
qwen3.5-max-preview
Alibaba
98.0
3.8K
-
-
9
glm-5.1
Zai
97.8
2.6K
200K
¥0 / ¥0Input/Output
10
gemini-3-flash
Google
97.5
5.6K
1.05M
¥3.6 / ¥21.6Input/Output
11
gpt-5.5-high
Openai
97.2
3.5K
1.05M
¥36 / ¥216Input/Output
12
muse-spark
Meta
96.9
2.2K
-
-
13
qwen3.7-max-preview
Alibaba
96.6
691
1M
¥18 / ¥54Input/Output
14
gpt-5.4-high
Openai
96.4
5.4K
1.05M
¥18 / ¥108Input/Output
15
mimo-v2.5-pro
Xiaomi
96.1
3.1K
1.05M
¥7.2 / ¥21.6Input/Output
16
gpt-5.5
Openai
95.8
3.6K
1.05M
¥36 / ¥216Input/Output
17
gemini-2.5-pro
Google
95.5
22.5K
1.05M
¥9 / ¥72Input/Output
18
ernie-5.1
Baidu
95.2
2.8K
119K
¥5.4 / ¥21.6Input/Output
19
claude-sonnet-4-6
Anthropic
95.0
5.3K
1M
¥21.6 / ¥108Input/Output
20
claude-opus-4-5-20251101
Anthropic
94.7
12.5K
200K
¥36 / ¥180Input/Output
21
gpt-5.4
Openai
94.4
5.8K
1.05M
¥18 / ¥108Input/Output
22
deepseek-v4-pro
Deepseek
94.1
3.3K
1M
¥3.13 / ¥6.26Input/Output
23
grok-4.20-beta-0309-reasoning
Xai
93.8
5.8K
2M
¥14.4 / ¥43.2Input/Output
24
kimi-k2.6
Moonshot
93.6
2.9K
262K
¥6.84 / ¥28.8Input/Output
25
claude-sonnet-4-5-20250929
Anthropic
93.3
14.2K
200K
¥21.6 / ¥108Input/Output
26
gemini-3-flash (thinking-minimal)
Google
93.0
10.3K
1.05M
¥3.6 / ¥21.6Input/Output
27
claude-opus-4-5-20251101-thinking-32k
Anthropic
92.7
6.8K
200K
¥108 / ¥540Input/Output
28
glm-5
Zai
92.4
4.1K
205K
¥7.2 / ¥23Input/Output
29
grok-4.20-multi-agent-beta-0309
Xai
92.2
5.6K
2M
¥14.4 / ¥43.2Input/Output
30
qwen3.6-max-preview
Alibaba
91.9
826
246K
¥9.5 / ¥56.9Input/Output
31
kimi-k2.5-thinking
Moonshot
91.6
6.7K
262K
¥4.32 / ¥21.6Input/Output
32
grok-4.20-beta1
Xai
91.3
4.6K
2M
¥14.4 / ¥43.2Input/Output
33
deepseek-v4-pro-thinking
Deepseek
91.0
3.2K
1M
¥3.13 / ¥6.26Input/Output
34
claude-sonnet-4-5-20250929-thinking-32k
Anthropic
90.8
14.7K
200K
¥21.6 / ¥108Input/Output
35
grok-3-preview-02-24
Xai
90.5
5.7K
1M
¥9 / ¥18Input/Output
36
dola-seed-2.0-pro
Bytedance
90.2
7.2K
-
-
37
ernie-5.0-0110
Baidu
89.9
6.3K
128K
¥7.92 / ¥14.4Input/Output
38
mimo-v2-pro
Xiaomi
89.6
4.3K
1.05M
¥7.2 / ¥21.6Input/Output
39
gpt-5.1-high
Openai
89.4
7.3K
400K
¥9 / ¥72Input/Output
40
chatgpt-4o-latest-20250326
Openai
89.1
14.8K
128K
¥18 / ¥72Input/Output
41
mimo-v2.5
Xiaomi
88.8
3.2K
1.05M
¥2.88 / ¥14.4Input/Output
42
gpt-5.5-instant
Openai
88.5
5.3K
400K
¥9 / ¥72Input/Output
43
grok-4.1-thinking
Xai
88.2
12K
200K
¥14.4 / ¥72Input/Output
44
glm-4.6
Zai
88.0
6.6K
205K
¥4.32 / ¥15.8Input/Output
45
ernie-5.0-preview-1203
Baidu
87.7
1.9K
128K
¥7.92 / ¥14.4Input/Output
46
gemma-4-26b-a4b
Google
87.4
1.1K
262K
¥0.94 / ¥2.88Input/Output
47
qwen3.5-397b-a17b
Alibaba
87.1
6.1K
262K
¥3.1 / ¥18.6Input/Output
48
grok-4.1
Xai
86.8
12.1K
200K
¥14.4 / ¥72Input/Output
49
glm-4.5
Zai
86.6
4.3K
131K
¥4.32 / ¥15.8Input/Output
50
deepseek-v3.2-exp
Deepseek
86.3
2.2K
128K
¥0 / ¥0Input/Output
51
gemma-4-31b
Google
86.0
1K
262K
¥3.24 / ¥7.2Input/Output
52
qwen3-max-preview
Alibaba
85.7
5K
262K
¥6.2 / ¥24.8Input/Output
53
ernie-5.0-preview-1022
Baidu
85.4
910
128K
¥7.92 / ¥14.4Input/Output
54
glm-4.7
Zai
85.2
2.4K
205K
¥0 / ¥0Input/Output
55
qwen3.6-plus
Alibaba
84.9
3.5K
1M
¥3.6 / ¥21.6Input/Output
56
deepseek-r1-0528
Deepseek
84.6
3.2K
164K
¥3.6 / ¥15.5Input/Output
57
amazon-nova-experimental-chat-26-02-10
Amazon
84.3
541
-
-
58
gpt-4.5-preview-2025-02-27
Openai
84.0
2.6K
8.19K
¥216 / ¥432Input/Output
59
gpt-5.2-chat-latest-20260210
Openai
83.8
6.2K
400K
¥12.6 / ¥101Input/Output
60
claude-opus-4-1-20250805
Anthropic
83.5
14.1K
200K
¥108 / ¥540Input/Output
61
deepseek-v3.1-thinking
Deepseek
83.2
2.2K
128K
¥1.44 / ¥5.04Input/Output
62
gpt-5.1
Openai
82.9
8K
400K
¥9 / ¥72Input/Output
63
claude-opus-4-1-20250805-thinking-16k
Anthropic
82.6
9K
200K
¥108 / ¥540Input/Output
64
gemini-2.5-flash
Google
82.4
22.3K
1.05M
¥2.16 / ¥18Input/Output
65
deepseek-v4-flash
Deepseek
82.1
3.2K
1M
¥1.01 / ¥2.02Input/Output
66
deepseek-v3.2
Deepseek
81.8
8.3K
128K
¥2.09 / ¥3.1Input/Output
67
longcat-flash-chat-2602-exp
Meituan
81.5
4.5K
128K
¥1.08 / ¥10.8Input/Output
68
deepseek-v4-flash-thinking
Deepseek
81.2
3.3K
1M
¥1.01 / ¥2.02Input/Output
69
grok-4-0709
Xai
81.0
7.4K
256K
¥21.6 / ¥108Input/Output
70
kimi-k2.5-instant
Moonshot
80.7
1.4K
262K
¥4.32 / ¥21.6Input/Output
71
mistral-medium-2508
Mistral
80.4
17.1K
262K
¥2.88 / ¥14.4Input/Output
72
mistral-large-3
Mistral
80.1
7.9K
262K
¥3.6 / ¥10.8Input/Output
73
deepseek-v3.2-exp-thinking
Deepseek
79.8
1.7K
128K
¥0 / ¥0Input/Output
74
deepseek-v3.1-terminus-thinking
Deepseek
79.6
617
128K
¥1.8 / ¥5.04Input/Output
75
hunyuan-vision-1.5-thinking
Tencent
79.3
375
-
-
76
deepseek-v3.2-thinking
Deepseek
79.0
7.3K
128K
¥2.09 / ¥3.1Input/Output
77
grok-4.3
Xai
78.7
3.4K
1M
¥9 / ¥18Input/Output
78
gemini-2.5-flash-preview-09-2025
Google
78.4
6K
1M
¥2.16 / ¥18Input/Output
79
gemini-3.1-flash-lite-preview
Google
78.2
6.8K
1.05M
¥1.8 / ¥10.8Input/Output
80
grok-4-fast-chat
Xai
77.9
1.3K
2M
¥1.44 / ¥3.6Input/Output
81
grok-4-1-fast-reasoning
Xai
77.6
10.3K
2M
¥1.44 / ¥3.6Input/Output
82
deepseek-v3.1-terminus
Deepseek
77.3
636
128K
¥1.8 / ¥5.04Input/Output
83
deepseek-v3.1
Deepseek
77.0
2.8K
128K
¥1.44 / ¥5.04Input/Output
84
gpt-5.2
Openai
76.8
8.8K
400K
¥12.6 / ¥101Input/Output
85
qwen3-max-2025-09-23
Alibaba
76.5
1.7K
258K
¥6.19 / ¥24.7Input/Output
86
gpt-5-high
Openai
76.2
5.8K
400K
¥9 / ¥72Input/Output
87
kimi-k2-thinking-turbo
Moonshot
75.9
11.1K
262K
¥17.3 / ¥72Input/Output
88
gpt-5.4-mini-high
Openai
75.6
5.2K
400K
¥5.4 / ¥32.4Input/Output
89
mimo-v2-flash (non-thinking)
Xiaomi
75.4
8.4K
262K
¥0.72 / ¥2.16Input/Output
90
qwen3-235b-a22b-instruct-2507
Alibaba
75.1
17.3K
128K
¥2.09 / ¥8.23Input/Output
91
gpt-5.2-high
Openai
74.8
8.6K
400K
¥12.6 / ¥101Input/Output
92
o3-2025-04-16
Openai
74.5
10.6K
200K
¥14.4 / ¥57.6Input/Output
93
qwen3-vl-235b-a22b-instruct
Alibaba
74.2
1.9K
128K
¥2.16 / ¥8.64Input/Output
94
qwen3-235b-a22b-thinking-2507
Alibaba
73.9
1.5K
131K
¥2.07 / ¥8.26Input/Output
95
qwen3-vl-235b-a22b-thinking
Alibaba
73.7
1.4K
131K
¥2.06 / ¥8.26Input/Output
96
mimo-v2-omni
Xiaomi
73.4
620
262K
¥2.88 / ¥14.4Input/Output
97
claude-opus-4-20250514-thinking-16k
Anthropic
73.1
6.4K
200K
¥108 / ¥540Input/Output
98
qwen3.5-122b-a10b
Alibaba
72.8
5.1K
262K
¥2.88 / ¥23Input/Output
99
grok-4-fast-reasoning
Xai
72.5
3.3K
2M
¥1.44 / ¥3.6Input/Output
100
amazon-nova-experimental-chat-11-10
Amazon
72.3
4.5K
-
-
101
gpt-5-chat
Openai
72.0
5.6K
400K
¥9 / ¥72Input/Output
102
hunyuan-hy3-preview
Tencent
71.7
1.1K
256K
¥0 / ¥0Input/Output
103
gpt-4.1-2025-04-14
Openai
71.4
9.1K
1.05M
¥14.4 / ¥57.6Input/Output
104
claude-haiku-4-5-20251001
Anthropic
71.1
14.6K
200K
¥7.2 / ¥36Input/Output
105
minimax-m2.1-preview
Minimax
70.9
3.2K
205K
¥0 / ¥0Input/Output
106
longcat-flash-chat
Meituan
70.6
2K
128K
¥1.08 / ¥10.8Input/Output
107
qwen3.5-27b
Alibaba
70.3
4.7K
262K
¥2.16 / ¥17.3Input/Output
108
o1-2024-12-17
Openai
70.0
5.2K
128K
¥108 / ¥432Input/Output
109
step-3.5-flash
Stepfun
69.7
6.3K
256K
¥0.69 / ¥2.07Input/Output
110
deepseek-v3-0324
Deepseek
69.5
8.1K
75K
¥1.44 / ¥5.76Input/Output
111
gpt-5.3-chat-latest
Openai
69.2
5.9K
128K
¥12.6 / ¥101Input/Output
112
mimo-v2-flash (thinking)
Xiaomi
68.9
2K
262K
¥0.72 / ¥2.16Input/Output
113
hunyuan-t1-20250711
Tencent
68.6
859
131K
¥0 / ¥0Input/Output
114
gemini-2.5-flash-lite-preview-06-17-thinking
Google
68.3
5.9K
65.5K
¥0.72 / ¥2.88Input/Output
115
minimax-m2.7
Minimax
68.1
4.3K
205K
¥0 / ¥0Input/Output
116
qwen3-next-80b-a3b-instruct
Alibaba
67.8
4.2K
131K
¥1.04 / ¥4.13Input/Output
117
qwen3-235b-a22b-no-thinking
Alibaba
67.5
6.7K
131K
¥2.07 / ¥8.26Input/Output
118
claude-opus-4-20250514
Anthropic
67.2
7.6K
200K
¥108 / ¥540Input/Output
119
amazon-nova-experimental-chat-26-01-10
Amazon
66.9
606
-
-
120
qwen3.5-35b-a3b
Alibaba
66.7
5.3K
262K
¥1.8 / ¥14.4Input/Output
121
deepseek-r1
Deepseek
66.4
3.6K
164K
¥5.04 / ¥18Input/Output
122
amazon-nova-experimental-chat-12-10
Amazon
66.1
728
-
-
123
qwen3.5-flash
Alibaba
65.8
5.4K
1M
¥1.24 / ¥12.4Input/Output
124
glm-4.6v
Zai
65.5
513
128K
¥2.16 / ¥6.48Input/Output
125
gemini-2.5-flash-lite-preview-09-2025-no-thinking
Google
65.3
8.6K
1.05M
¥0.72 / ¥2.88Input/Output
126
kimi-k2-0905-preview
Moonshot
65.0
2.1K
262K
¥4.32 / ¥18Input/Output
127
gpt-5-mini-high
Openai
64.7
4.9K
400K
¥1.8 / ¥14.4Input/Output
128
amazon-nova-experimental-chat-10-20
Amazon
64.4
2K
-
-
129
glm-4.5-air
Zai
64.1
5.7K
131K
¥0 / ¥0Input/Output
130
nvidia-nemotron-3-super-120b-a12b
Nvidia
63.9
1.2K
262K
¥1.44 / ¥5.76Input/Output
131
qwen3-30b-a3b-instruct-2507
Alibaba
63.6
4.1K
262K
¥2.16 / ¥3.6Input/Output
132
mistral-medium-2505
Mistral
63.3
5.6K
262K
¥2.88 / ¥14.4Input/Output
133
gemini-2.0-flash-001
Google
63.0
7.8K
1.05M
¥1.08 / ¥4.32Input/Output
134
hunyuan-turbos-20250416
Tencent
62.7
1.9K
131K
¥0 / ¥0Input/Output
135
claude-sonnet-4-20250514-thinking-32k
Anthropic
62.5
6.1K
200K
¥21.6 / ¥108Input/Output
136
grok-3-mini-high
Xai
62.2
3K
128K
¥0 / ¥0Input/Output
137
kimi-k2-0711-preview
Moonshot
61.9
4.9K
131K
¥4.32 / ¥18Input/Output
138
o1-preview
Openai
61.6
4.8K
128K
¥108 / ¥432Input/Output
139
step-3
Stepfun
61.3
1.1K
65.5K
¥1.8 / ¥4.68Input/Output
140
minimax-m2.5
Minimax
61.1
7K
205K
¥0 / ¥0Input/Output
141
grok-3-mini-beta
Xai
60.8
3.8K
1M
¥9 / ¥18Input/Output
142
qwen2.5-max
Alibaba
60.5
6K
32K
¥11.5 / ¥46Input/Output
143
gpt-5.4-nano-high
Openai
60.2
5K
400K
¥1.44 / ¥9Input/Output
144
deepseek-v3
Deepseek
59.9
4.2K
128K
¥0 / ¥0Input/Output
145
gemma-3-27b-it
Google
59.7
8.1K
128K
¥2.15 / ¥2.15Input/Output
146
claude-sonnet-4-20250514
Anthropic
59.4
6.9K
200K
¥21.6 / ¥108Input/Output
147
qwen3-coder-480b-a35b-instruct
Alibaba
59.1
4.5K
262K
¥6.2 / ¥24.8Input/Output
148
mercury-2
Inception Ai
58.8
659
128K
¥1.8 / ¥5.4Input/Output
149
qwen3-235b-a22b
Alibaba
58.5
4.7K
131K
¥2.07 / ¥8.26Input/Output
150
trinity-large-preview
-
58.3
5.2K
262K
¥1.8 / ¥6.48Input/Output
151
qwen3-next-80b-a3b-thinking
Alibaba
58.0
2.5K
131K
¥1.04 / ¥10.3Input/Output
152
claude-3-7-sonnet-20250219-thinking-32k
Anthropic
57.7
6.6K
-
-
153
o4-mini-2025-04-16
Openai
57.4
8K
200K
¥7.92 / ¥31.7Input/Output
154
amazon-nova-experimental-chat-10-09
Amazon
57.1
484
-
-
155
glm-4.7-flash
Zai
56.9
2.1K
200K
¥0 / ¥0Input/Output
156
nova-2-lite
Amazon
56.6
2.2K
128K
¥2.38 / ¥19.8Input/Output
157
intellect-3
-
56.3
1.1K
131K
¥1.44 / ¥7.92Input/Output
158
command-a-03-2025
Cohere
56.0
10.1K
256K
¥18 / ¥72Input/Output
159
gemini-1.5-pro-002
Google
55.7
9.3K
-
-
160
mistral-small-2506
Mistral
55.5
3.2K
262K
¥2.88 / ¥14.4Input/Output
161
glm-4.5v
Zai
55.2
882
64K
¥4.32 / ¥13Input/Output
162
trinity-large-thinking
-
54.9
4.8K
262K
¥1.8 / ¥6.48Input/Output
163
minimax-m1
Minimax
54.6
6.3K
1M
¥0.95 / ¥9.03Input/Output
164
minimax-m2
Minimax
54.3
1.2K
197K
¥0 / ¥0Input/Output
165
gpt-4.1-mini-2025-04-14
Openai
54.1
6.8K
1.05M
¥2.88 / ¥11.5Input/Output
166
claude-3-7-sonnet-20250219
Anthropic
53.8
7.4K
200K
¥21.6 / ¥108Input/Output
167
nvidia-nemotron-3-nano-30b-a3b-bf16
Nvidia
53.5
2.9K
131K
¥0 / ¥0Input/Output
168
gemma-3-12b-it
Google
53.2
697
128K
¥1.96 / ¥1.96Input/Output
169
gemini-2.0-flash-lite-preview-02-05
Google
52.9
4.5K
1.05M
¥0.54 / ¥2.16Input/Output
170
step-1o-turbo-202506
Stepfun
52.7
1.5K
-
-
171
ling-flash-2.0
Ant Group
52.4
1.2K
131K
¥1.01 / ¥4.1Input/Output
172
gpt-oss-120b
Openai
52.1
5.5K
131K
¥1.08 / ¥4.32Input/Output
173
step-2-16k-exp-202412
Stepfun
51.8
873
16.4K
¥37.5 / ¥118Input/Output
174
glm-4-plus-0111
Zai
51.5
1.1K
128K
¥72 / ¥72Input/Output
175
gpt-4o-2024-05-13
Openai
51.3
17.6K
128K
¥36 / ¥108Input/Output
176
llama-3.3-nemotron-49b-super-v1
Nvidia
51.0
410
131K
¥0 / ¥0Input/Output
177
ring-flash-2.0
Ant Group
50.7
1.3K
131K
¥1.01 / ¥4.1Input/Output
178
o3-mini-high
Openai
50.4
3.2K
200K
¥7.92 / ¥31.7Input/Output
179
grok-2-2024-08-13
Xai
50.1
10.3K
1M
¥9 / ¥18Input/Output
180
nvidia-llama-3.3-nemotron-super-49b-v1.5
Nvidia
49.9
570
131K
¥2.88 / ¥2.88Input/Output
181
qwen3-32b
Alibaba
49.6
704
131K
¥2.07 / ¥8.26Input/Output
182
qwq-32b
Alibaba
49.3
4.5K
131K
¥2.07 / ¥6.2Input/Output
183
claude-3-5-sonnet-20241022
Anthropic
49.0
14.3K
200K
¥21.6 / ¥108Input/Output
184
llama-3.1-nemotron-ultra-253b-v1
Nvidia
48.7
532
128K
¥4.32 / ¥13Input/Output
185
deepseek-v2.5-1210
Deepseek
48.5
1.2K
1M
¥1.01 / ¥2.02Input/Output
186
o3-mini
Openai
48.2
10.3K
200K
¥7.92 / ¥31.7Input/Output
187
yi-lightning
-
47.9
4K
12K
¥1.44 / ¥1.44Input/Output
188
olmo-3.1-32b-instruct
Allenai
47.6
2.1K
200K
¥14.4 / ¥57.6Input/Output
189
gemini-advanced-0514
Google
47.3
7.5K
-
-
190
qwen-plus-0125
Alibaba
47.1
1.1K
1M
¥0.83 / ¥2.07Input/Output
191
gpt-5-nano-high
Openai
46.8
1.4K
400K
¥0.36 / ¥2.88Input/Output
192
gpt-4o-2024-08-06
Openai
46.5
7K
128K
¥18 / ¥72Input/Output
193
hunyuan-turbo-0110
Tencent
46.2
449
-
-
194
gemma-3n-e4b-it
Google
45.9
3.9K
128K
¥0 / ¥0Input/Output
195
hunyuan-large-2025-02-10
Tencent
45.7
646
-
-
196
qwen3-30b-a3b
Alibaba
45.4
4.6K
128K
¥0.79 / ¥7.78Input/Output
197
gpt-4-turbo-2024-04-09
Openai
45.1
14.8K
128K
¥72 / ¥216Input/Output
198
o1-mini
Openai
44.8
8.6K
128K
¥7.92 / ¥31.7Input/Output
199
olmo-3-32b-think
Allenai
44.5
1.1K
128K
¥2.16 / ¥3.24Input/Output
200
gpt-4o-mini-2024-07-18
Openai
44.3
11.3K
128K
¥1.08 / ¥4.32Input/Output
201
hunyuan-turbos-20250226
Tencent
44.0
448
131K
¥0 / ¥0Input/Output
202
gemini-1.5-flash-002
Google
43.7
5.7K
2M
¥0.54 / ¥2.2Input/Output
203
gemini-1.5-pro-001
Google
43.4
12.1K
-
-
204
mistral-small-3.1-24b-instruct-2503
Mistral
43.1
5.9K
262K
¥2.88 / ¥14.4Input/Output
205
llama-4-maverick-17b-128e-instruct
Meta
42.9
7K
1M
¥1.8 / ¥6.26Input/Output
206
llama-3.1-405b-instruct-fp8
Meta
42.6
9.5K
128K
¥0 / ¥0Input/Output
207
glm-4-plus
Zai
42.3
3.9K
128K
¥54 / ¥54Input/Output
208
llama-3.1-nemotron-70b-instruct
Nvidia
42.0
961
128K
¥0 / ¥0Input/Output
209
granite-4.1-8b
Ibm
41.7
743
131K
¥0.36 / ¥0.72Input/Output
210
qwen2.5-plus-1127
Alibaba
41.5
1.9K
-
-
211
llama-4-scout-17b-16e-instruct
Meta
41.2
5.3K
128K
¥1.44 / ¥5.62Input/Output
212
magistral-medium-2506
Mistral
40.9
2K
128K
¥14.4 / ¥36Input/Output
213
gpt-4.1-nano-2025-04-14
Openai
40.6
1.1K
1.05M
¥14.4 / ¥57.6Input/Output
214
gpt-4-1106-preview
Openai
40.3
15.2K
8.19K
¥216 / ¥432Input/Output
215
llama-3.3-70b-instruct
Meta
40.1
9.8K
128K
¥0 / ¥0Input/Output
216
mistral-large-2407
Mistral
39.8
7K
131K
¥14.4 / ¥43.2Input/Output
217
llama-3.1-405b-instruct-bf16
Meta
39.5
7.2K
128K
¥0 / ¥0Input/Output
218
qwen-max-0919
Alibaba
39.2
2.4K
131K
¥2.48 / ¥9.91Input/Output
219
llama-3.1-tulu-3-70b
Allenai
38.9
551
-
-
220
claude-3-5-sonnet-20240620
Anthropic
38.7
13.2K
200K
¥21.6 / ¥108Input/Output
221
gpt-4-0125-preview
Openai
38.4
14K
8.19K
¥216 / ¥432Input/Output
222
olmo-3.1-32b-think
Allenai
38.1
1.7K
200K
¥14.4 / ¥57.6Input/Output
223
gemma-3-4b-it
Google
37.8
761
128K
¥1.44 / ¥1.44Input/Output
224
grok-2-mini-2024-08-13
Xai
37.5
8.6K
1M
¥9 / ¥18Input/Output
225
athene-70b-0725
-
37.3
3.2K
-
-
226
mistral-large-2411
Mistral
37.0
5.1K
128K
¥14.4 / ¥43.2Input/Output
227
deepseek-v2.5
Deepseek
36.7
3.7K
1M
¥1.01 / ¥2.02Input/Output
228
llama-3.1-70b-instruct
Meta
36.4
8.7K
131K
¥2.88 / ¥2.88Input/Output
229
athene-v2-chat
-
36.1
4.4K
-
-
230
olmo-2-0325-32b-instruct
Allenai
35.9
606
-
-
231
claude-3-5-haiku-20241022
Anthropic
35.6
12.5K
200K
¥5.76 / ¥28.8Input/Output
232
command-r-plus-08-2024
Cohere
35.3
1.5K
128K
¥18 / ¥72Input/Output
233
hunyuan-standard-2025-02-10
Tencent
35.0
675
-
-
234
hunyuan-large-vision
Tencent
34.7
885
-
-
235
claude-3-opus-20240229
Anthropic
34.5
29.4K
200K
¥108 / ¥540Input/Output
236
jamba-1.5-large
-
34.2
1.4K
256K
¥0 / ¥0Input/Output
237
gpt-oss-20b
Openai
33.9
1.7K
131K
¥0.32 / ¥1.3Input/Output
238
gemma-2-9b-it-simpo
-
33.6
1.7K
8.19K
¥1.44 / ¥1.44Input/Output
239
ibm-granite-h-small
Ibm
33.3
978
-
-
240
gemma-2-27b-it
Google
33.1
12.3K
8.19K
¥0.58 / ¥0.58Input/Output
241
qwen2.5-72b-instruct
Alibaba
32.8
6.2K
131K
¥4.13 / ¥12.4Input/Output
242
amazon-nova-pro-v1.0
Amazon
32.5
4.5K
300K
¥5.76 / ¥23Input/Output
243
nemotron-4-340b-instruct
Nvidia
32.2
3.3K
-
-
244
gemini-1.5-flash-001
Google
31.9
9.8K
2M
¥0.54 / ¥2.2Input/Output
245
reka-core-20240904
-
31.7
1.1K
-
-
246
mercury
Inception Ai
31.4
367
128K
¥1.8 / ¥5.4Input/Output
247
llama-3-70b-instruct
Meta
31.1
23.2K
8.19K
¥3.67 / ¥5.33Input/Output
248
llama-3.1-nemotron-51b-instruct
Nvidia
30.8
551
128K
¥0 / ¥0Input/Output
249
command-r-plus
Cohere
30.5
11.3K
128K
¥18 / ¥72Input/Output
250
glm-4-0520
Zai
30.3
1.6K
128K
¥108 / ¥108Input/Output
251
gemini-1.5-flash-8b-001
Google
30.0
5.7K
2M
¥0.54 / ¥2.2Input/Output
252
gemma-2-9b-it
Google
29.7
9.1K
8.19K
¥1.44 / ¥1.44Input/Output
253
gpt-4-0314
Openai
29.4
8.1K
8.19K
¥216 / ¥432Input/Output
254
mistral-small-24b-instruct-2501
Mistral
29.1
2.7K
262K
¥2.88 / ¥14.4Input/Output
255
c4ai-aya-expanse-32b
Cohere
28.9
4.3K
-
-
256
gpt-4-0613
Openai
28.6
13.9K
8.19K
¥216 / ¥432Input/Output
257
reka-flash-20240904
-
28.3
1.1K
65.5K
¥0.72 / ¥1.44Input/Output
258
amazon-nova-lite-v1.0
Amazon
28.0
3.4K
300K
¥0.43 / ¥1.73Input/Output
259
claude-3-sonnet-20240229
Anthropic
27.7
16K
200K
¥21.6 / ¥108Input/Output
260
qwen2-72b-instruct
Alibaba
27.5
5.8K
131K
¥4.13 / ¥12.4Input/Output
261
amazon-nova-micro-v1.0
Amazon
27.2
3.3K
128K
¥0.25 / ¥1.01Input/Output
262
command-r-08-2024
Cohere
26.9
1.6K
128K
¥18 / ¥72Input/Output
263
phi-4
Microsoft
26.6
4.5K
128K
¥0.9 / ¥3.6Input/Output
264
llama-3.1-tulu-3-8b
Allenai
26.3
563
-
-
265
jamba-1.5-mini
-
26.1
1.5K
256K
¥0 / ¥0Input/Output
266
hunyuan-standard-256k
Tencent
25.8
379
-
-
267
claude-3-haiku-20240307
Anthropic
25.5
17.1K
200K
¥1.8 / ¥9Input/Output
268
mistral-large-2402
Mistral
25.2
9.2K
262K
¥2.88 / ¥14.4Input/Output
269
qwen2.5-coder-32b-instruct
Alibaba
24.9
929
131K
¥2.07 / ¥6.2Input/Output
270
c4ai-aya-expanse-8b
Cohere
24.6
1.8K
-
-
271
llama-3.1-8b-instruct
Meta
24.4
7.7K
131K
¥0.79 / ¥0.79Input/Output
272
command-r
Cohere
24.1
8K
128K
¥18 / ¥72Input/Output
273
ministral-8b-2410
Mistral
23.8
773
128K
¥0.72 / ¥0.72Input/Output
274
mixtral-8x22b-instruct-v0.1
Mistral
23.5
7.6K
64K
¥14.4 / ¥43.2Input/Output
275
qwen1.5-110b-chat
Alibaba
23.2
3.9K
-
-
276
mistral-medium
Mistral
23.0
5.4K
262K
¥2.88 / ¥14.4Input/Output
277
reka-flash-21b-20240226-online
-
22.7
2.1K
-
-
278
llama-3-8b-instruct
Meta
22.4
15.4K
8.19K
¥0.29 / ¥0.29Input/Output
279
zephyr-orpo-141b-A35b-v0.1
-
22.1
617
200K
¥108 / ¥432Input/Output
280
gemma-2-2b-it
Google
21.8
7.5K
128K
¥0 / ¥0Input/Output
281
qwen1.5-72b-chat
Alibaba
21.6
5.8K
-
-
282
deepseek-coder-v2
Deepseek
21.3
2.5K
1M
¥1.01 / ¥2.02Input/Output
283
yi-1.5-34b-chat
-
21.0
3.9K
-
-
284
tulu-2-dpo-70b
-
20.7
989
-
-
285
qwq-32b-preview
Alibaba
20.4
540
131K
¥2.07 / ¥6.2Input/Output
286
wizardlm-70b
Microsoft
20.2
1.5K
-
-
287
granite-3.1-8b-instruct
Ibm
19.9
537
-
-
288
yi-34b-chat
-
19.6
2.3K
-
-
289
mixtral-8x7b-instruct-v0.1
Mistral
19.3
11K
32K
¥5.04 / ¥5.04Input/Output
290
granite-3.1-2b-instruct
Ibm
19.0
567
-
-
291
internlm2_5-20b-chat
-
18.8
1.4K
-
-
292
vicuna-33b
-
18.5
3.6K
-
-
293
reka-flash-21b-20240226
-
18.2
3.5K
-
-
294
gemini-pro-dev-api
Google
17.9
2.9K
1.05M
¥14.4 / ¥86.4Input/Output
295
dbrx-instruct-preview
-
17.6
4.6K
-
-
296
openchat-3.5
-
17.4
1.3K
-
-
297
gpt-3.5-turbo-0125
Openai
17.1
9.7K
16.4K
¥3.6 / ¥10.8Input/Output
298
gemini-pro
Google
16.8
1K
1.05M
¥14.4 / ¥86.4Input/Output
299
starling-lm-7b-beta
-
16.5
2.2K
200K
¥5.4 / ¥18.7Input/Output
300
llama-3.2-3b-instruct
Meta
16.2
1.1K
131K
¥0.22 / ¥0.35Input/Output
301
starling-lm-7b-alpha
-
16.0
1.6K
200K
¥5.4 / ¥18.7Input/Output
302
phi-3-medium-4k-instruct
Microsoft
15.7
3.9K
4.1K
¥1.22 / ¥4.9Input/Output
303
qwen1.5-32b-chat
Alibaba
15.4
3.1K
-
-
304
wizardlm-13b
Microsoft
15.1
1.2K
-
-
305
openchat-3.5-0106
-
14.8
1.9K
-
-
306
openhermes-2.5-mistral-7b
-
14.6
799
1M
¥36 / ¥180Input/Output
307
guanaco-33b
-
14.3
443
200K
¥14.4 / ¥57.6Input/Output
308
llama2-70b-steerlm-chat
Nvidia
14.0
591
-
-
309
snowflake-arctic-instruct
-
13.7
4.7K
-
-
310
qwen1.5-14b-chat
Alibaba
13.4
2.5K
-
-
311
nous-hermes-2-mixtral-8x7b-dpo
-
13.2
593
1M
¥36 / ¥180Input/Output
312
llama-2-70b-chat
Meta
12.9
6.1K
-
-
313
zephyr-7b-beta
-
12.6
1.9K
-
-
314
deepseek-llm-67b-chat
Deepseek
12.3
802
1M
¥1.01 / ¥2.02Input/Output
315
granite-3.0-8b-instruct
Ibm
12.0
934
-
-
316
solar-10.7b-instruct-v1.0
-
11.8
654
128K
¥0 / ¥0Input/Output
317
falcon-180b-chat
-
11.5
229
-
-
318
mistral-7b-instruct-v0.2
Mistral
11.2
2.9K
262K
¥2.88 / ¥14.4Input/Output
319
phi-3-small-8k-instruct
Microsoft
10.9
2.6K
8.19K
¥1.08 / ¥4.32Input/Output
320
mpt-30b-chat
-
10.6
435
-
-
321
gemma-1.1-7b-it
Google
10.4
3.4K
-
-
322
dolphin-2.2.1-mistral-7b
-
10.1
282
262K
¥2.88 / ¥14.4Input/Output
323
zephyr-7b-alpha
-
9.8
329
-
-
324
llama-2-13b-chat
Meta
9.5
3.1K
-
-
325
granite-3.0-2b-instruct
Ibm
9.2
941
-
-
326
gpt-3.5-turbo-1106
Openai
9.0
2.6K
16.4K
¥7.2 / ¥14.4Input/Output
327
phi-3-mini-4k-instruct-june-2024
Microsoft
8.7
2K
4.1K
¥0.94 / ¥3.74Input/Output
328
vicuna-13b
-
8.4
3.2K
-
-
329
qwen1.5-7b-chat
Alibaba
8.1
708
-
-
330
llama-3.2-1b-instruct
Meta
7.8
1.2K
16.4K
¥0.07 / ¥0.08Input/Output
331
llama-2-7b-chat
Meta
7.6
2.2K
128K
¥4.03 / ¥48Input/Output
332
phi-3-mini-4k-instruct
Microsoft
7.3
3K
4.1K
¥0.94 / ¥3.74Input/Output
333
vicuna-7b
-
7.0
1.3K
-
-
334
codellama-34b-instruct
Meta
6.7
1.4K
-
-
335
stripedhyena-nous-7b
-
6.4
806
-
-
336
phi-3-mini-128k-instruct
Microsoft
6.2
2.9K
128K
¥0.94 / ¥3.74Input/Output
337
smollm2-1.7b-instruct
-
5.9
360
-
-
338
gemma-7b-it
Google
5.6
1.3K
-
-
339
mistral-7b-instruct
Mistral
5.3
1.5K
262K
¥2.88 / ¥14.4Input/Output
340
qwen-14b-chat
Alibaba
5.0
847
32.8K
¥1.04 / ¥3.1Input/Output
341
palm-2
Google
4.8
1.5K
-
-
342
olmo-7b-instruct
Allenai
4.5
876
-
-
343
gemma-1.1-2b-it
Google
4.2
1.5K
-
-
344
gemma-2b-it
Google
3.9
677
-
-
345
qwen1.5-4b-chat
Alibaba
3.6
1.1K
-
-
346
koala-13b
-
3.4
1.1K
-
-
347
gpt4all-13b-snoozy
-
3.1
253
1M
¥36 / ¥216Input/Output
348
alpaca-13b
-
2.8
935
-
-
349
chatglm3-6b
-
2.5
754
200K
¥5.4 / ¥18.7Input/Output
350
mpt-7b-chat
-
2.2
717
-
-
351
chatglm2-6b
-
2.0
483
200K
¥5.4 / ¥18.7Input/Output
352
RWKV-4-Raven-14B
-
1.7
834
-
-
353
oasst-pythia-12b
-
1.4
1.1K
-
-
354
fastchat-t5-3b
-
1.1
719
-
-
355
chatglm-6b
-
0.8
774
200K
¥5.4 / ¥18.7Input/Output
356
stablelm-tuned-alpha-7b
-
0.6
514
-
-
357
dolly-v2-12b
-
0.3
558
-
-
358
llama-13b
Meta
0.0
402
-
-
Top model analysis

Why claude-opus-4-6-thinking ranks first

claude-opus-4-6-thinking ranks first with a percent score of 100.0 across 6.7K samples. Treat it as the first candidate for this leaderboard, then compare price, context length, and availability before settling on it.
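The percent scores in the table appear to fall linearly with rank: 100.0 for rank 1 and 0.0 for rank 358. The sketch below checks that reading against a few published rows. It is an observation drawn from the numbers on this page, not a formula the page documents, and the function name `rank_percent_score` is hypothetical.

```python
def rank_percent_score(rank: int, total: int) -> float:
    """Linear rank percentile: rank 1 maps to 100.0, rank `total` maps to 0.0."""
    if total < 2 or not 1 <= rank <= total:
        raise ValueError("rank must be in 1..total and total must be at least 2")
    return round(100.0 * (total - rank) / (total - 1), 1)


TOTAL_MODELS = 358  # model count listed on this page

# (rank, published score) pairs copied from the table above
published = [(1, 100.0), (2, 99.7), (5, 98.9), (100, 72.3), (179, 50.1), (358, 0.0)]

for rank, score in published:
    computed = rank_percent_score(rank, TOTAL_MODELS)
    print(f"rank {rank}: published {score}, computed {computed}")
```

If this reading holds, the gap between two adjacent scores reflects only their difference in rank, not how close the two models are in user preference.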

How to choose

Do not look only at rank #1

Start with the leaderboard closest to your task. Compare the top models by score and sample size, then check price, context length, open or closed access, and provider availability.
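As a minimal sketch of that selection process, the snippet below filters a handful of rows copied from the table by score, sample size, output price, and context length. The `Entry` class, the `shortlist` function, and every threshold are hypothetical. Abbreviated values such as 6.7K and 1.05M are expanded to approximate integers.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Entry:
    rank: int
    model: str
    score: float
    samples: int                   # approximate, expanded from e.g. "6.7K"
    context_tokens: Optional[int]  # None where the page shows "-"
    price_in: Optional[float]      # ¥, input
    price_out: Optional[float]     # ¥, output


# A few rows copied from the table above
ENTRIES = [
    Entry(1, "claude-opus-4-6-thinking", 100.0, 6700, 1_000_000, 36.0, 180.0),
    Entry(5, "gemini-3.1-pro-preview", 98.9, 8500, 1_050_000, 14.4, 86.4),
    Entry(8, "qwen3.5-max-preview", 98.0, 3800, None, None, None),
    Entry(10, "gemini-3-flash", 97.5, 5600, 1_050_000, 3.6, 21.6),
    Entry(13, "qwen3.7-max-preview", 96.6, 691, 1_000_000, 18.0, 54.0),
    Entry(22, "deepseek-v4-pro", 94.1, 3300, 1_000_000, 3.13, 6.26),
]


def shortlist(entries: List[Entry], min_score: float, min_samples: int,
              max_price_out: float, min_context: int) -> List[Entry]:
    """Keep entries that meet every threshold, ordered by rank."""
    keep = []
    for e in entries:
        if e.price_out is None or e.context_tokens is None:
            continue  # price or context not listed, so it cannot be compared
        if (e.score >= min_score and e.samples >= min_samples
                and e.price_out <= max_price_out
                and e.context_tokens >= min_context):
            keep.append(e)
    return sorted(keep, key=lambda e: e.rank)


picks = shortlist(ENTRIES, min_score=90.0, min_samples=1000,
                  max_price_out=100.0, min_context=500_000)
for e in picks:
    print(e.rank, e.model, e.score, e.price_out)
```

With these example thresholds the rank 1 model drops out on output price and qwen3.7-max-preview drops out on sample count, which is the point of the advice above.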

FAQ

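Which metrics matter on the Entertainment, Sports & Media leaderboard?

Look mainly at rank, percent score, sample size, and source. The score gives a quick comparison of models within the same leaderboard, and the sample size indicates how stable the result is.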
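To illustrate why sample size bears on stability, the sketch below estimates a rough 95% margin of error for a preference win rate. It assumes each sample is an independent vote and uses the normal approximation to a binomial proportion. The page does not say how samples are counted or how scores are derived, so treat this as an analogy rather than the leaderboard's method. The function name `margin_of_error` is hypothetical.

```python
import math


def margin_of_error(samples: int, win_rate: float = 0.5, z: float = 1.96) -> float:
    """Approximate margin of error for a win rate estimated from `samples` votes."""
    if samples <= 0:
        raise ValueError("samples must be positive")
    if not 0.0 <= win_rate <= 1.0:
        raise ValueError("win_rate must be between 0 and 1")
    return z * math.sqrt(win_rate * (1.0 - win_rate) / samples)


# Sample counts copied from the table above (K values expanded approximately)
for model, samples in [("gemini-2.5-pro", 22500),
                       ("claude-opus-4-6-thinking", 6700),
                       ("falcon-180b-chat", 229)]:
    print(f"{model}: +/- {margin_of_error(samples):.3f}")
```

Under these assumptions the margin is below one percentage point at 22.5K samples and above six at 229, so positions of models with only a few hundred samples deserve more caution.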

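Why can't different leaderboards be merged directly into one overall score?

Different leaderboards use different tasks, samples, and evaluation criteria. By default, 模力榜 ranks models only within a single leaderboard, to avoid forcing abilities such as writing, coding, and image work into one number.

How should I choose a model for Entertainment, Sports & Media?

Start with the leaderboard closest to your task, then weigh price, context length, open versus closed source, and provider availability. A high rank does not mean a model suits every budget or deployment setup.

How often is the leaderboard updated?

The page shows the most recent successfully collected public leaderboard data. It currently prioritizes the LMArena leaderboard dataset and keeps the original link in the page's source section.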