Chat · Text · Writing, Literature & Language Leaderboard

Ranking for Text / Writing, Literature & Language, based on public preference data.

Models: 359
Published: 2026/05/27
Source: Arena public preference evaluation
Original leaderboard: Text / Industry Writing And Literature And Language
Leaderboard dataset: LMArena latest parquet

Each entry below spans seven lines, in this order: rank, model, provider, percent score, sample count, context length, and input/output price. A "-" marks a value the page does not list.

1
claude-opus-4-6-thinking
Anthropic
100.0
8.1K
1M
¥36 / ¥180Input/Output
2
claude-opus-4-6
Anthropic
99.7
8.4K
1M
¥36 / ¥180Input/Output
3
claude-opus-4-7-thinking
Anthropic
99.4
4.7K
1M
¥36 / ¥180Input/Output
4
claude-opus-4-7
Anthropic
99.2
5K
1M
¥36 / ¥180Input/Output
5
gemini-3.1-pro-preview
Google
98.9
10.2K
1.05M
¥14.4 / ¥86.4Input/Output
6
gemini-3-pro
Google
98.6
9.3K
1.05M
¥14.4 / ¥86.4Input/Output
7
gemini-3.5-flash
Google
98.3
2.1K
1.05M
¥10.8 / ¥64.8Input/Output
8
gpt-5.5-high
Openai
98.0
4K
1.05M
¥36 / ¥216Input/Output
9
qwen3.5-max-preview
Alibaba
97.8
4.6K
-
-
10
gpt-5.5
Openai
97.5
4.1K
1.05M
¥36 / ¥216Input/Output
11
gpt-5.4-high
Openai
97.2
6.7K
1.05M
¥18 / ¥108Input/Output
12
glm-5.1
Zai
96.9
3.2K
200K
¥0 / ¥0Input/Output
13
gemini-3-flash
Google
96.6
6.8K
1.05M
¥3.6 / ¥21.6Input/Output
14
qwen3.7-max-preview
Alibaba
96.4
866
1M
¥18 / ¥54Input/Output
15
gemini-2.5-pro
Google
96.1
27.3K
1.05M
¥9 / ¥72Input/Output
16
gpt-5.4
Openai
95.8
7K
1.05M
¥18 / ¥108Input/Output
17
muse-spark
Meta
95.5
2.8K
-
-
18
mimo-v2.5-pro
Xiaomi
95.3
3.7K
1.05M
¥7.2 / ¥21.6Input/Output
19
ernie-5.1
Baidu
95.0
3.3K
119K
¥5.4 / ¥21.6Input/Output
20
claude-opus-4-5-20251101
Anthropic
94.7
15.1K
200K
¥36 / ¥180Input/Output
21
qwen3.6-max-preview
Alibaba
94.4
989
246K
¥9.5 / ¥56.9Input/Output
22
deepseek-v4-pro
Deepseek
94.1
3.9K
1M
¥3.13 / ¥6.26Input/Output
23
claude-opus-4-5-20251101-thinking-32k
Anthropic
93.9
8.3K
200K
¥108 / ¥540Input/Output
24
claude-sonnet-4-6
Anthropic
93.6
6.3K
1M
¥21.6 / ¥108Input/Output
25
claude-sonnet-4-5-20250929
Anthropic
93.3
17.2K
200K
¥21.6 / ¥108Input/Output
26
grok-4.20-beta-0309-reasoning
Xai
93.0
6.9K
2M
¥14.4 / ¥43.2Input/Output
27
gemini-3-flash (thinking-minimal)
Google
92.7
12K
1.05M
¥3.6 / ¥21.6Input/Output
28
glm-5
Zai
92.5
5K
205K
¥7.2 / ¥23Input/Output
29
kimi-k2.6
Moonshot
92.2
3.6K
262K
¥6.84 / ¥28.8Input/Output
30
deepseek-v4-pro-thinking
Deepseek
91.9
3.7K
1M
¥3.13 / ¥6.26Input/Output
31
grok-4.20-multi-agent-beta-0309
Xai
91.6
6.7K
2M
¥14.4 / ¥43.2Input/Output
32
gemma-4-31b
Google
91.3
1.3K
262K
¥3.24 / ¥7.2Input/Output
33
claude-sonnet-4-5-20250929-thinking-32k
Anthropic
91.1
17.7K
200K
¥21.6 / ¥108Input/Output
34
grok-4.20-beta1
Xai
90.8
5.5K
2M
¥14.4 / ¥43.2Input/Output
35
gpt-5.1-high
Openai
90.5
9.1K
400K
¥9 / ¥72Input/Output
36
gpt-5.5-instant
Openai
90.2
6.1K
400K
¥9 / ¥72Input/Output
37
ernie-5.0-0110
Baidu
89.9
7.7K
128K
¥7.92 / ¥14.4Input/Output
38
kimi-k2.5-thinking
Moonshot
89.7
8.3K
262K
¥4.32 / ¥21.6Input/Output
39
mimo-v2-pro
Xiaomi
89.4
5.2K
1.05M
¥7.2 / ¥21.6Input/Output
40
qwen3.5-397b-a17b
Alibaba
89.1
7.3K
262K
¥3.1 / ¥18.6Input/Output
41
gpt-5.2-chat-latest-20260210
Openai
88.8
7.5K
400K
¥12.6 / ¥101Input/Output
42
qwen3.6-plus
Alibaba
88.5
4.1K
1M
¥3.6 / ¥21.6Input/Output
43
claude-opus-4-1-20250805
Anthropic
88.3
17.2K
200K
¥108 / ¥540Input/Output
44
claude-opus-4-1-20250805-thinking-16k
Anthropic
88.0
11K
200K
¥108 / ¥540Input/Output
45
glm-4.6
Zai
87.7
8K
205K
¥4.32 / ¥15.8Input/Output
46
chatgpt-4o-latest-20250326
Openai
87.4
18.6K
128K
¥18 / ¥72Input/Output
47
ernie-5.0-preview-1203
Baidu
87.2
2.2K
128K
¥7.92 / ¥14.4Input/Output
48
dola-seed-2.0-pro
Bytedance
86.9
8.5K
-
-
49
gpt-4.5-preview-2025-02-27
Openai
86.6
4.1K
8.19K
¥216 / ¥432Input/Output
50
glm-4.7
Zai
86.3
2.8K
205K
¥0 / ¥0Input/Output
51
mimo-v2.5
Xiaomi
86.0
3.7K
1.05M
¥2.88 / ¥14.4Input/Output
52
gpt-5.1
Openai
85.8
9.8K
400K
¥9 / ¥72Input/Output
53
deepseek-v4-flash
Deepseek
85.5
3.9K
1M
¥1.01 / ¥2.02Input/Output
54
grok-3-preview-02-24
Xai
85.2
7.8K
1M
¥9 / ¥18Input/Output
55
ernie-5.0-preview-1022
Baidu
84.9
1.1K
128K
¥7.92 / ¥14.4Input/Output
56
gemma-4-26b-a4b
Google
84.6
1.3K
262K
¥0.94 / ¥2.88Input/Output
57
gemini-3.1-flash-lite-preview
Google
84.4
8K
1.05M
¥1.8 / ¥10.8Input/Output
58
deepseek-v3.1-terminus-thinking
Deepseek
84.1
772
128K
¥1.8 / ¥5.04Input/Output
59
deepseek-v3.1-terminus
Deepseek
83.8
810
128K
¥1.8 / ¥5.04Input/Output
60
deepseek-v3.2-exp
Deepseek
83.5
2.6K
128K
¥0 / ¥0Input/Output
61
grok-4.1
Xai
83.2
14.4K
200K
¥14.4 / ¥72Input/Output
62
gemini-2.5-flash
Google
83.0
27.5K
1.05M
¥2.16 / ¥18Input/Output
63
deepseek-v4-flash-thinking
Deepseek
82.7
3.9K
1M
¥1.01 / ¥2.02Input/Output
64
qwen3-max-preview
Alibaba
82.4
6.2K
262K
¥6.2 / ¥24.8Input/Output
65
deepseek-v3.2
Deepseek
82.1
10.1K
128K
¥2.09 / ¥3.1Input/Output
66
deepseek-v3.1-thinking
Deepseek
81.8
2.7K
128K
¥1.44 / ¥5.04Input/Output
67
grok-4.1-thinking
Xai
81.6
14.4K
200K
¥14.4 / ¥72Input/Output
68
deepseek-r1-0528
Deepseek
81.3
4.2K
164K
¥3.6 / ¥15.5Input/Output
69
mistral-large-3
Mistral
81.0
9.7K
262K
¥3.6 / ¥10.8Input/Output
70
glm-4.5
Zai
80.7
5.4K
131K
¥4.32 / ¥15.8Input/Output
71
grok-4-0709
Xai
80.4
9.1K
256K
¥21.6 / ¥108Input/Output
72
kimi-k2.5-instant
Moonshot
80.2
1.8K
262K
¥4.32 / ¥21.6Input/Output
73
deepseek-v3.1
Deepseek
79.9
3.5K
128K
¥1.44 / ¥5.04Input/Output
74
grok-4.3
Xai
79.6
3.8K
1M
¥9 / ¥18Input/Output
75
mistral-medium-2508
Mistral
79.3
20.9K
262K
¥2.88 / ¥14.4Input/Output
76
deepseek-v3.2-thinking
Deepseek
79.1
9.1K
128K
¥2.09 / ¥3.1Input/Output
77
gemini-2.5-flash-preview-09-2025
Google
78.8
7.3K
1M
¥2.16 / ¥18Input/Output
78
gpt-5.4-mini-high
Openai
78.5
6.2K
400K
¥5.4 / ¥32.4Input/Output
79
deepseek-v3.2-exp-thinking
Deepseek
78.2
2.1K
128K
¥0 / ¥0Input/Output
80
gpt-5.2-high
Openai
77.9
10.6K
400K
¥12.6 / ¥101Input/Output
81
qwen3-vl-235b-a22b-instruct
Alibaba
77.7
2.4K
128K
¥2.16 / ¥8.64Input/Output
82
grok-4-fast-chat
Xai
77.4
1.5K
2M
¥1.44 / ¥3.6Input/Output
83
gpt-5.2
Openai
77.1
10.7K
400K
¥12.6 / ¥101Input/Output
84
claude-opus-4-20250514-thinking-16k
Anthropic
76.8
7.9K
200K
¥108 / ¥540Input/Output
85
qwen3-235b-a22b-instruct-2507
Alibaba
76.5
21.3K
128K
¥2.09 / ¥8.23Input/Output
86
qwen3.5-122b-a10b
Alibaba
76.3
6K
262K
¥2.88 / ¥23Input/Output
87
amazon-nova-experimental-chat-26-02-10
Amazon
76.0
697
-
-
88
longcat-flash-chat-2602-exp
Meituan
75.7
5.4K
128K
¥1.08 / ¥10.8Input/Output
89
qwen3-max-2025-09-23
Alibaba
75.4
2K
258K
¥6.19 / ¥24.7Input/Output
90
qwen3-235b-a22b-thinking-2507
Alibaba
75.1
1.9K
131K
¥2.07 / ¥8.26Input/Output
91
hunyuan-vision-1.5-thinking
Tencent
74.9
489
-
-
92
mimo-v2-flash (non-thinking)
Xiaomi
74.6
10K
262K
¥0.72 / ¥2.16Input/Output
93
gpt-5-chat
Openai
74.3
6.8K
400K
¥9 / ¥72Input/Output
94
grok-4-1-fast-reasoning
Xai
74.0
12.4K
2M
¥1.44 / ¥3.6Input/Output
95
amazon-nova-experimental-chat-12-10
Amazon
73.7
829
-
-
96
hunyuan-hy3-preview
Tencent
73.5
1.3K
256K
¥0 / ¥0Input/Output
97
hunyuan-t1-20250711
Tencent
73.2
1K
131K
¥0 / ¥0Input/Output
98
claude-haiku-4-5-20251001
Anthropic
72.9
17.5K
200K
¥7.2 / ¥36Input/Output
99
claude-opus-4-20250514
Anthropic
72.6
9.4K
200K
¥108 / ¥540Input/Output
100
kimi-k2-thinking-turbo
Moonshot
72.3
13.7K
262K
¥17.3 / ¥72Input/Output
101
grok-4-fast-reasoning
Xai
72.1
4.2K
2M
¥1.44 / ¥3.6Input/Output
102
qwen3.5-27b
Alibaba
71.8
5.8K
262K
¥2.16 / ¥17.3Input/Output
103
gpt-5-high
Openai
71.5
7K
400K
¥9 / ¥72Input/Output
104
gpt-5.3-chat-latest
Openai
71.2
7.1K
128K
¥12.6 / ¥101Input/Output
105
mimo-v2-omni
Xiaomi
70.9
683
262K
¥2.88 / ¥14.4Input/Output
106
gemini-2.5-flash-lite-preview-06-17-thinking
Google
70.7
7.1K
65.5K
¥0.72 / ¥2.88Input/Output
107
amazon-nova-experimental-chat-11-10
Amazon
70.4
5.7K
-
-
108
minimax-m2.1-preview
Minimax
70.1
3.9K
205K
¥0 / ¥0Input/Output
109
o3-2025-04-16
Openai
69.8
13.3K
200K
¥14.4 / ¥57.6Input/Output
110
qwen3.5-flash
Alibaba
69.6
6.6K
1M
¥1.24 / ¥12.4Input/Output
111
gpt-4.1-2025-04-14
Openai
69.3
11.4K
1.05M
¥14.4 / ¥57.6Input/Output
112
step-3.5-flash
Stepfun
69.0
8K
256K
¥0.69 / ¥2.07Input/Output
113
qwen3-235b-a22b-no-thinking
Alibaba
68.7
8.1K
131K
¥2.07 / ¥8.26Input/Output
114
longcat-flash-chat
Meituan
68.4
2.5K
128K
¥1.08 / ¥10.8Input/Output
115
gemini-2.5-flash-lite-preview-09-2025-no-thinking
Google
68.2
10.5K
1.05M
¥0.72 / ¥2.88Input/Output
116
qwen3.5-35b-a3b
Alibaba
67.9
6.3K
262K
¥1.8 / ¥14.4Input/Output
117
qwen3-vl-235b-a22b-thinking
Alibaba
67.6
1.8K
131K
¥2.06 / ¥8.26Input/Output
118
minimax-m2.7
Minimax
67.3
5.1K
205K
¥0 / ¥0Input/Output
119
deepseek-v3-0324
Deepseek
67.0
10.3K
75K
¥1.44 / ¥5.76Input/Output
120
deepseek-r1
Deepseek
66.8
5.4K
164K
¥5.04 / ¥18Input/Output
121
mimo-v2-flash (thinking)
Xiaomi
66.5
2.4K
262K
¥0.72 / ¥2.16Input/Output
122
o1-2024-12-17
Openai
66.2
7.7K
128K
¥108 / ¥432Input/Output
123
qwen3-next-80b-a3b-instruct
Alibaba
65.9
5.1K
131K
¥1.04 / ¥4.13Input/Output
124
claude-sonnet-4-20250514-thinking-32k
Anthropic
65.6
7.5K
200K
¥21.6 / ¥108Input/Output
125
amazon-nova-experimental-chat-26-01-10
Amazon
65.4
761
-
-
126
glm-4.5-air
Zai
65.1
7K
131K
¥0 / ¥0Input/Output
127
mistral-medium-2505
Mistral
64.8
7.2K
262K
¥2.88 / ¥14.4Input/Output
128
hunyuan-turbos-20250416
Tencent
64.5
2.5K
131K
¥0 / ¥0Input/Output
129
amazon-nova-experimental-chat-10-20
Amazon
64.2
2.5K
-
-
130
claude-sonnet-4-20250514
Anthropic
64.0
8.6K
200K
¥21.6 / ¥108Input/Output
131
minimax-m2.5
Minimax
63.7
8.5K
205K
¥0 / ¥0Input/Output
132
gemini-2.0-flash-001
Google
63.4
10.5K
1.05M
¥1.08 / ¥4.32Input/Output
133
gemma-3-27b-it
Google
63.1
10.7K
128K
¥2.15 / ¥2.15Input/Output
134
grok-3-mini-high
Xai
62.8
3.6K
128K
¥0 / ¥0Input/Output
135
glm-4.6v
Zai
62.6
608
128K
¥2.16 / ¥6.48Input/Output
136
qwen2.5-max
Alibaba
62.3
8.2K
32K
¥11.5 / ¥46Input/Output
137
gpt-5-mini-high
Openai
62.0
6.1K
400K
¥1.8 / ¥14.4Input/Output
138
kimi-k2-0905-preview
Moonshot
61.7
2.6K
262K
¥4.32 / ¥18Input/Output
139
grok-3-mini-beta
Xai
61.5
4.8K
1M
¥9 / ¥18Input/Output
140
qwen3-coder-480b-a35b-instruct
Alibaba
61.2
5.4K
262K
¥6.2 / ¥24.8Input/Output
141
o1-preview
Openai
60.9
8.4K
128K
¥108 / ¥432Input/Output
142
intellect-3
-
60.6
1.3K
131K
¥1.44 / ¥7.92Input/Output
143
gpt-5.4-nano-high
Openai
60.3
6K
400K
¥1.44 / ¥9Input/Output
144
qwen3-30b-a3b-instruct-2507
Alibaba
60.1
5.1K
262K
¥2.16 / ¥3.6Input/Output
145
claude-3-7-sonnet-20250219-thinking-32k
Anthropic
59.8
8.9K
-
-
146
qwen3-235b-a22b
Alibaba
59.5
6.2K
131K
¥2.07 / ¥8.26Input/Output
147
nvidia-nemotron-3-super-120b-a12b
Nvidia
59.2
1.6K
262K
¥1.44 / ¥5.76Input/Output
148
deepseek-v3
Deepseek
58.9
5.9K
128K
¥0 / ¥0Input/Output
149
trinity-large-preview
-
58.7
6.5K
262K
¥1.8 / ¥6.48Input/Output
150
qwen3-next-80b-a3b-thinking
Alibaba
58.4
3.1K
131K
¥1.04 / ¥10.3Input/Output
151
command-a-03-2025
Cohere
58.1
12.8K
256K
¥18 / ¥72Input/Output
152
kimi-k2-0711-preview
Moonshot
57.8
6K
131K
¥4.32 / ¥18Input/Output
153
glm-4-plus-0111
Zai
57.5
1.5K
128K
¥72 / ¥72Input/Output
154
gemini-1.5-pro-002
Google
57.3
15K
-
-
155
step-3
Stepfun
57.0
1.4K
65.5K
¥1.8 / ¥4.68Input/Output
156
claude-3-7-sonnet-20250219
Anthropic
56.7
9.8K
200K
¥21.6 / ¥108Input/Output
157
trinity-large-thinking
-
56.4
5.6K
262K
¥1.8 / ¥6.48Input/Output
158
glm-4.7-flash
Zai
56.1
2.6K
200K
¥0 / ¥0Input/Output
159
amazon-nova-experimental-chat-10-09
Amazon
55.9
618
-
-
160
gemini-2.0-flash-lite-preview-02-05
Google
55.6
6.7K
1.05M
¥0.54 / ¥2.16Input/Output
161
gpt-4.1-mini-2025-04-14
Openai
55.3
8.7K
1.05M
¥2.88 / ¥11.5Input/Output
162
nova-2-lite
Amazon
55.0
2.8K
128K
¥2.38 / ¥19.8Input/Output
163
o4-mini-2025-04-16
Openai
54.7
10.1K
200K
¥7.92 / ¥31.7Input/Output
164
o3-mini-high
Openai
54.5
4.8K
200K
¥7.92 / ¥31.7Input/Output
165
claude-3-5-sonnet-20241022
Anthropic
54.2
21.9K
200K
¥21.6 / ¥108Input/Output
166
mistral-small-2506
Mistral
53.9
3.8K
262K
¥2.88 / ¥14.4Input/Output
167
minimax-m1
Minimax
53.6
7.8K
1M
¥0.95 / ¥9.03Input/Output
168
step-1o-turbo-202506
Stepfun
53.4
1.8K
-
-
169
gemma-3-12b-it
Google
53.1
1K
128K
¥1.96 / ¥1.96Input/Output
170
gpt-oss-120b
Openai
52.8
6.7K
131K
¥1.08 / ¥4.32Input/Output
171
glm-4.5v
Zai
52.5
1.1K
64K
¥4.32 / ¥13Input/Output
172
step-2-16k-exp-202412
Stepfun
52.2
1.3K
16.4K
¥37.5 / ¥118Input/Output
173
mercury-2
Inception Ai
52.0
696
128K
¥1.8 / ¥5.4Input/Output
174
hunyuan-turbos-20250226
Tencent
51.7
662
131K
¥0 / ¥0Input/Output
175
qwq-32b
Alibaba
51.4
6.2K
131K
¥2.07 / ¥6.2Input/Output
176
llama-3.3-nemotron-49b-super-v1
Nvidia
51.1
614
131K
¥0 / ¥0Input/Output
177
llama-3.1-nemotron-ultra-253b-v1
Nvidia
50.8
712
128K
¥4.32 / ¥13Input/Output
178
minimax-m2
Minimax
50.6
1.5K
197K
¥0 / ¥0Input/Output
179
qwen3-32b
Alibaba
50.3
1.1K
131K
¥2.07 / ¥8.26Input/Output
180
gpt-4o-2024-05-13
Openai
50.0
30K
128K
¥36 / ¥108Input/Output
181
ring-flash-2.0
Ant Group
49.7
1.5K
131K
¥1.01 / ¥4.1Input/Output
182
qwen-plus-0125
Alibaba
49.4
1.5K
1M
¥0.83 / ¥2.07Input/Output
183
o3-mini
Openai
49.2
13.4K
200K
¥7.92 / ¥31.7Input/Output
184
ling-flash-2.0
Ant Group
48.9
1.5K
131K
¥1.01 / ¥4.1Input/Output
185
gemini-1.5-flash-002
Google
48.6
9.4K
2M
¥0.54 / ¥2.2Input/Output
186
nvidia-llama-3.3-nemotron-super-49b-v1.5
Nvidia
48.3
658
131K
¥2.88 / ¥2.88Input/Output
187
grok-2-2024-08-13
Xai
48.0
17.4K
1M
¥9 / ¥18Input/Output
188
qwen3-30b-a3b
Alibaba
47.8
6K
128K
¥0.79 / ¥7.78Input/Output
189
hunyuan-turbo-0110
Tencent
47.5
711
-
-
190
gpt-4o-2024-08-06
Openai
47.2
12.1K
128K
¥18 / ¥72Input/Output
191
gemini-1.5-pro-001
Google
46.9
21K
-
-
192
gemma-3-4b-it
Google
46.6
1.1K
128K
¥1.44 / ¥1.44Input/Output
193
nvidia-nemotron-3-nano-30b-a3b-bf16
Nvidia
46.4
3.5K
131K
¥0 / ¥0Input/Output
194
gemma-3n-e4b-it
Google
46.1
4.9K
128K
¥0 / ¥0Input/Output
195
gemini-advanced-0514
Google
45.8
13.2K
-
-
196
gpt-5-nano-high
Openai
45.5
1.7K
400K
¥0.36 / ¥2.88Input/Output
197
deepseek-v2.5-1210
Deepseek
45.3
1.9K
1M
¥1.01 / ¥2.02Input/Output
198
gpt-4o-mini-2024-07-18
Openai
45.0
18.5K
128K
¥1.08 / ¥4.32Input/Output
199
claude-3-5-sonnet-20240620
Anthropic
44.7
21.9K
200K
¥21.6 / ¥108Input/Output
200
o1-mini
Openai
44.4
14.3K
128K
¥7.92 / ¥31.7Input/Output
201
gpt-4-turbo-2024-04-09
Openai
44.1
24.5K
128K
¥72 / ¥216Input/Output
202
glm-4-plus
Zai
43.9
7.2K
128K
¥54 / ¥54Input/Output
203
llama-4-maverick-17b-128e-instruct
Meta
43.6
8.8K
1M
¥1.8 / ¥6.26Input/Output
204
olmo-3.1-32b-instruct
Allenai
43.3
2.7K
200K
¥14.4 / ¥57.6Input/Output
205
yi-lightning
-
43.0
7.6K
12K
¥1.44 / ¥1.44Input/Output
206
llama-3.1-405b-instruct-fp8
Meta
42.7
16K
128K
¥0 / ¥0Input/Output
207
qwen2.5-plus-1127
Alibaba
42.5
2.8K
-
-
208
llama-4-scout-17b-16e-instruct
Meta
42.2
6.7K
128K
¥1.44 / ¥5.62Input/Output
209
olmo-3-32b-think
Allenai
41.9
1.3K
128K
¥2.16 / ¥3.24Input/Output
210
hunyuan-large-2025-02-10
Tencent
41.6
890
-
-
211
llama-3.1-405b-instruct-bf16
Meta
41.3
11.2K
128K
¥0 / ¥0Input/Output
212
qwen-max-0919
Alibaba
41.1
4.6K
131K
¥2.48 / ¥9.91Input/Output
213
mistral-small-3.1-24b-instruct-2503
Mistral
40.8
7.1K
262K
¥2.88 / ¥14.4Input/Output
214
claude-3-opus-20240229
Anthropic
40.5
49.9K
200K
¥108 / ¥540Input/Output
215
gpt-4-1106-preview
Openai
40.2
24.7K
8.19K
¥216 / ¥432Input/Output
216
granite-4.1-8b
Ibm
39.9
821
131K
¥0.36 / ¥0.72Input/Output
217
grok-2-mini-2024-08-13
Xai
39.7
14.5K
1M
¥9 / ¥18Input/Output
218
hunyuan-large-vision
Tencent
39.4
1.1K
-
-
219
gpt-4.1-nano-2025-04-14
Openai
39.1
1.6K
1.05M
¥14.4 / ¥57.6Input/Output
220
mistral-large-2411
Mistral
38.8
7.4K
128K
¥14.4 / ¥43.2Input/Output
221
magistral-medium-2506
Mistral
38.5
2.4K
128K
¥14.4 / ¥36Input/Output
222
mistral-large-2407
Mistral
38.3
12.2K
131K
¥14.4 / ¥43.2Input/Output
223
gpt-4-0125-preview
Openai
38.0
23.5K
8.19K
¥216 / ¥432Input/Output
224
hunyuan-standard-2025-02-10
Tencent
37.7
937
-
-
225
claude-3-5-haiku-20241022
Anthropic
37.4
17K
200K
¥5.76 / ¥28.8Input/Output
226
llama-3.3-70b-instruct
Meta
37.2
13.8K
128K
¥0 / ¥0Input/Output
227
llama-3.1-tulu-3-70b
Allenai
36.9
758
-
-
228
athene-v2-chat
-
36.6
6.7K
-
-
229
llama-3.1-nemotron-70b-instruct
Nvidia
36.3
1.8K
128K
¥0 / ¥0Input/Output
230
gemini-1.5-flash-001
Google
36.0
16.5K
2M
¥0.54 / ¥2.2Input/Output
231
olmo-3.1-32b-think
Allenai
35.8
1.9K
200K
¥14.4 / ¥57.6Input/Output
232
gemma-2-27b-it
Google
35.5
20.4K
8.19K
¥0.58 / ¥0.58Input/Output
233
qwen2.5-72b-instruct
Alibaba
35.2
10.8K
131K
¥4.13 / ¥12.4Input/Output
234
deepseek-v2.5
Deepseek
34.9
6.6K
1M
¥1.01 / ¥2.02Input/Output
235
amazon-nova-pro-v1.0
Amazon
34.6
6.6K
300K
¥5.76 / ¥23Input/Output
236
reka-core-20240904
-
34.4
1.9K
-
-
237
command-r-plus-08-2024
Cohere
34.1
2.7K
128K
¥18 / ¥72Input/Output
238
athene-70b-0725
-
33.8
5.4K
-
-
239
gpt-oss-20b
Openai
33.5
2.2K
131K
¥0.32 / ¥1.3Input/Output
240
llama-3.1-70b-instruct
Meta
33.2
15K
131K
¥2.88 / ¥2.88Input/Output
241
ibm-granite-h-small
Ibm
33.0
1.2K
-
-
242
mercury
Inception Ai
32.7
456
128K
¥1.8 / ¥5.4Input/Output
243
jamba-1.5-large
-
32.4
2.4K
256K
¥0 / ¥0Input/Output
244
gemini-1.5-flash-8b-001
Google
32.1
9.5K
2M
¥0.54 / ¥2.2Input/Output
245
gemma-2-9b-it-simpo
-
31.8
2.8K
8.19K
¥1.44 / ¥1.44Input/Output
246
nemotron-4-340b-instruct
Nvidia
31.6
5.4K
-
-
247
c4ai-aya-expanse-32b
Cohere
31.3
7.3K
-
-
248
llama-3.1-nemotron-51b-instruct
Nvidia
31.0
1K
128K
¥0 / ¥0Input/Output
249
gemma-2-9b-it
Google
30.7
14.6K
8.19K
¥1.44 / ¥1.44Input/Output
250
gpt-4-0314
Openai
30.4
12.9K
8.19K
¥216 / ¥432Input/Output
251
command-r-plus
Cohere
30.2
19.6K
128K
¥18 / ¥72Input/Output
252
claude-3-sonnet-20240229
Anthropic
29.9
27.3K
200K
¥21.6 / ¥108Input/Output
253
reka-flash-20240904
-
29.6
2K
65.5K
¥0.72 / ¥1.44Input/Output
254
gpt-4-0613
Openai
29.3
22.1K
8.19K
¥216 / ¥432Input/Output
255
mistral-small-24b-instruct-2501
Mistral
29.1
3.9K
262K
¥2.88 / ¥14.4Input/Output
256
amazon-nova-lite-v1.0
Amazon
28.8
5.3K
300K
¥0.43 / ¥1.73Input/Output
257
glm-4-0520
Zai
28.5
2.6K
128K
¥108 / ¥108Input/Output
258
llama-3-70b-instruct
Meta
28.2
37.7K
8.19K
¥3.67 / ¥5.33Input/Output
259
olmo-2-0325-32b-instruct
Allenai
27.9
872
-
-
260
qwen2-72b-instruct
Alibaba
27.7
10K
131K
¥4.13 / ¥12.4Input/Output
261
phi-4
Microsoft
27.4
6.5K
128K
¥0.9 / ¥3.6Input/Output
262
claude-3-haiku-20240307
Anthropic
27.1
29.9K
200K
¥1.8 / ¥9Input/Output
263
command-r-08-2024
Cohere
26.8
2.6K
128K
¥18 / ¥72Input/Output
264
amazon-nova-micro-v1.0
Amazon
26.5
5.2K
128K
¥0.25 / ¥1.01Input/Output
265
qwen2.5-coder-32b-instruct
Alibaba
26.3
1.5K
131K
¥2.07 / ¥6.2Input/Output
266
c4ai-aya-expanse-8b
Cohere
26.0
2.7K
-
-
267
ministral-8b-2410
Mistral
25.7
1.4K
128K
¥0.72 / ¥0.72Input/Output
268
mistral-large-2402
Mistral
25.4
15.2K
262K
¥2.88 / ¥14.4Input/Output
269
llama-3.1-tulu-3-8b
Allenai
25.1
816
-
-
270
hunyuan-standard-256k
Tencent
24.9
752
-
-
271
mistral-medium
Mistral
24.6
8.4K
262K
¥2.88 / ¥14.4Input/Output
272
qwen1.5-110b-chat
Alibaba
24.3
6.5K
-
-
273
deepseek-coder-v2
Deepseek
24.0
3.9K
1M
¥1.01 / ¥2.02Input/Output
274
jamba-1.5-mini
-
23.7
2.4K
256K
¥0 / ¥0Input/Output
275
llama-3.1-8b-instruct
Meta
23.5
13.5K
131K
¥0.79 / ¥0.79Input/Output
276
command-r
Cohere
23.2
13.5K
128K
¥18 / ¥72Input/Output
277
mixtral-8x22b-instruct-v0.1
Mistral
22.9
12.8K
64K
¥14.4 / ¥43.2Input/Output
278
reka-flash-21b-20240226-online
-
22.6
3.6K
-
-
279
qwen1.5-72b-chat
Alibaba
22.3
9.5K
-
-
280
gemma-2-2b-it
Google
22.1
12.5K
128K
¥0 / ¥0Input/Output
281
reka-flash-21b-20240226
-
21.8
5.8K
-
-
282
llama-3-8b-instruct
Meta
21.5
25.5K
8.19K
¥0.29 / ¥0.29Input/Output
283
wizardlm-70b
Microsoft
21.2
2.3K
-
-
284
yi-1.5-34b-chat
-
20.9
6.5K
-
-
285
gemini-pro-dev-api
Google
20.7
4.6K
1.05M
¥14.4 / ¥86.4Input/Output
286
qwq-32b-preview
Alibaba
20.4
855
131K
¥2.07 / ¥6.2Input/Output
287
zephyr-orpo-141b-A35b-v0.1
-
20.1
1.1K
200K
¥108 / ¥432Input/Output
288
gpt-3.5-turbo-0125
Openai
19.8
16.1K
16.4K
¥3.6 / ¥10.8Input/Output
289
granite-3.1-8b-instruct
Ibm
19.6
837
-
-
290
internlm2_5-20b-chat
-
19.3
2.8K
-
-
291
gemini-pro
Google
19.0
1.3K
1.05M
¥14.4 / ¥86.4Input/Output
292
phi-3-medium-4k-instruct
Microsoft
18.7
6.7K
4.1K
¥1.22 / ¥4.9Input/Output
293
tulu-2-dpo-70b
-
18.4
1.5K
-
-
294
mixtral-8x7b-instruct-v0.1
Mistral
18.2
17.9K
32K
¥5.04 / ¥5.04Input/Output
295
dbrx-instruct-preview
-
17.9
8K
-
-
296
vicuna-33b
-
17.6
5.6K
-
-
297
qwen1.5-32b-chat
Alibaba
17.3
5.4K
-
-
298
openchat-3.5
-
17.0
2K
-
-
299
yi-34b-chat
-
16.8
3.7K
-
-
300
qwen1.5-14b-chat
Alibaba
16.5
4.5K
-
-
301
starling-lm-7b-beta
-
16.2
4K
200K
¥5.4 / ¥18.7Input/Output
302
deepseek-llm-67b-chat
Deepseek
15.9
1.1K
1M
¥1.01 / ¥2.02Input/Output
303
openchat-3.5-0106
-
15.6
3.1K
-
-
304
falcon-180b-chat
-
15.4
342
-
-
305
nous-hermes-2-mixtral-8x7b-dpo
-
15.1
858
1M
¥36 / ¥180Input/Output
306
snowflake-arctic-instruct
-
14.8
7.2K
-
-
307
starling-lm-7b-alpha
-
14.5
2.5K
200K
¥5.4 / ¥18.7Input/Output
308
wizardlm-13b
Microsoft
14.2
1.8K
-
-
309
openhermes-2.5-mistral-7b
-
14.0
1.1K
1M
¥36 / ¥180Input/Output
310
llama-3.2-3b-instruct
Meta
13.7
2.1K
131K
¥0.22 / ¥0.35Input/Output
311
llama2-70b-steerlm-chat
Nvidia
13.4
836
-
-
312
phi-3-small-8k-instruct
Microsoft
13.1
4.6K
8.19K
¥1.08 / ¥4.32Input/Output
313
llama-2-70b-chat
Meta
12.8
9.6K
-
-
314
granite-3.1-2b-instruct
Ibm
12.6
850
-
-
315
gpt-3.5-turbo-1106
Openai
12.3
3.9K
16.4K
¥7.2 / ¥14.4Input/Output
316
solar-10.7b-instruct-v1.0
-
12.0
894
128K
¥0 / ¥0Input/Output
317
dolphin-2.2.1-mistral-7b
-
11.7
382
262K
¥2.88 / ¥14.4Input/Output
318
zephyr-7b-beta
-
11.5
2.7K
-
-
319
zephyr-7b-alpha
-
11.2
569
-
-
320
mpt-30b-chat
-
10.9
647
-
-
321
vicuna-13b
-
10.6
4.7K
-
-
322
granite-3.0-8b-instruct
Ibm
10.3
1.8K
-
-
323
gemma-1.1-7b-it
Google
10.1
6K
-
-
324
mistral-7b-instruct-v0.2
Mistral
9.8
4.7K
262K
¥2.88 / ¥14.4Input/Output
325
guanaco-33b
-
9.5
714
200K
¥14.4 / ¥57.6Input/Output
326
qwen1.5-7b-chat
Alibaba
9.2
1.1K
-
-
327
llama-2-13b-chat
Meta
8.9
4.8K
-
-
328
phi-3-mini-4k-instruct-june-2024
Microsoft
8.7
3.4K
4.1K
¥0.94 / ¥3.74Input/Output
329
phi-3-mini-4k-instruct
Microsoft
8.4
5.2K
4.1K
¥0.94 / ¥3.74Input/Output
330
granite-3.0-2b-instruct
Ibm
8.1
1.8K
-
-
331
qwen-14b-chat
Alibaba
7.8
1.2K
32.8K
¥1.04 / ¥3.1Input/Output
332
vicuna-7b
-
7.5
1.8K
-
-
333
codellama-34b-instruct
Meta
7.3
2K
-
-
334
phi-3-mini-128k-instruct
Microsoft
7.0
4.6K
128K
¥0.94 / ¥3.74Input/Output
335
llama-2-7b-chat
Meta
6.7
3.5K
128K
¥4.03 / ¥48Input/Output
336
stripedhyena-nous-7b
-
6.4
1.3K
-
-
337
gemma-7b-it
Google
6.1
2.2K
-
-
338
mistral-7b-instruct
Mistral
5.9
2.3K
262K
¥2.88 / ¥14.4Input/Output
339
llama-3.2-1b-instruct
Meta
5.6
2K
16.4K
¥0.07 / ¥0.08Input/Output
340
smollm2-1.7b-instruct
-
5.3
627
-
-
341
codellama-70b-instruct
Meta
5.0
262
-
-
342
palm-2
Google
4.7
2.2K
-
-
343
gemma-1.1-2b-it
Google
4.5
2.6K
-
-
344
olmo-7b-instruct
Allenai
4.2
1.5K
-
-
345
qwen1.5-4b-chat
Alibaba
3.9
1.9K
-
-
346
koala-13b
-
3.6
1.5K
-
-
347
gemma-2b-it
Google
3.4
1.2K
-
-
348
gpt4all-13b-snoozy
-
3.1
347
1M
¥36 / ¥216Input/Output
349
mpt-7b-chat
-
2.8
892
-
-
350
chatglm3-6b
-
2.5
1.1K
200K
¥5.4 / ¥18.7Input/Output
351
alpaca-13b
-
2.2
1.2K
-
-
352
chatglm2-6b
-
2.0
775
200K
¥5.4 / ¥18.7Input/Output
353
RWKV-4-Raven-14B
-
1.7
1.1K
-
-
354
oasst-pythia-12b
-
1.4
1.4K
-
-
355
chatglm-6b
-
1.1
1.1K
200K
¥5.4 / ¥18.7Input/Output
356
fastchat-t5-3b
-
0.8
940
-
-
357
dolly-v2-12b
-
0.6
732
-
-
358
stablelm-tuned-alpha-7b
-
0.3
681
-
-
359
llama-13b
Meta
0.0
513
-
-
Top model analysis

Why claude-opus-4-6-thinking ranks first

claude-opus-4-6-thinking ranks first with a percent score of 100.0 from 8.1K samples. Treat it as the first candidate for this leaderboard, then compare price, context length, and availability.

How to choose

Look beyond rank #1

Start with the leaderboard closest to your task. Compare the top models by score and sample size, then check price, context length, open or closed access, and provider availability.
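
The selection steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Entry` class, the `shortlist` function, and the threshold values are hypothetical and not part of the leaderboard or any Arena tooling. The rows are copied from the table above. Context values such as "1M" are read here as 1,000,000 tokens, and prices are kept as the yen figures shown on the page, whose billing unit the page does not state.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Entry:
    """One leaderboard row (hypothetical structure for illustration)."""
    rank: int
    model: str
    score: float                    # percent score as listed
    samples: int                    # sample count, e.g. "8.1K" -> 8100
    context_tokens: Optional[int]   # None where the page shows "-"
    input_price: Optional[float]    # yen figure as listed; unit not stated
    output_price: Optional[float]   # yen figure as listed; unit not stated


# A few rows copied from the leaderboard above.
ENTRIES = [
    Entry(1, "claude-opus-4-6-thinking", 100.0, 8100, 1_000_000, 36.0, 180.0),
    Entry(5, "gemini-3.1-pro-preview", 98.9, 10200, 1_050_000, 14.4, 86.4),
    Entry(9, "qwen3.5-max-preview", 97.8, 4600, None, None, None),
    Entry(14, "qwen3.7-max-preview", 96.4, 866, 1_000_000, 18.0, 54.0),
    Entry(22, "deepseek-v4-pro", 94.1, 3900, 1_000_000, 3.13, 6.26),
]


def shortlist(
    entries: List[Entry],
    min_samples: int = 1000,          # assumed cutoff for a stable result
    max_output_price: float = 100.0,  # assumed budget ceiling
    min_context: int = 200_000,       # assumed context requirement
) -> List[Entry]:
    """Keep rows that meet every constraint, best score first."""
    kept = [
        e for e in entries
        if e.samples >= min_samples
        and e.output_price is not None
        and e.output_price <= max_output_price
        and e.context_tokens is not None
        and e.context_tokens >= min_context
    ]
    return sorted(kept, key=lambda e: e.score, reverse=True)


if __name__ == "__main__":
    for e in shortlist(ENTRIES):
        print(e.rank, e.model, e.score, e.samples)
```

With these assumed thresholds, the rank #1 model drops out on price, the rank #14 model on sample count, and the rank #9 model because its price and context are not listed. Rows with missing data are excluded rather than guessed at; a different policy may suit other needs.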

FAQ

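Which metrics matter on the Writing, Literature & Language leaderboard?

Look mainly at rank, percent score, sample size, and source. The score gives a quick comparison of models within the same leaderboard; the sample size indicates how stable the result is.

Why can't different leaderboards be merged directly into one overall score?

Different leaderboards use different tasks, samples, and evaluation criteria. By default, 模力榜 ranks models only within a single leaderboard, to avoid forcing writing, code, image, and other capabilities into one number.

How should I choose a Writing, Literature & Language model?

Start with the leaderboard closest to your task, then weigh price, context length, open or closed source, and provider availability. A high rank does not mean a model suits every budget or deployment method.

How often is the leaderboard updated?

The page shows the most recent successfully collected public leaderboard data. It currently gives priority to the LMArena leaderboard dataset and keeps the original link in the page's source section.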