Chat · Text · Creative Writing Leaderboard

Ranking for Text / Creative Writing, based on public preference data.

Selection guide

Creative Writing model ranking guide

Ranking for Text / Creative Writing, based on public preference data.

claude-opus-4-6-thinkinggemini-3.1-pro-previewgemini-3-proclaude-opus-4-6claude-opus-4-7-thinking
Current DirectoryChat · Text · Creative Writing
Models358
Published2026/05/27
Arena public preference evaluationOriginal leaderboard: Text / Creative WritingPublished: 2026/05/27Leaderboard dataset: LMArena latest parquetOpen Arena sourceOpen leaderboard dataset
1
claude-opus-4-6-thinking
Anthropic
100.0
5.5K
1M
¥36 / ¥180Input/Output
2
gemini-3.1-pro-preview
Google
99.7
7K
1.05M
¥14.4 / ¥86.4Input/Output
3
gemini-3-pro
Google
99.4
6.3K
1.05M
¥14.4 / ¥86.4Input/Output
4
claude-opus-4-6
Anthropic
99.2
5.7K
1M
¥36 / ¥180Input/Output
5
claude-opus-4-7-thinking
Anthropic
98.9
3.2K
1M
¥36 / ¥180Input/Output
6
claude-opus-4-7
Anthropic
98.6
3.4K
1M
¥36 / ¥180Input/Output
7
gemini-3.5-flash
Google
98.3
1.4K
1.05M
¥10.8 / ¥64.8Input/Output
8
qwen3.5-max-preview
Alibaba
98.0
3K
-
-
9
glm-5.1
Zai
97.8
2.2K
200K
¥0 / ¥0Input/Output
10
gemini-3-flash
Google
97.5
4.7K
1.05M
¥3.6 / ¥21.6Input/Output
11
muse-spark
Meta
97.2
1.8K
-
-
12
gemini-2.5-pro
Google
96.9
17.2K
1.05M
¥9 / ¥72Input/Output
13
ernie-5.1
Baidu
96.6
2.2K
119K
¥5.4 / ¥21.6Input/Output
14
gpt-5.5-high
Openai
96.4
2.8K
1.05M
¥36 / ¥216Input/Output
15
gpt-5.5
Openai
96.1
2.9K
1.05M
¥36 / ¥216Input/Output
16
qwen3.7-max-preview
Alibaba
95.8
496
1M
¥18 / ¥54Input/Output
17
claude-opus-4-5-20251101-thinking-32k
Anthropic
95.5
5.5K
200K
¥108 / ¥540Input/Output
18
claude-opus-4-5-20251101
Anthropic
95.2
10.2K
200K
¥36 / ¥180Input/Output
19
mimo-v2.5-pro
Xiaomi
95.0
2.4K
1.05M
¥7.2 / ¥21.6Input/Output
20
claude-sonnet-4-5-20250929
Anthropic
94.7
11.3K
200K
¥21.6 / ¥108Input/Output
21
gpt-5.4-high
Openai
94.4
4.5K
1.05M
¥18 / ¥108Input/Output
22
glm-5
Zai
94.1
3.5K
205K
¥7.2 / ¥23Input/Output
23
gemini-3-flash (thinking-minimal)
Google
93.8
8.2K
1.05M
¥3.6 / ¥21.6Input/Output
24
deepseek-v4-pro
Deepseek
93.6
2.6K
1M
¥3.13 / ¥6.26Input/Output
25
grok-4.20-beta1
Xai
93.3
3.7K
2M
¥14.4 / ¥43.2Input/Output
26
claude-sonnet-4-6
Anthropic
93.0
4.3K
1M
¥21.6 / ¥108Input/Output
27
deepseek-v4-pro-thinking
Deepseek
92.7
2.4K
1M
¥3.13 / ¥6.26Input/Output
28
gpt-5.4
Openai
92.4
4.6K
1.05M
¥18 / ¥108Input/Output
29
grok-4.20-beta-0309-reasoning
Xai
92.2
4.7K
2M
¥14.4 / ¥43.2Input/Output
30
grok-4.20-multi-agent-beta-0309
Xai
91.9
4.6K
2M
¥14.4 / ¥43.2Input/Output
31
kimi-k2.6
Moonshot
91.6
2.3K
262K
¥6.84 / ¥28.8Input/Output
32
qwen3.6-max-preview
Alibaba
91.3
666
246K
¥9.5 / ¥56.9Input/Output
33
ernie-5.0-0110
Baidu
91.0
5.4K
128K
¥7.92 / ¥14.4Input/Output
34
gpt-5.1-high
Openai
90.8
6.1K
400K
¥9 / ¥72Input/Output
35
ernie-5.0-preview-1203
Baidu
90.5
1.6K
128K
¥7.92 / ¥14.4Input/Output
36
claude-sonnet-4-5-20250929-thinking-32k
Anthropic
90.2
11.7K
200K
¥21.6 / ¥108Input/Output
37
kimi-k2.5-thinking
Moonshot
89.9
5.6K
262K
¥4.32 / ¥21.6Input/Output
38
gpt-5.5-instant
Openai
89.6
4.1K
400K
¥9 / ¥72Input/Output
39
gemma-4-31b
Google
89.4
934
262K
¥3.24 / ¥7.2Input/Output
40
ernie-5.0-preview-1022
Baidu
89.1
717
128K
¥7.92 / ¥14.4Input/Output
41
grok-3-preview-02-24
Xai
88.8
4.7K
1M
¥9 / ¥18Input/Output
42
glm-4.6
Zai
88.5
5.1K
205K
¥4.32 / ¥15.8Input/Output
43
mimo-v2-pro
Xiaomi
88.2
3.4K
1.05M
¥7.2 / ¥21.6Input/Output
44
grok-4.1
Xai
88.0
9.5K
200K
¥14.4 / ¥72Input/Output
45
claude-opus-4-1-20250805-thinking-16k
Anthropic
87.7
6.8K
200K
¥108 / ¥540Input/Output
46
claude-opus-4-1-20250805
Anthropic
87.4
10.7K
200K
¥108 / ¥540Input/Output
47
deepseek-r1-0528
Deepseek
87.1
2.4K
164K
¥3.6 / ¥15.5Input/Output
48
dola-seed-2.0-pro
Bytedance
86.8
5.9K
-
-
49
qwen3.5-397b-a17b
Alibaba
86.6
5K
262K
¥3.1 / ¥18.6Input/Output
50
chatgpt-4o-latest-20250326
Openai
86.3
11.6K
128K
¥18 / ¥72Input/Output
51
grok-4.1-thinking
Xai
86.0
9.6K
200K
¥14.4 / ¥72Input/Output
52
deepseek-v3.1-terminus
Deepseek
85.7
457
128K
¥1.8 / ¥5.04Input/Output
53
gpt-5.1
Openai
85.4
6.5K
400K
¥9 / ¥72Input/Output
54
gemma-4-26b-a4b
Google
85.2
943
262K
¥0.94 / ¥2.88Input/Output
55
deepseek-v3.2-exp
Deepseek
84.9
1.6K
128K
¥0 / ¥0Input/Output
56
deepseek-v3.1-thinking
Deepseek
84.6
1.6K
128K
¥1.44 / ¥5.04Input/Output
57
glm-4.7
Zai
84.3
1.9K
205K
¥0 / ¥0Input/Output
58
gemini-2.5-flash
Google
84.0
17K
1.05M
¥2.16 / ¥18Input/Output
59
qwen3-max-preview
Alibaba
83.8
3.8K
262K
¥6.2 / ¥24.8Input/Output
60
gemini-3.1-flash-lite-preview
Google
83.5
5.4K
1.05M
¥1.8 / ¥10.8Input/Output
61
gpt-5.2-chat-latest-20260210
Openai
83.2
5K
400K
¥12.6 / ¥101Input/Output
62
qwen3.6-plus
Alibaba
82.9
2.6K
1M
¥3.6 / ¥21.6Input/Output
63
grok-4-0709
Xai
82.6
5.5K
256K
¥21.6 / ¥108Input/Output
64
deepseek-v3.2
Deepseek
82.4
6.5K
128K
¥2.09 / ¥3.1Input/Output
65
grok-4.3
Xai
82.1
2.5K
1M
¥9 / ¥18Input/Output
66
deepseek-v4-flash
Deepseek
81.8
2.5K
1M
¥1.01 / ¥2.02Input/Output
67
mimo-v2.5
Xiaomi
81.5
2.5K
1.05M
¥2.88 / ¥14.4Input/Output
68
glm-4.5
Zai
81.2
3.1K
131K
¥4.32 / ¥15.8Input/Output
69
gpt-4.5-preview-2025-02-27
Openai
81.0
2.6K
8.19K
¥216 / ¥432Input/Output
70
hunyuan-t1-20250711
Tencent
80.7
622
131K
¥0 / ¥0Input/Output
71
mistral-medium-2508
Mistral
80.4
13.5K
262K
¥2.88 / ¥14.4Input/Output
72
mistral-large-3
Mistral
80.1
6.5K
262K
¥3.6 / ¥10.8Input/Output
73
deepseek-v4-flash-thinking
Deepseek
79.8
2.5K
1M
¥1.01 / ¥2.02Input/Output
74
grok-4-1-fast-reasoning
Xai
79.6
8.5K
2M
¥1.44 / ¥3.6Input/Output
75
deepseek-v3.2-exp-thinking
Deepseek
79.3
1.2K
128K
¥0 / ¥0Input/Output
76
grok-4-fast-chat
Xai
79.0
941
2M
¥1.44 / ¥3.6Input/Output
77
deepseek-v3.1-terminus-thinking
Deepseek
78.7
440
128K
¥1.8 / ¥5.04Input/Output
78
deepseek-v3.2-thinking
Deepseek
78.4
6.1K
128K
¥2.09 / ¥3.1Input/Output
79
deepseek-v3.1
Deepseek
78.2
2K
128K
¥1.44 / ¥5.04Input/Output
80
gemini-2.5-flash-preview-09-2025
Google
77.9
4.5K
1M
¥2.16 / ¥18Input/Output
81
hunyuan-vision-1.5-thinking
Tencent
77.6
261
-
-
82
qwen3-235b-a22b-thinking-2507
Alibaba
77.3
1.1K
131K
¥2.07 / ¥8.26Input/Output
83
longcat-flash-chat-2602-exp
Meituan
77.0
3.5K
128K
¥1.08 / ¥10.8Input/Output
84
claude-opus-4-20250514-thinking-16k
Anthropic
76.8
4.5K
200K
¥108 / ¥540Input/Output
85
qwen3-max-2025-09-23
Alibaba
76.5
1.2K
258K
¥6.19 / ¥24.7Input/Output
86
kimi-k2.5-instant
Moonshot
76.2
1.3K
262K
¥4.32 / ¥21.6Input/Output
87
mimo-v2-flash (non-thinking)
Xiaomi
75.9
6.8K
262K
¥0.72 / ¥2.16Input/Output
88
gpt-5.2-high
Openai
75.6
7.1K
400K
¥12.6 / ¥101Input/Output
89
qwen3-235b-a22b-instruct-2507
Alibaba
75.4
13.3K
128K
¥2.09 / ¥8.23Input/Output
90
kimi-k2-thinking-turbo
Moonshot
75.1
9K
262K
¥17.3 / ¥72Input/Output
91
claude-opus-4-20250514
Anthropic
74.8
5.5K
200K
¥108 / ¥540Input/Output
92
gpt-5.2
Openai
74.5
7.2K
400K
¥12.6 / ¥101Input/Output
93
grok-4-fast-reasoning
Xai
74.2
2.5K
2M
¥1.44 / ¥3.6Input/Output
94
gpt-5.4-mini-high
Openai
73.9
4.2K
400K
¥5.4 / ¥32.4Input/Output
95
qwen3-vl-235b-a22b-instruct
Alibaba
73.7
1.4K
128K
¥2.16 / ¥8.64Input/Output
96
mimo-v2-omni
Xiaomi
73.4
470
262K
¥2.88 / ¥14.4Input/Output
97
qwen3.5-122b-a10b
Alibaba
73.1
4.1K
262K
¥2.88 / ¥23Input/Output
98
claude-haiku-4-5-20251001
Anthropic
72.8
11.5K
200K
¥7.2 / ¥36Input/Output
99
gpt-5-chat
Openai
72.5
4K
400K
¥9 / ¥72Input/Output
100
gemini-2.5-flash-lite-preview-06-17-thinking
Google
72.3
4.2K
65.5K
¥0.72 / ¥2.88Input/Output
101
gpt-5-high
Openai
72.0
4.3K
400K
¥9 / ¥72Input/Output
102
amazon-nova-experimental-chat-26-02-10
Amazon
71.7
457
-
-
103
deepseek-v3-0324
Deepseek
71.4
6K
75K
¥1.44 / ¥5.76Input/Output
104
gpt-4.1-2025-04-14
Openai
71.1
6.7K
1.05M
¥14.4 / ¥57.6Input/Output
105
minimax-m2.1-preview
Minimax
70.9
2.7K
205K
¥0 / ¥0Input/Output
106
qwen3.5-27b
Alibaba
70.6
3.9K
262K
¥2.16 / ¥17.3Input/Output
107
o3-2025-04-16
Openai
70.3
7.7K
200K
¥14.4 / ¥57.6Input/Output
108
gemini-2.5-flash-lite-preview-09-2025-no-thinking
Google
70.0
6.5K
1.05M
¥0.72 / ¥2.88Input/Output
109
hunyuan-turbos-20250416
Tencent
69.7
1.5K
131K
¥0 / ¥0Input/Output
110
qwen3-235b-a22b-no-thinking
Alibaba
69.5
4.6K
131K
¥2.07 / ¥8.26Input/Output
111
step-3.5-flash
Stepfun
69.2
5.2K
256K
¥0.69 / ¥2.07Input/Output
112
gpt-5.3-chat-latest
Openai
68.9
4.8K
128K
¥12.6 / ¥101Input/Output
113
hunyuan-hy3-preview
Tencent
68.6
890
256K
¥0 / ¥0Input/Output
114
deepseek-r1
Deepseek
68.3
3.3K
164K
¥5.04 / ¥18Input/Output
115
amazon-nova-experimental-chat-12-10
Amazon
68.1
581
-
-
116
amazon-nova-experimental-chat-11-10
Amazon
67.8
3.8K
-
-
117
kimi-k2-0905-preview
Moonshot
67.5
1.5K
262K
¥4.32 / ¥18Input/Output
118
qwen3.5-flash
Alibaba
67.2
4.3K
1M
¥1.24 / ¥12.4Input/Output
119
mimo-v2-flash (thinking)
Xiaomi
66.9
1.7K
262K
¥0.72 / ¥2.16Input/Output
120
qwen3.5-35b-a3b
Alibaba
66.7
4.3K
262K
¥1.8 / ¥14.4Input/Output
121
o1-2024-12-17
Openai
66.4
4.6K
128K
¥108 / ¥432Input/Output
122
longcat-flash-chat
Meituan
66.1
1.4K
128K
¥1.08 / ¥10.8Input/Output
123
claude-sonnet-4-20250514-thinking-32k
Anthropic
65.8
4.3K
200K
¥21.6 / ¥108Input/Output
124
gemma-3-27b-it
Google
65.5
6.4K
128K
¥2.15 / ¥2.15Input/Output
125
qwen3-vl-235b-a22b-thinking
Alibaba
65.3
1K
131K
¥2.06 / ¥8.26Input/Output
126
mistral-medium-2505
Mistral
65.0
4.1K
262K
¥2.88 / ¥14.4Input/Output
127
glm-4.5-air
Zai
64.7
4K
131K
¥0 / ¥0Input/Output
128
minimax-m2.7
Minimax
64.4
3.3K
205K
¥0 / ¥0Input/Output
129
glm-4.6v
Zai
64.1
420
128K
¥2.16 / ¥6.48Input/Output
130
gemini-2.0-flash-001
Google
63.9
6.3K
1.05M
¥1.08 / ¥4.32Input/Output
131
grok-3-mini-beta
Xai
63.6
2.8K
1M
¥9 / ¥18Input/Output
132
qwen2.5-max
Alibaba
63.3
5K
32K
¥11.5 / ¥46Input/Output
133
claude-sonnet-4-20250514
Anthropic
63.0
5K
200K
¥21.6 / ¥108Input/Output
134
amazon-nova-experimental-chat-26-01-10
Amazon
62.7
551
-
-
135
qwen3-coder-480b-a35b-instruct
Alibaba
62.5
3.3K
262K
¥6.2 / ¥24.8Input/Output
136
qwen3-next-80b-a3b-instruct
Alibaba
62.2
3K
131K
¥1.04 / ¥4.13Input/Output
137
gemini-1.5-pro-002
Google
61.9
8.1K
-
-
138
minimax-m2.5
Minimax
61.6
5.7K
205K
¥0 / ¥0Input/Output
139
claude-3-7-sonnet-20250219-thinking-32k
Anthropic
61.3
5.2K
-
-
140
gemma-3-12b-it
Google
61.1
600
128K
¥1.96 / ¥1.96Input/Output
141
grok-3-mini-high
Xai
60.8
2.1K
128K
¥0 / ¥0Input/Output
142
deepseek-v3
Deepseek
60.5
3.6K
128K
¥0 / ¥0Input/Output
143
step-2-16k-exp-202412
Stepfun
60.2
751
16.4K
¥37.5 / ¥118Input/Output
144
gpt-5-mini-high
Openai
59.9
3.6K
400K
¥1.8 / ¥14.4Input/Output
145
kimi-k2-0711-preview
Moonshot
59.7
3.4K
131K
¥4.32 / ¥18Input/Output
146
command-a-03-2025
Cohere
59.4
7.6K
256K
¥18 / ¥72Input/Output
147
gemini-2.0-flash-lite-preview-02-05
Google
59.1
4.1K
1.05M
¥0.54 / ¥2.16Input/Output
148
qwen3-30b-a3b-instruct-2507
Alibaba
58.8
3K
262K
¥2.16 / ¥3.6Input/Output
149
trinity-large-preview
-
58.5
4.4K
262K
¥1.8 / ¥6.48Input/Output
150
intellect-3
-
58.3
800
131K
¥1.44 / ¥7.92Input/Output
151
o1-preview
Openai
58.0
4.5K
128K
¥108 / ¥432Input/Output
152
step-3
Stepfun
57.7
786
65.5K
¥1.8 / ¥4.68Input/Output
153
amazon-nova-experimental-chat-10-20
Amazon
57.4
1.5K
-
-
154
claude-3-7-sonnet-20250219
Anthropic
57.1
6K
200K
¥21.6 / ¥108Input/Output
155
nvidia-nemotron-3-super-120b-a12b
Nvidia
56.9
1.1K
262K
¥1.44 / ¥5.76Input/Output
156
qwen3-235b-a22b
Alibaba
56.6
3.6K
131K
¥2.07 / ¥8.26Input/Output
157
gpt-5.4-nano-high
Openai
56.3
4K
400K
¥1.44 / ¥9Input/Output
158
llama-3.1-nemotron-ultra-253b-v1
Nvidia
56.0
444
128K
¥4.32 / ¥13Input/Output
159
trinity-large-thinking
-
55.7
3.8K
262K
¥1.8 / ¥6.48Input/Output
160
qwen3-next-80b-a3b-thinking
Alibaba
55.5
1.8K
131K
¥1.04 / ¥10.3Input/Output
161
glm-4-plus-0111
Zai
55.2
918
128K
¥72 / ¥72Input/Output
162
nvidia-llama-3.3-nemotron-super-49b-v1.5
Nvidia
54.9
367
131K
¥2.88 / ¥2.88Input/Output
163
step-1o-turbo-202506
Stepfun
54.6
997
-
-
164
amazon-nova-experimental-chat-10-09
Amazon
54.3
322
-
-
165
mistral-small-2506
Mistral
54.1
2.2K
262K
¥2.88 / ¥14.4Input/Output
166
glm-4.7-flash
Zai
53.8
1.8K
200K
¥0 / ¥0Input/Output
167
gpt-4.1-mini-2025-04-14
Openai
53.5
5.2K
1.05M
¥2.88 / ¥11.5Input/Output
168
qwen3-32b
Alibaba
53.2
616
131K
¥2.07 / ¥8.26Input/Output
169
minimax-m1
Minimax
52.9
4.5K
1M
¥0.95 / ¥9.03Input/Output
170
o4-mini-2025-04-16
Openai
52.7
5.8K
200K
¥7.92 / ¥31.7Input/Output
171
glm-4.5v
Zai
52.4
640
64K
¥4.32 / ¥13Input/Output
172
mercury-2
Inception Ai
52.1
530
128K
¥1.8 / ¥5.4Input/Output
173
nova-2-lite
Amazon
51.8
1.7K
128K
¥2.38 / ¥19.8Input/Output
174
qwen-plus-0125
Alibaba
51.5
931
1M
¥0.83 / ¥2.07Input/Output
175
gpt-4o-2024-05-13
Openai
51.3
16.6K
128K
¥36 / ¥108Input/Output
176
claude-3-5-sonnet-20241022
Anthropic
51.0
11.8K
200K
¥21.6 / ¥108Input/Output
177
minimax-m2
Minimax
50.7
899
197K
¥0 / ¥0Input/Output
178
qwq-32b
Alibaba
50.4
3.7K
131K
¥2.07 / ¥6.2Input/Output
179
llama-3.3-nemotron-49b-super-v1
Nvidia
50.1
428
131K
¥0 / ¥0Input/Output
180
ling-flash-2.0
Ant Group
49.9
842
131K
¥1.01 / ¥4.1Input/Output
181
gemma-3n-e4b-it
Google
49.6
2.9K
128K
¥0 / ¥0Input/Output
182
o3-mini-high
Openai
49.3
3.1K
200K
¥7.92 / ¥31.7Input/Output
183
gemini-1.5-flash-002
Google
49.0
5K
2M
¥0.54 / ¥2.2Input/Output
184
deepseek-v2.5-1210
Deepseek
48.7
1.1K
1M
¥1.01 / ¥2.02Input/Output
185
grok-2-2024-08-13
Xai
48.5
9.3K
1M
¥9 / ¥18Input/Output
186
gemini-advanced-0514
Google
48.2
7K
-
-
187
yi-lightning
-
47.9
3.8K
12K
¥1.44 / ¥1.44Input/Output
188
nvidia-nemotron-3-nano-30b-a3b-bf16
Nvidia
47.6
2.3K
131K
¥0 / ¥0Input/Output
189
gpt-4o-2024-08-06
Openai
47.3
6.8K
128K
¥18 / ¥72Input/Output
190
gpt-oss-120b
Openai
47.1
4K
131K
¥1.08 / ¥4.32Input/Output
191
ring-flash-2.0
Ant Group
46.8
888
131K
¥1.01 / ¥4.1Input/Output
192
o3-mini
Openai
46.5
8.2K
200K
¥7.92 / ¥31.7Input/Output
193
gemini-1.5-pro-001
Google
46.2
11.8K
-
-
194
qwen3-30b-a3b
Alibaba
45.9
3.4K
128K
¥0.79 / ¥7.78Input/Output
195
gemma-3-4b-it
Google
45.7
666
128K
¥1.44 / ¥1.44Input/Output
196
hunyuan-turbos-20250226
Tencent
45.4
424
131K
¥0 / ¥0Input/Output
197
hunyuan-turbo-0110
Tencent
45.1
451
-
-
198
llama-3.1-nemotron-70b-instruct
Nvidia
44.8
1K
128K
¥0 / ¥0Input/Output
199
gpt-4-turbo-2024-04-09
Openai
44.5
14.4K
128K
¥72 / ¥216Input/Output
200
gpt-4o-mini-2024-07-18
Openai
44.3
10.5K
128K
¥1.08 / ¥4.32Input/Output
201
llama-4-maverick-17b-128e-instruct
Meta
44.0
5.2K
1M
¥1.8 / ¥6.26Input/Output
202
hunyuan-large-2025-02-10
Tencent
43.7
568
-
-
203
glm-4-plus
Zai
43.4
3.8K
128K
¥54 / ¥54Input/Output
204
olmo-3.1-32b-instruct
Allenai
43.1
1.7K
200K
¥14.4 / ¥57.6Input/Output
205
llama-3.1-405b-instruct-fp8
Meta
42.9
8.9K
128K
¥0 / ¥0Input/Output
206
qwen2.5-plus-1127
Alibaba
42.6
1.6K
-
-
207
gpt-4.1-nano-2025-04-14
Openai
42.3
1K
1.05M
¥14.4 / ¥57.6Input/Output
208
llama-3.1-405b-instruct-bf16
Meta
42.0
6.5K
128K
¥0 / ¥0Input/Output
209
granite-4.1-8b
Ibm
41.7
571
131K
¥0.36 / ¥0.72Input/Output
210
olmo-3-32b-think
Allenai
41.5
867
128K
¥2.16 / ¥3.24Input/Output
211
hunyuan-large-vision
Tencent
41.2
629
-
-
212
mistral-small-3.1-24b-instruct-2503
Mistral
40.9
4.2K
262K
¥2.88 / ¥14.4Input/Output
213
llama-3.3-70b-instruct
Meta
40.6
8.1K
128K
¥0 / ¥0Input/Output
214
llama-4-scout-17b-16e-instruct
Meta
40.3
3.8K
128K
¥1.44 / ¥5.62Input/Output
215
gpt-5-nano-high
Openai
40.1
915
400K
¥0.36 / ¥2.88Input/Output
216
qwen-max-0919
Alibaba
39.8
2.4K
131K
¥2.48 / ¥9.91Input/Output
217
magistral-medium-2506
Mistral
39.5
1.4K
128K
¥14.4 / ¥36Input/Output
218
gpt-4-1106-preview
Openai
39.2
15.6K
8.19K
¥216 / ¥432Input/Output
219
o1-mini
Openai
38.9
7.9K
128K
¥7.92 / ¥31.7Input/Output
220
mistral-large-2407
Mistral
38.7
6.7K
131K
¥14.4 / ¥43.2Input/Output
221
hunyuan-standard-2025-02-10
Tencent
38.4
590
-
-
222
mistral-large-2411
Mistral
38.1
4.4K
128K
¥14.4 / ¥43.2Input/Output
223
grok-2-mini-2024-08-13
Xai
37.8
7.9K
1M
¥9 / ¥18Input/Output
224
gemma-2-27b-it
Google
37.5
11.4K
8.19K
¥0.58 / ¥0.58Input/Output
225
gemma-2-9b-it-simpo
-
37.3
1.7K
8.19K
¥1.44 / ¥1.44Input/Output
226
athene-70b-0725
-
37.0
3.1K
-
-
227
claude-3-5-sonnet-20240620
Anthropic
36.7
12.4K
200K
¥21.6 / ¥108Input/Output
228
deepseek-v2.5
Deepseek
36.4
3.5K
1M
¥1.01 / ¥2.02Input/Output
229
reka-core-20240904
-
36.1
1K
-
-
230
claude-3-opus-20240229
Anthropic
35.9
28.4K
200K
¥108 / ¥540Input/Output
231
command-r-plus-08-2024
Cohere
35.6
1.5K
128K
¥18 / ¥72Input/Output
232
athene-v2-chat
-
35.3
3.7K
-
-
233
gpt-4-0125-preview
Openai
35.0
13.9K
8.19K
¥216 / ¥432Input/Output
234
claude-3-5-haiku-20241022
Anthropic
34.7
9.9K
200K
¥5.76 / ¥28.8Input/Output
235
llama-3.1-70b-instruct
Meta
34.5
8.3K
131K
¥2.88 / ¥2.88Input/Output
236
llama-3.1-tulu-3-70b
Allenai
34.2
441
-
-
237
olmo-3.1-32b-think
Allenai
33.9
1.4K
200K
¥14.4 / ¥57.6Input/Output
238
gemini-1.5-flash-001
Google
33.6
9K
2M
¥0.54 / ¥2.2Input/Output
239
qwen2.5-72b-instruct
Alibaba
33.3
5.7K
131K
¥4.13 / ¥12.4Input/Output
240
gemini-1.5-flash-8b-001
Google
33.1
5K
2M
¥0.54 / ¥2.2Input/Output
241
llama-3.1-nemotron-51b-instruct
Nvidia
32.8
482
128K
¥0 / ¥0Input/Output
242
amazon-nova-pro-v1.0
Amazon
32.5
3.9K
300K
¥5.76 / ¥23Input/Output
243
jamba-1.5-large
-
32.2
1.4K
256K
¥0 / ¥0Input/Output
244
ibm-granite-h-small
Ibm
31.9
715
-
-
245
llama-3-70b-instruct
Meta
31.7
22.8K
8.19K
¥3.67 / ¥5.33Input/Output
246
gemma-2-9b-it
Google
31.4
8.3K
8.19K
¥1.44 / ¥1.44Input/Output
247
nemotron-4-340b-instruct
Nvidia
31.1
3.2K
-
-
248
reka-flash-20240904
-
30.8
1K
65.5K
¥0.72 / ¥1.44Input/Output
249
command-r-plus
Cohere
30.5
11K
128K
¥18 / ¥72Input/Output
250
c4ai-aya-expanse-32b
Cohere
30.3
3.9K
-
-
251
gpt-oss-20b
Openai
30.0
1.2K
131K
¥0.32 / ¥1.3Input/Output
252
olmo-2-0325-32b-instruct
Allenai
29.7
567
-
-
253
glm-4-0520
Zai
29.4
1.5K
128K
¥108 / ¥108Input/Output
254
amazon-nova-lite-v1.0
Amazon
29.1
3K
300K
¥0.43 / ¥1.73Input/Output
255
mistral-small-24b-instruct-2501
Mistral
28.9
2.4K
262K
¥2.88 / ¥14.4Input/Output
256
gpt-4-0613
Openai
28.6
13.9K
8.19K
¥216 / ¥432Input/Output
257
gpt-4-0314
Openai
28.3
8.2K
8.19K
¥216 / ¥432Input/Output
258
mercury
Inception Ai
28.0
298
128K
¥1.8 / ¥5.4Input/Output
259
claude-3-sonnet-20240229
Anthropic
27.7
15.6K
200K
¥21.6 / ¥108Input/Output
260
phi-4
Microsoft
27.5
4.1K
128K
¥0.9 / ¥3.6Input/Output
261
llama-3.1-tulu-3-8b
Allenai
27.2
472
-
-
262
qwen2-72b-instruct
Alibaba
26.9
5.5K
131K
¥4.13 / ¥12.4Input/Output
263
ministral-8b-2410
Mistral
26.6
678
128K
¥0.72 / ¥0.72Input/Output
264
qwen2.5-coder-32b-instruct
Alibaba
26.3
836
131K
¥2.07 / ¥6.2Input/Output
265
amazon-nova-micro-v1.0
Amazon
26.1
3.1K
128K
¥0.25 / ¥1.01Input/Output
266
command-r-08-2024
Cohere
25.8
1.4K
128K
¥18 / ¥72Input/Output
267
c4ai-aya-expanse-8b
Cohere
25.5
1.5K
-
-
268
hunyuan-standard-256k
Tencent
25.2
357
-
-
269
jamba-1.5-mini
-
24.9
1.4K
256K
¥0 / ¥0Input/Output
270
mistral-large-2402
Mistral
24.6
9.1K
262K
¥2.88 / ¥14.4Input/Output
271
mistral-medium
Mistral
24.4
5.5K
262K
¥2.88 / ¥14.4Input/Output
272
claude-3-haiku-20240307
Anthropic
24.1
16.7K
200K
¥1.8 / ¥9Input/Output
273
llama-3.1-8b-instruct
Meta
23.8
7.3K
131K
¥0.79 / ¥0.79Input/Output
274
command-r
Cohere
23.5
7.8K
128K
¥18 / ¥72Input/Output
275
llama-3-8b-instruct
Meta
23.2
15.4K
8.19K
¥0.29 / ¥0.29Input/Output
276
wizardlm-70b
Microsoft
23.0
1.8K
-
-
277
qwen1.5-110b-chat
Alibaba
22.7
3.7K
-
-
278
gemma-2-2b-it
Google
22.4
7.1K
128K
¥0 / ¥0Input/Output
279
mixtral-8x22b-instruct-v0.1
Mistral
22.1
7.4K
64K
¥14.4 / ¥43.2Input/Output
280
qwen1.5-72b-chat
Alibaba
21.8
5.9K
-
-
281
yi-1.5-34b-chat
-
21.6
3.6K
-
-
282
qwq-32b-preview
Alibaba
21.3
454
131K
¥2.07 / ¥6.2Input/Output
283
reka-flash-21b-20240226-online
-
21.0
2.1K
-
-
284
gemini-pro-dev-api
Google
20.7
3K
1.05M
¥14.4 / ¥86.4Input/Output
285
granite-3.1-8b-instruct
Ibm
20.4
450
-
-
286
openchat-3.5
-
20.2
1.5K
-
-
287
vicuna-33b
-
19.9
4K
-
-
288
reka-flash-21b-20240226
-
19.6
3.5K
-
-
289
internlm2_5-20b-chat
-
19.3
1.4K
-
-
290
deepseek-coder-v2
Deepseek
19.0
2.3K
1M
¥1.01 / ¥2.02Input/Output
291
granite-3.1-2b-instruct
Ibm
18.8
442
-
-
292
zephyr-orpo-141b-A35b-v0.1
-
18.5
635
200K
¥108 / ¥432Input/Output
293
mixtral-8x7b-instruct-v0.1
Mistral
18.2
11.2K
32K
¥5.04 / ¥5.04Input/Output
294
yi-34b-chat
-
17.9
2.4K
-
-
295
phi-3-medium-4k-instruct
Microsoft
17.6
3.7K
4.1K
¥1.22 / ¥4.9Input/Output
296
tulu-2-dpo-70b
-
17.4
1.1K
-
-
297
dbrx-instruct-preview
-
17.1
4.8K
-
-
298
gemini-pro
Google
16.8
953
1.05M
¥14.4 / ¥86.4Input/Output
299
zephyr-7b-beta
-
16.5
2K
-
-
300
nous-hermes-2-mixtral-8x7b-dpo
-
16.2
616
1M
¥36 / ¥180Input/Output
301
starling-lm-7b-alpha
-
16.0
1.7K
200K
¥5.4 / ¥18.7Input/Output
302
openhermes-2.5-mistral-7b
-
15.7
811
1M
¥36 / ¥180Input/Output
303
starling-lm-7b-beta
-
15.4
2.4K
200K
¥5.4 / ¥18.7Input/Output
304
llama-3.2-3b-instruct
Meta
15.1
1.1K
131K
¥0.22 / ¥0.35Input/Output
305
openchat-3.5-0106
-
14.8
1.9K
-
-
306
gpt-3.5-turbo-0125
Openai
14.6
9.7K
16.4K
¥3.6 / ¥10.8Input/Output
307
qwen1.5-14b-chat
Alibaba
14.3
2.4K
-
-
308
llama2-70b-steerlm-chat
Nvidia
14.0
604
-
-
309
wizardlm-13b
Microsoft
13.7
1.4K
-
-
310
solar-10.7b-instruct-v1.0
-
13.4
698
128K
¥0 / ¥0Input/Output
311
falcon-180b-chat
-
13.2
270
-
-
312
dolphin-2.2.1-mistral-7b
-
12.9
291
262K
¥2.88 / ¥14.4Input/Output
313
qwen1.5-32b-chat
Alibaba
12.6
3K
-
-
314
phi-3-small-8k-instruct
Microsoft
12.3
2.4K
8.19K
¥1.08 / ¥4.32Input/Output
315
snowflake-arctic-instruct
-
12.0
4.8K
-
-
316
guanaco-33b
-
11.8
515
200K
¥14.4 / ¥57.6Input/Output
317
llama-2-70b-chat
Meta
11.5
6.4K
-
-
318
mpt-30b-chat
-
11.2
452
-
-
319
granite-3.0-8b-instruct
Ibm
10.9
895
-
-
320
zephyr-7b-alpha
-
10.6
399
-
-
321
mistral-7b-instruct-v0.2
Mistral
10.4
3K
262K
¥2.88 / ¥14.4Input/Output
322
deepseek-llm-67b-chat
Deepseek
10.1
789
1M
¥1.01 / ¥2.02Input/Output
323
gemma-1.1-7b-it
Google
9.8
3.4K
-
-
324
vicuna-13b
-
9.5
3.3K
-
-
325
llama-2-13b-chat
Meta
9.2
3.3K
-
-
326
granite-3.0-2b-instruct
Ibm
9.0
916
-
-
327
phi-3-mini-4k-instruct-june-2024
Microsoft
8.7
2K
4.1K
¥0.94 / ¥3.74Input/Output
328
phi-3-mini-4k-instruct
Microsoft
8.4
2.8K
4.1K
¥0.94 / ¥3.74Input/Output
329
gpt-3.5-turbo-1106
Openai
8.1
2.8K
16.4K
¥7.2 / ¥14.4Input/Output
330
qwen1.5-7b-chat
Alibaba
7.8
701
-
-
331
llama-2-7b-chat
Meta
7.6
2.3K
128K
¥4.03 / ¥48Input/Output
332
llama-3.2-1b-instruct
Meta
7.3
1.1K
16.4K
¥0.07 / ¥0.08Input/Output
333
codellama-34b-instruct
Meta
7.0
1.5K
-
-
334
smollm2-1.7b-instruct
-
6.7
341
-
-
335
stripedhyena-nous-7b
-
6.4
869
-
-
336
mistral-7b-instruct
Mistral
6.2
1.7K
262K
¥2.88 / ¥14.4Input/Output
337
qwen-14b-chat
Alibaba
5.9
898
32.8K
¥1.04 / ¥3.1Input/Output
338
phi-3-mini-128k-instruct
Microsoft
5.6
2.9K
128K
¥0.94 / ¥3.74Input/Output
339
gemma-7b-it
Google
5.3
1.3K
-
-
340
vicuna-7b
-
5.0
1.3K
-
-
341
gemma-1.1-2b-it
Google
4.8
1.5K
-
-
342
palm-2
Google
4.5
1.6K
-
-
343
olmo-7b-instruct
Allenai
4.2
1K
-
-
344
gemma-2b-it
Google
3.9
696
-
-
345
koala-13b
-
3.6
1.2K
-
-
346
gpt4all-13b-snoozy
-
3.4
250
1M
¥36 / ¥216Input/Output
347
chatglm3-6b
-
3.1
816
200K
¥5.4 / ¥18.7Input/Output
348
qwen1.5-4b-chat
Alibaba
2.8
1.1K
-
-
349
alpaca-13b
-
2.5
954
-
-
350
mpt-7b-chat
-
2.2
683
-
-
351
chatglm2-6b
-
2.0
566
200K
¥5.4 / ¥18.7Input/Output
352
RWKV-4-Raven-14B
-
1.7
874
-
-
353
oasst-pythia-12b
-
1.4
1.1K
-
-
354
fastchat-t5-3b
-
1.1
744
-
-
355
chatglm-6b
-
0.8
771
200K
¥5.4 / ¥18.7Input/Output
356
stablelm-tuned-alpha-7b
-
0.6
536
-
-
357
dolly-v2-12b
-
0.3
565
-
-
358
llama-13b
Meta
0.0
431
-
-
Top model analysis

claude-opus-4-6-thinking why it ranks first

claude-opus-4-6-thinking ranks first with a percent score of 100.0 and 5.5K samples. Use it as the first option for this leaderboard, then compare price, context and availability.

How to choose

Do not only look at rank #1

Start with the leaderboard closest to your task. Compare the top models by score and sample size, then check price, context length, open or closed access, and provider availability.

FAQ

FAQ

创意写作排行榜看什么指标?

主要看排名、百分制分数、样本量和来源。分数用于快速比较同一榜单内模型表现,样本量用于判断结果稳定性。

为什么不同榜单不能直接混合成总分?

不同榜单的任务、样本和评测口径不同,模力榜默认只在同一榜单内排序,避免把写作、代码、图像等能力强行合并。

创意写作模型应该怎么选?

优先看与你任务最接近的榜单,再结合价格、上下文长度、开源闭源和厂商可用性。排名靠前不代表适合所有预算和部署方式。

榜单多久更新?

页面展示的是最新成功采集的公开榜单数据。当前优先使用 LMArena leaderboard dataset,并在页面来源中保留原始链接。