Image · Text-to-Image · Overall Leaderboard

Ranking for Text-to-Image / Overall, based on public preference data.

Selection guide

Overall model ranking guide

Ranking for Text-to-Image / Overall, based on public preference data.

gpt-image-2 (medium)gemini-3.1-flash-image-preview (nano-banana-2) [web-search]mai-image-2.5-previewgemini-3-pro-image-preview-2k (nano-banana-pro)gpt-image-1.5-high-fidelity
Current DirectoryImage · Text-to-Image · Overall
Models64
Published2026/05/26
Arena public preference evaluationOriginal leaderboard: Text-to-Image / OverallPublished: 2026/05/26Leaderboard dataset: LMArena latest parquetOpen Arena sourceOpen leaderboard dataset
1
gpt-image-2 (medium)
Openai
100.0
40.4K
-
¥36.4 / ¥233Input/Output
2
gemini-3.1-flash-image-preview (nano-banana-2) [web-search]
Google
98.4
70.9K
131K
¥3.6 / ¥21.6Input/Output
3
mai-image-2.5-preview
Microsoft Ai
96.8
5.3K
32K
¥36 / ¥338Input/Output
4
gemini-3-pro-image-preview-2k (nano-banana-pro)
Google
95.2
109.6K
1.05M
¥14.4 / ¥86.4Input/Output
5
gpt-image-1.5-high-fidelity
Openai
93.7
114.2K
1M
¥36 / ¥216Input/Output
6
gemini-3-pro-image-preview (nano-banana-pro)
Google
92.1
82.6K
1.05M
¥14.4 / ¥86.4Input/Output
7
grok-imagine-image-quality
Xai
90.5
19.4K
1M
¥9 / ¥18Input/Output
8
uni-1.1-max
Luma Ai
88.9
8.9K
-
-
9
mai-image-2
Microsoft Ai
87.3
48.3K
32K
¥36 / ¥338Input/Output
10
uni-1.1
Luma Ai
85.7
10K
-
-
11
grok-imagine-image
Xai
84.1
166.8K
1M
¥9 / ¥18Input/Output
12
recraft-v4.1-utility-pro
Recraft
82.5
2.5K
-
-
13
qwen-image-2.0-pro-2026-04-22
Alibaba
81.0
5.2K
8.19K
¥0 / ¥0Input/Output
14
flux-2-max
Bfl
79.4
111.5K
-
-
15
grok-imagine-image-pro
Xai
77.8
93K
1M
¥9 / ¥18Input/Output
16
reve-v1.5
Reve
76.2
14.9K
-
-
17
flux-2-pro
Bfl
74.6
147.1K
-
-
18
flux-2-flex
Bfl
73.0
143.2K
-
-
19
gemini-2.5-flash-image-preview (nano-banana)
Google
71.4
779.8K
1.05M
¥2.16 / ¥18Input/Output
20
hunyuan-image-3.0
Tencent
69.8
172.9K
-
-
21
flux-2-dev
Bfl
68.3
58K
-
-
22
imagen-ultra-4.0-generate-001
Google
66.7
389.6K
-
-
23
seedream-4.5
Bytedance
65.1
189K
-
-
24
seedream-4-2k
Bytedance
63.5
12.6K
-
-
25
wan2.6-t2i
Alibaba
61.9
145.8K
-
-
26
qwen-image-2512
Alibaba
60.3
70K
8.19K
¥3.6 / ¥14.4Input/Output
27
recraft-v4.1-pro
Recraft
58.7
2.7K
-
-
28
imagen-4.0-generate-001
Google
57.1
516.9K
-
-
29
seedream-5.0-lite
Bytedance
55.6
68.1K
-
-
30
hidream-o1-image
Hidream
54.0
5.1K
200K
¥108 / ¥432Input/Output
31
seedream-4-fal
Bytedance
52.4
11.9K
-
-
32
wan2.5-t2i-preview
Alibaba
50.8
194.4K
-
-
33
qwen-image-2.0-2026-03-03
Alibaba
49.2
53.6K
8.19K
¥0 / ¥0Input/Output
34
gpt-image-1
Openai
47.6
265.5K
-
¥36 / ¥288Input/Output
35
seedream-4-high-res-fal
Bytedance
46.0
169.3K
-
-
36
recraft-v4
Recraft
44.4
69.2K
-
-
37
gpt-image-1-mini
Openai
42.9
157.8K
1M
¥36 / ¥216Input/Output
38
wan2.7-image-pro
Alibaba
41.3
28.5K
8.19K
¥0 / ¥0Input/Output
39
wan2.7-image
Alibaba
39.7
28.8K
8.19K
¥0 / ¥0Input/Output
40
mai-image-1
Microsoft Ai
38.1
94.5K
32K
¥36 / ¥338Input/Output
41
z-image-turbo
Alibaba
36.5
18.3K
-
-
42
seedream-3
Bytedance
34.9
36.9K
-
-
43
flux-1-kontext-max
Bfl
33.3
65.9K
-
-
44
flux-2-klein-9b
Bfl
31.7
133.8K
-
-
45
qwen-image-prompt-extend
Alibaba
30.2
697.8K
8.19K
¥3.6 / ¥14.4Input/Output
46
flux-1-kontext-pro
Bfl
28.6
332.1K
-
-
47
imagen-3.0-generate-002
Google
27.0
361.4K
-
-
48
qwen-image
Alibaba
25.4
85.1K
8.19K
¥3.6 / ¥14.4Input/Output
49
ideogram-v3-quality
Ideogram
23.8
115.3K
-
-
50
photon
Luma Ai
22.2
127.6K
-
-
51
p-image
-
20.6
97.3K
-
-
52
flux-2-klein-4b
Bfl
19.0
135.7K
-
-
53
runway-gen4
Runway
17.5
46.6K
-
-
54
recraft-v3
Recraft
15.9
192K
-
-
55
flux-1.1-pro
Bfl
14.3
70.5K
-
-
56
lucid-origin
Leonardo Ai
12.7
286.6K
-
-
57
ideogram-v2
Ideogram
11.1
72.1K
-
-
58
glm-image
Zai
9.5
4.6K
200K
¥5.4 / ¥18.7Input/Output
59
gemini-2.0-flash-preview-image-generation
Google
7.9
258.2K
1.05M
¥1.08 / ¥4.32Input/Output
60
flux-1-dev-fp8
Bfl
6.3
49.2K
-
-
61
dall-e-3
Openai
4.8
240.1K
-
-
62
flux-1-kontext-dev
Bfl
3.2
216.3K
-
-
63
stable-diffusion-v35-large
-
1.6
23.4K
0.26K
¥0.58 / ¥0Input/Output
64
bagel
Bytedance
0.0
12.4K
-
-
Top model analysis

gpt-image-2 (medium) why it ranks first

gpt-image-2 (medium) ranks first with a percent score of 100.0 and 40.4K samples. Use it as the first option for this leaderboard, then compare price, context and availability.

How to choose

Do not only look at rank #1

Start with the leaderboard closest to your task. Compare the top models by score and sample size, then check price, context length, open or closed access, and provider availability.

FAQ

FAQ

总榜排行榜看什么指标?

主要看排名、百分制分数、样本量和来源。分数用于快速比较同一榜单内模型表现,样本量用于判断结果稳定性。

为什么不同榜单不能直接混合成总分?

不同榜单的任务、样本和评测口径不同,模力榜默认只在同一榜单内排序,避免把写作、代码、图像等能力强行合并。

总榜模型应该怎么选?

优先看与你任务最接近的榜单,再结合价格、上下文长度、开源闭源和厂商可用性。排名靠前不代表适合所有预算和部署方式。

榜单多久更新?

页面展示的是最新成功采集的公开榜单数据。当前优先使用 LMArena leaderboard dataset,并在页面来源中保留原始链接。