WebDev Arena Leaderboard
WebDev Arena is a real-time AI coding competition where models go head-to-head in web development challenges, developed by LMArena
Leaderboard
OpenAI
Arena Score
1477.51
License
Proprietary
95% CI
+7.34 / -8.86
Votes
5,848
Arena Score
1472.43
License
Proprietary
95% CI
+10.07 / -9.85
Votes
5,312
Anthropic
Arena Score
1462.29
License
Proprietary
95% CI
+9.68 / -7.74
Votes
5,582
Anthropic
Arena Score
1420.81
License
Proprietary
95% CI
+16.95 / -19.17
Votes
1,337
Arena Score
1400.96
License
Proprietary
95% CI
+6.85 / -7.07
Votes
11,022
ZAI
Arena Score
1397.82
License
MIT
95% CI
+11.05 / -8.59
Votes
5,442
DeepSeek
Arena Score
1393.71
License
MIT
95% CI
+11.71 / -8.20
Votes
4,800
Anthropic
Arena Score
1385.18
License
Proprietary
95% CI
+9.83 / -11.39
Votes
4,127
Anthropic
Arena Score
1383.60
License
Proprietary
95% CI
+8.34 / -7.40
Votes
9,238
ZAI
Arena Score
1381.11
License
MIT
95% CI
+10.08 / -9.12
Votes
4,360
ZAI
Arena Score
1368.63
License
MIT
95% CI
+14.14 / -14.58
Votes
1,425
Alibaba
Arena Score
1366.55
License
Apache 2.0
95% CI
+5.56 / -5.93
Votes
9,741
DeepSeek
Arena Score
1363.46
License
DeepSeek
95% CI
+13.61 / -18.25
Votes
1,459
Anthropic
Arena Score
1363.32
License
Proprietary
95% CI
+7.92 / -5.77
Votes
11,526
Anthropic
Arena Score
1358.43
License
Proprietary
95% CI
+8.86 / -7.25
Votes
7,460
Arena Score
1350.82
License
Apache 2.0
95% CI
+19.54 / -21.52
Votes
992
DeepSeek
Arena Score
1341.48
License
DeepSeek
95% CI
+13.69 / -16.09
Votes
1,304
Anthropic
Arena Score
1337.97
License
Proprietary
95% CI
+14.61 / -15.02
Votes
1,832
Alibaba
Arena Score
1335.48
License
Proprietary
95% CI
+8.72 / -8.47
Votes
3,972
Moonshot
Arena Score
1316.75
License
Modified MIT
95% CI
+7.19 / -7.38
Votes
7,027
Arena Score
1288.64
License
Proprietary
95% CI
+6.74 / -5.17
Votes
11,543
OpenAI
Arena Score
1253.06
License
Proprietary
95% CI
+5.99 / -6.70
Votes
11,506
Anthropic
Arena Score
1238.17
License
Proprietary
95% CI
+5.25 / -6.08
Votes
26,267
DeepSeek
Arena Score
1207.92
License
MIT
95% CI
+19.12 / -19.43
Votes
1,094
DeepSeek
Arena Score
1199.36
License
MIT
95% CI
+9.58 / -10.20
Votes
3,755
OpenAI
Arena Score
1193.06
License
Proprietary
95% CI
+6.51 / -7.62
Votes
9,064
Alibaba
Arena Score
1189.52
License
Apache 2.0
95% CI
+8.49 / -8.41
Votes
5,600
OpenAI
Arena Score
1186.32
License
Proprietary
95% CI
+8.05 / -6.78
Votes
5,572
Mistral
Arena Score
1181.14
License
Proprietary
95% CI
+5.90 / -9.02
Votes
7,511
xAI
Arena Score
1174.62
License
Proprietary
95% CI
+7.25 / -7.91
Votes
7,685
Arena Score
1152.04
License
Proprietary
95% CI
+11.83 / -11.58
Votes
4,745
Arena Score
1143.36
License
Proprietary
95% CI
+11.05 / -7.51
Votes
5,764
OpenAI
Arena Score
1136.75
License
Proprietary
95% CI
+8.88 / -11.34
Votes
2,979
Anthropic
Arena Score
1133.41
License
Proprietary
95% CI
+5.27 / -4.79
Votes
22,213
MiniMax
Arena Score
1129.61
License
MIT
95% CI
+10.52 / -10.32
Votes
3,361
OpenAI
Arena Score
1117.95
License
Proprietary
95% CI
+7.94 / -7.56
Votes
8,850
OpenAI
Arena Score
1095.78
License
Apache 2.0
95% CI
+32.81 / -29.22
Votes
759
OpenAI
Arena Score
1092.18
License
Proprietary
95% CI
+8.11 / -8.07
Votes
6,369
Arena Score
1089.75
License
Proprietary
95% CI
+7.25 / -7.55
Votes
11,859
OpenAI
Arena Score
1045.17
License
Proprietary
95% CI
+7.53 / -8.42
Votes
9,235
OpenAI
Arena Score
1042.59
License
Proprietary
95% CI
+7.90 / -6.17
Votes
13,688
Arena Score
1040.27
License
Proprietary
95% CI
+6.27 / -6.31
Votes
10,498
Arena Score
1029.74
License
Proprietary
95% CI
+17.98 / -16.61
Votes
1,058
Arena Score
1027.01
License
Llama 4
95% CI
+9.97 / -9.71
Votes
5,474
Arena Score
980.08
License
Proprietary
95% CI
+5.84 / -6.26
Votes
14,454
Alibaba
Arena Score
975.53
License
Proprietary
95% CI
+6.66 / -8.37
Votes
11,073
OpenAI
Arena Score
964.00
License
Proprietary
95% CI
+6.48 / -6.48
Votes
18,601
DeepSeek
Arena Score
959.80
License
DeepSeek
95% CI
+8.95 / -6.83
Votes
7,699
Alibaba
Arena Score
901.98
License
Apache 2.0
95% CI
+6.81 / -6.79
Votes
16,199
Arena Score
901.04
License
Llama 4
95% CI
+24.53 / -24.33
Votes
687
Arena Score
892.57
License
Proprietary
95% CI
+6.92 / -6.68
Votes
15,159
Arena Score
809.62
License
Llama 3.1
95% CI
+15.34 / -21.47
Votes
1,117
More Statistics for WebDev Arena (Overall)
Confidence Interval for Model Strength
Figure 1
Average Win Rate Against All Other Models (Assuming Uniform Sampling and No Ties)
Figure 2
Fraction of Model A Wins for All Non-tied A vs. B Battles
Figure 3
Battle Count for Each Combination of Models (without Ties)
Figure 4