January 2026 LM Ranking
Rank  Model                               Score
1     gemini-3-pro                        1490
2     grok-4.1-thinking                   1477
3     gemini-3-flash                      1471
4     claude-opus-4-5-thinking-32k        1469
5     grok-4.1                            1466
6     claude-opus-4-5                     1465
7     gemini-3-flash (thinking-minimal)   1464
8     gpt-5.1-high                        1457
9     gemini-2.5-pro                      1450
10    claude-sonnet-4-5-thinking-32k      1450
11    claude-opus-4-1-thinking-16k        1448
12    claude-sonnet-4-5                   1448
13    ernie-5.0                           1446
14    gpt-4.5                             1443
15    claude-opus-4-1                     1443
16    glm-4.7                             1443
17    chatgpt-4o-latest                   1442
18    gpt-5.2                             1440
19    gpt-5.2-high                        1440
20    gpt-5-high                          1436

The LMArena ranking is a crowdsourced leaderboard for large language models. Users chat with two anonymous models and vote for the better response, and model ratings are computed from these votes using the Elo rating system. The leaderboard covers multiple capability dimensions, including text, vision, and code, and is widely regarded as one of the more authoritative LLM evaluation benchmarks. The ranking shown here is based on the LMArena data, with model names aggregated and cleaned.
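
To make the rating mechanism concrete, here is a minimal sketch of a textbook Elo update for a single pairwise vote. The function names, the K-factor of 32, and the 400-point scale are illustrative assumptions taken from the classic Elo formulation, not confirmed LMArena settings; the leaderboard's actual fitting procedure may differ.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that A beats B under the standard Elo logistic curve."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_elo(rating_a: float, rating_b: float, score_a: float, k: float = 32.0):
    """Return updated (rating_a, rating_b) after one vote.

    score_a is 1.0 if A's response won, 0.0 if B's won, 0.5 for a tie.
    k (assumed value) controls how far a single vote moves the ratings.
    """
    exp_a = expected_score(rating_a, rating_b)
    delta = k * (score_a - exp_a)
    # Whatever A gains, B loses, so the sum of ratings is unchanged.
    return rating_a + delta, rating_b - delta


if __name__ == "__main__":
    # Two models rated like the top two rows of the table above.
    print(expected_score(1490, 1477))
    print(update_elo(1490, 1477, score_a=0.0))
```

With ratings of 1490 and 1477, the higher-rated model is expected to win only slightly more than half the time (about 0.52), which illustrates how small the gaps between the top entries are in terms of head-to-head preference.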