Back to Leaderboard
Claude Haiku 4.5
AnthropicRank #4 of 8 models
85.0%
+0.0 vs avg
Coverage
88.8%+9.9 vs avg
Validity
81.2%-9.8 vs avg
Local Score
81.7%-2.9 vs avg
Cross-File
89.1%+3.4 vs avg
Score Distribution
Performance by Language
Category Comparison
Local Logic
81.7%
Cross-File
89.1%
Judge Analysis (Sonnet vs GPT)
Latency (p50 / p90 / p99)
8ms
p50
16.7s
p90
18.9s
p99
GLM-5
6ms
Gemini 2.5 Pro
8ms
Kimi K2.5
8ms
Claude Haiku 4.5
8ms
Gemini 3 Flash
8ms
Claude Sonnet 4.5
10ms
Gemini 3.1 Pro
21ms
GPT-5.2
19.3s
Pass Rate
26.7%
Parse Rate
26.7%
Tests
75
Errors
55