News

Bench LM
benchlm. ai > benchmarks > swe Verified

SWE-bench Verified Benchmark 2026: 44 LLM scores

3+ week, 4+ day ago  (267+ words) As of May 1, 2026, Claude Mythos Preview leads the SWE-bench Verified leaderboard with 93. 9%, followed by Claude Opus 4. 7 (Adaptive) (87. 6%) and GPT-5. 3 Codex (85%). Claude Opus 4. 7 (Adaptive) According to Bench LM. ai, Claude Mythos Preview leads the SWE-bench Verified benchmark with a score of 93. 9%, followed…...