$ cat /top10/Thu Mar 12 2026 00:00:00 GMT+0000 (Coordinated Universal Time)/study-finds-many-swe-bench-passing-ai-prs-would-be-rejected-by-maintainers-55

5
METR
Thursday, March 12, 2026

Study Finds Many SWE-bench-Passing AI PRs Would Be Rejected by Maintainers

// summary

METR analysis reveals that AI-generated pull requests passing the SWE-bench benchmark often wouldn't survive real code review — raising hard questions about how we measure AI coding ability.

→ read source ↩ back to top10.dev

// share this

// get daily digest

Top 10 dev stories every morning at 8am UTC. AI-curated. Retro terminal HTML email.