$ cat /top10/Thu Mar 12 2026 00:00:00 GMT+0000 (Coordinated Universal Time)/study-finds-many-swe-bench-passing-ai-prs-would-be-rejected-by-maintainers-55
METR analysis reveals that AI-generated pull requests passing the SWE-bench benchmark often wouldn't survive real code review — raising hard questions about how we measure AI coding ability.
Top 10 dev stories every morning at 8am UTC. AI-curated. Retro terminal HTML email.