ai-arena
Pit AI coding agents against the same bug. Score them on tests, diff, cost, and time — pick the winning patch.