claude-opus-4-7
SWE-bench for your codebase. Turn merged PRs into reproducible coding-agent benchmarks.