SWE-bench for your codebase. Turn merged PRs into reproducible coding-agent benchmarks.