PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Ai Benchmarks Python Packages

Python packages with the GitHub topic ai-benchmarks. Sorted by relevance, with stars and monthly downloads.
humanjudge
grandjury

Python SDK for HumanJudge — real human evaluations of AI models. 25,000+ blind reviews by 200+ verified reviewers across 58 models and 44 benchmarks. Free.

1K 1 0
zabinskirafal
agi-pragma

AI Action Firewall — seven-stage Decision Intelligence Core for safe agentic AI

483 0 0
zabinskirafal
guardex

Guardex - AI Control Plane for autonomous agents (closed source)

475 0 0
scicode-bench
nvidia-scicode

A benchmark that challenges language models to code solutions for scientific problems

243 196 34
    • Data from PyPI, GitHub, ClickHouse, and BigQuery