PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Agent Evals Python Packages

Python packages with the GitHub topic agent-evals. Sorted by relevance, with stars and monthly downloads.
kallemickelborg
nodetracer

The node-level tracing library for agentic software.

952 1 1
HumphreySun98
repoagentbench

SWE-bench for your codebase. Turn merged PRs into reproducible coding-agent benchmarks.

421 24 0
The-Swarm-Corporation
evalops

An implementation of the Anthropic's paper and essay on "A statistical approach to model evaluations"

265 16 2
    • Data from PyPI, GitHub, ClickHouse, and BigQuery