PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Llm Evals Python Packages

Python packages with the GitHub topic llm-evals. Sorted by relevance, with stars and monthly downloads.
buildwithabid
ai-stability

Measure LLM output consistency from the command line.

508 0 0
SproutSeeds
dormant-behavior-audit

Benchmark assets, reproducibility tooling, and evidence checks for dormant behavior audit.

440 0 0
The-Swarm-Corporation
evalops

An implementation of the Anthropic's paper and essay on "A statistical approach to model evaluations"

262 16 2
    • Data from PyPI, GitHub, ClickHouse, and BigQuery