PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Ai Evaluation Tools Python Packages

Python packages with the GitHub topic ai-evaluation-tools. Sorted by relevance, with stars and monthly downloads.
meshkovQA
eval-ai-library

Comprehensive AI Model Evaluation Framework with advanced techniques including Temperature-Controlled Verdict Aggregation via Generalized Power Mean. Support for multiple LLM providers and 15+ evaluation metrics for RAG systems and AI agents.

4K 37 3
raga-ai-hub
agentneo

Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, llm and tools tracing, debugging multi-agentic system, self-hosted dashboard and advanced analytics with timeline and execution graph view

2K 16K 4K
ianarawjo
evalstats

Statistically sane analysis methods for comparing AI model and prompt performance.

581 101 2
ianarawjo
promptstats

Statistical analysis methods for comparing prompt and model performance in LLM evaluations.

388 103 2
    • Data from PyPI, GitHub, ClickHouse, and BigQuery