PyRank

Evaluations Python Packages

Python packages tagged with the GitHub topic "evaluations", sorted by relevance and shown with their stars and monthly downloads.
RailtownAI
railtracks

An agentic framework that helps developers build resilient agentic systems

3K 133 12
log10-io
log10-io

Python client library for improving your LLM app accuracy

1K 96 13
yisaienkov
evaluations

The library for models evaluation

1K 14 1
kaivid-labs
evret

Evals framework for Information Retrieval Systems

1K - -
gabe-mousa
apolien

AI Safety Evaluation Library

1K 5 1
evaluation-context-protocol
ecp-sdk

ECP is a standardized interface for orchestrating, auditing, and enforcing authority limits in AI Agent evaluations. It moves evaluation from "brittle Python scripts" to a deterministic infrastructure protocol.

1K 8 1
evaluation-context-protocol
ecp-runtime

ECP is a standardized interface for orchestrating, auditing, and enforcing authority limits in AI Agent evaluations. It moves evaluation from "brittle Python scripts" to a deterministic infrastructure protocol.

1K 8 1
RailtownAI
railtracks-cli

An agentic framework that helps developers build resilient agentic systems

861 133 12
aniketgopal
aniket-agentlens-sdk

Open-source observability, security, and evaluation platform for AI agents

504 0 0
mandoline-ai
mandoline

Official Python client for the Mandoline API

251 2 0
Data from PyPI, GitHub, ClickHouse, and BigQuery.