PyRank

Prompt Evaluation Python Packages

Python packages tagged with the GitHub topic prompt-evaluation, sorted by relevance and listed with stars and monthly downloads.
ankurpand3y
judicator

Who evaluates the evaluator? Judicator audits LLM-as-a-Judge systems for 7 documented bias types. Zero config. Works with any LLM.

1K 7 2
ianarawjo
evalstats

Statistically sane analysis methods for comparing AI model and prompt performance.

581 101 2
ianarawjo
promptstats

Statistical analysis methods for comparing prompt and model performance in LLM evaluations.

411 103 2
prompt-foundry
prompt-foundry-python-sdk

The prompt engineering, prompt management, and prompt evaluation tool for Python

302 8 0
Data from PyPI, GitHub, ClickHouse, and BigQuery.