PyRank

LLM-as-Evaluator Python Packages

Python packages tagged with the GitHub topic llm-as-evaluator, sorted by relevance and listed with stars and monthly downloads.
Pacific-AI-Corp
langtest

Deliver safe & effective language models

2K 557 49
JohnSnowLabs
nlptest

Deliver safe & effective language models

2K 557 49
trustyai-explainability
vllm-judge

A tiny, lightweight library for LLM-as-a-Judge evaluations on vLLM-hosted models.

565 2 2
IAAR-Shanghai
xfinder

[ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation

304 178 7
rafaelsandroni
llm-antibodies

Antibodies for LLMs hallucinations (grouping LLM as a judge, NLI, reward models)

196 0 0
rafaelsandroni
antibodies-rafaelsandroni

Antibodies for LLMs hallucinations (grouping LLM as a judge, NLI, reward models)

133 0 0
Data from PyPI, GitHub, ClickHouse, and BigQuery