PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Llm Judge Python Packages

Python packages with the GitHub topic llm-judge. Sorted by relevance, with stars and monthly downloads.
black-yt
structai

StructAI offers a robust toolkit for LLM interaction—such as structured outputs, context management, and parallel execution.

2K 6 1
haizelabs
verdict

Inference-time scaling for LLMs-as-a-judge.

2K 339 26
regokan
eval-harness

A boring, config-driven harness for evaluating AI systems. One YAML drives the run, the trace is the source of truth. Offline, backtesting, and online-eval modes — works with any agent, RAG, or code-modifying system.

1K 0 0
gmitt98
fieldtest

LLM evaluation framework — define what correct, well-formed, and safe means before you measure

869 0 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery