PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Hallucination Detection Python Packages

Python packages with the GitHub topic hallucination-detection. Sorted by relevance, with stars and monthly downloads.
fathom-lab
styxx

Cognitive observability for LLM agents. Cognometric instruments + self-healing reflex (F10) + MCP server. Pure-Python, MIT, no LLM required. 9-for-9 on K=1 phase transition. Every Mind Leaves Vitals (DOI 10.5281/zenodo.19777921).

15K 5 1
juyterman1000
entroly-core

Open-source context engine that catches AI hallucinations and cuts your token bill 70–95%. The only AI helper that shows its work. Claude · Cursor · Codex,GPT & Custom Providers

13K 381 62
juyterman1000
entroly

Open-source context engine that catches AI hallucinations and cuts your token bill 70–95%. The only AI helper that shows its work. Claude · Cursor · Codex,GPT & Custom Providers

13K 381 62
cvs-health
uqlm

UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detection

5K 1K 123
Nomadu27
insa-its

Runtime Security for Multi-Agent AI — Website & Documentation

5K 26 0
krlabsorg
lettucedetect

Lightweight hallucination detection framework for RAG applications

5K 573 40
uptrain-ai
uptrain

UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform root cause analysis on failure cases and give insights on how to resolve them.

3K 2K 202
Basaltlabs-app
gauntlet-cli

Community-driven behavioral reliability benchmark for LLMs. 231 probes across 19 modules, deterministic scoring, perplexity correlation, layer sensitivity mapping, quant method capture, hardware-stratified community rankings. Every test contributes to the community dataset.

2K 6 0
groundlens-dev
groundlens

Geometric LLM grounding verification — deterministic, auditable, no second LLM. Python library for measuring how faithfully model outputs reflect their sources.

2K 0 0
mattijsmoens
sovereign-shield

Strictly deterministic AI defense framework: immutable input filtering, dual LLM cryptographic hash consensus, and self-learning adaptive rules. Zero dependencies. Hardware-sealed. Patent Pending.

2K 19 7
groundlens-dev
groundlens-mcp

MCP server for groundlens — LLM hallucination detection for Claude Desktop, Cursor, Windsurf, and any MCP-compatible client.

2K 0 0
ENDEVSOLS
longtrainer

Production-ready RAG framework for Python — multi-tenant chatbots with streaming, tool calling, agent mode (LangGraph), vector search (FAISS), and persistent MongoDB memory. Built on LangChain.

2K 28 3
anulum
director-ai

Real-time LLM hallucination guardrail — NLI + RAG fact-checking with token-level streaming halt

2K 0 0
pauti04
chaincheck

LLM hallucination detection toolkit — NLI, LLM-as-judge, self-consistency, logprobs, QA. FastAPI + streaming UI.

2K 1 0
MigoXLab
dingo-python

Dingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool

1K 700 72
hinanohart
yuragi

yuragi — LLM Confidence Fragility Analyzer. Perturbation-driven hallucination detection with workshop-grade real benchmarks (TruthfulQA n=412 ensemble AUC 0.73, TriviaQA n=200 confidence-inversion AUC 0.75).

1K 0 0
mattijsmoens
sovereign-mcp

Deterministic MCP Security Architecture. FrozenNamespace as Root of Trust for Model Context Protocol tool verification

1K 3 4
pulkitj
groundguard

Verify LLM output against your source documents. Catch hallucinations in RAG pipelines and agentic workflows before they reach users.

1K 0 0
ENDEVSOLS
longtracer

RAG verification guardrails — detect hallucinations in LLM responses using hybrid STS + NLI.

902 32 4
Vbj1808
dokis

Lightweight RAG provenance middleware. Verifies every claim in an LLM response is grounded in a retrieved source - without an LLM call.

898 36 0
QWED-AI
qwed

The Deterministic Verification Protocol for AI - 11 verification engines for math, logic, code, SQL, facts, images, and more. Now with Agentic Security Guards.

765 55 8
mattijsmoens
logicshield

Deterministic validation firewall that verifies AI-generated proposals against ground-truth state using immutable rules. Zero dependencies. Patent pending.

406 2 0
hinanohart
yuragi-ai

yuragi — LLM Confidence Fragility Analyzer. Perturbation-driven hallucination detection with workshop-grade real benchmarks (TruthfulQA n=412 ensemble AUC 0.73, TriviaQA n=200 confidence-inversion AUC 0.75).

383 0 0
hinanohart
yuragi-guardrails

yuragi — LLM Confidence Fragility Analyzer. Perturbation-driven hallucination detection with workshop-grade real benchmarks (TruthfulQA n=412 ensemble AUC 0.73, TriviaQA n=200 confidence-inversion AUC 0.75).

371 0 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery