hallucination-evaluation
UQLM (Uncertainty Quantification for Language Models) is a Python package for UQ-based LLM hallucination detection
Lightweight hallucination detection framework for RAG applications
Geometric LLM grounding verification — deterministic, auditable, no second LLM. Python library for measuring how faithfully model outputs reflect their sources.
[ACL 2024] User-friendly evaluation framework with an eval suite and benchmarks: UHGEval, HaluEval, HalluQA, etc.