PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Llm Safety Python Packages

Python packages with the GitHub topic llm-safety. Sorted by relevance, with stars and monthly downloads.
NVIDIA-NeMo
nemoguardrails

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

296K 6K 679
confident-ai
deepteam

DeepTeam is a framework to red team LLMs and LLM systems.

64K 2K 277
fathom-lab
styxx

Cognitive observability for LLM agents. Cognometric instruments + self-healing reflex (F10) + MCP server. Pure-Python, MIT, no LLM required. 9-for-9 on K=1 phase transition. Every Mind Leaves Vitals (DOI 10.5281/zenodo.19777921).

16K 5 1
sattyamjjain
agent-airlock

Open-source security firewall for AI agents — validates tool calls, strips ghost arguments, enforces type safety, PII masking, RBAC, cost tracking & sandbox isolation. Works with LangChain, OpenAI Agents SDK, PydanticAI & CrewAI.

11K 6 1
cvs-health
uqlm

UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detection

5K 1K 123
AaditPani-RVU
neurosym-ai

Neuro-symbolic guardrails for LLMs — injection detection, harm filters, output guards, streaming safety, and action-plan validation.

3K 2 0
HeadyZhang
agent-audit

Static security scanner for LLM agents — prompt injection, MCP config auditing, taint analysis. 49 rules mapped to OWASP Agentic Top 10 (2026). Works with LangChain, CrewAI, AutoGen.

2K 170 18
QWED-AI
qwed-finance

Deterministic verification middleware for banking and financial AI. NPV, IRR, loan amortization, and interest calculations with QWED precision.

2K 2 2
kitelogik
kitelogik

Governance control plane for autonomous AI agents — OPA/Rego policy enforcement at the agent action layer. Apache 2.0.

1K 1 0
zunoworks
gateguard-ai

A fact-forcing hook gate for Claude Code. Makes the AI pause and investigate before editing.

1K 3 0
Chimera-Protocol
csl-core

CSL-Core: Deterministic Safety Layer for Probabilistic AI Systems

1K 12 9
QWED-AI
qwed

The Deterministic Verification Protocol for AI - 11 verification engines for math, logic, code, SQL, facts, images, and more. Now with Agentic Security Guards.

831 55 8
roli-lpci
lintlang

Static linter for AI agent configs, tool descriptions, and system prompts with zero-LLM CI gating

752 33 1
Serhii2009
brix-protocol

Runtime Reliability Infrastructure for LLM Pipelines — enforce deterministic rules, measure the Balance Index, and audit every decision.

590 8 0
open-bias
openbias

Open Source Reliability Harness: Make your agents follow rules. One line of code to‎ ‎enforce, trace, and improve. ‎ ‎

584 117 3
zabinskirafal
agi-pragma

AI Action Firewall — seven-stage Decision Intelligence Core for safe agentic AI

502 0 0
zabinskirafal
guardex

Guardex - AI Control Plane for autonomous agents (closed source)

469 0 0
vpdeva
blackwall-llm-shield-python

Security middleware for Python LLM apps and services. Blocks prompt injection, masks PII, inspects outputs, and gates agent tools.

423 1 0
theDoc001
fivedrisk

Per-action AI agent risk scoring and governance. Deterministic 5D scoring, HITL gating, FinOps, Agent Cost Management, Markov drift, audit log. Apache-2.0.

348 0 0
DilawarShafiq
unworldly-recorder

The flight recorder for AI agents. Tamper-proof, ISO 42001 + HIPAA-compliant audit trails for everything AI agents do on your system. File changes + shell commands + PHI detection + agent identity.

347 7 0
ThuCCSLab
jailbreakeval

[NDSS'25 Best Technical Poster] A collection of automated evaluators for assessing jailbreak attempts.

274 192 12
open-sentinel
opensentinel

Reliability layer for AI agents - monitors workflow adherence and intervenes when agents deviate

220 101 3
sarvanithin
medguard-llm

AI-powered medical guardrails API — PHI detection, drug safety, hallucination checks

200 0 1
orchintel
ioa-core

IOA Core — the open-source governance-first kernel for AI orchestration. Clean public repo with OSS-only code, docs, and releases.

150 0 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery