PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Jailbreak Detection Python Packages

Python packages with the GitHub topic jailbreak-detection. Sorted by relevance, with stars and monthly downloads.
killertcell428
pyaigis

Deterministic, zero-dependency Python firewall for AI agents — MCP rug-pull, memory poisoning, indirect injection, exfil channels. 44 compliance templates (US/CN/JP/EU).

11K 27 0
uptrain-ai
uptrain

UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform root cause analysis on failure cases and give insights on how to resolve them.

3K 2K 202
mattijsmoens
sovereign-shield

Strictly deterministic AI defense framework: immutable input filtering, dual LLM cryptographic hash consensus, and self-learning adaptive rules. Zero dependencies. Hardware-sealed. Patent Pending.

2K 19 7
maheshmakvana
llm-injection-guard

Drop-in prompt injection defense for LLM apps and AI agents — detect, block, and audit injection attacks in real time

848 0 0
Priyrajsinh
p1-hybrid-jailbreak-detector

Hybrid LLM jailbreak and prompt injection detector. ModernBERT + LoRA + perplexity gate + FAISS similarity search.

759 0 0
SoubhikGhosh
soweak

OWASP LLM Top 10 security middleware framework for Python.

682 1 0
yfedoseev
jailguard

Pure-Rust prompt-injection detector with 1.5MB embedded MLP classifier. 98.40% accuracy, p50 14ms CPU inference, bindings for Python/JS/Go. Apache-2.0/MIT alternative to Rebuff (archived) and Lakera Guard.

681 3 1
DmitrL-dev
sentinel-llm-security

AI Security Platform: Defense (61 Rust engines + Micro-Model Swarm) + Offense (39K+ payloads)

468 106 17
lockllm
lockllm

Official Python SDK for LockLLM

433 0 0
vpdeva
blackwall-llm-shield-python

Security middleware for Python LLM apps and services. Blocks prompt injection, masks PII, inspects outputs, and gates agent tools.

423 1 0
mattijsmoens
intentshield

Pre-execution intent verification for AI agents. Audits what your AI is about to do, not what it says. Zero dependencies, deterministic, hash-sealed.

355 19 5
stef41
injectionguard

Prompt injection detection for LLM applications and MCP servers. Detects jailbreaks, instruction override, encoded attacks. OWASP LLM #1 defense.

346 1 0
akshaymagapu
aisafeguard

Open-source LLM safety guardrails: prompt injection protection, PII redaction, toxicity filtering, and OpenAI-compatible AI proxy

343 0 0
DmitrL-dev
rlm-toolkit

AI Security Platform: Defense (61 Rust engines + Micro-Model Swarm) + Offense (39K+ payloads)

276 106 17
mattijsmoens
sovereign-shield-adaptive

Deterministic AI defense framework: immutable input filtering, n-model cryptographic hash consensus, and self-learning adaptive rules. Zero dependencies. Hardware-sealed. Patent Pending.

247 19 7
Adxzer
pydefend

AI security guardrails for LLM applications — scan inputs and check outputs with Claude, OpenAI, Gemini, Azure, or Ollama.

118 0 0
dronefreak
promptscreen

Protect your LLMs from prompt injection and jailbreak attacks. Easy-to-use Python package with multiple detection methods, CLI tool, and FastAPI integration.

104 9 4
uptrain-ai
vellum-uptrain-fork

UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform root cause analysis on failure cases and give insights on how to resolve them.

76 2K 202
    • Data from PyPI, GitHub, ClickHouse, and BigQuery