PyRank

Trustworthy AI Python Packages

Python packages with the GitHub topic trustworthy-ai. Sorted by relevance, with stars and monthly downloads.
Giskard-AI
giskard

🐢 Open-Source Evaluation & Testing library for LLM Agents

36K 5K 458
Trusted-AI
adversarial-robustness-toolbox

Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams

34K 6K 1K
akios-ai
akios

AKIOS runtime for secure AI agent execution

3K 7 3
encypherai
encypher-ai

Metadata encoding and extraction for AI-generated content

3K 30 3
yaniv-golan
proof-citations

Fetch URLs and verify that quoted text appears on the page. Extracted from proof-engine.

3K 7 1
yaniv-golan
proof-engine-wiki

Attach verified Proof Engine proofs to LLM-wiki claims.

2K 7 1
rhesis-ai
rhesis-sdk

The testing platform for AI teams. Bring engineers, PMs, and domain experts together to generate tests, simulate (adversarial) conversations, and trace every failure to its root cause.

2K 346 24
yaniv-golan
proof-engine-registry

Proof Registry protocol: client, reference server, and static-JSON emitter.

2K 7 1
Pacific-AI-Corp
langtest

Deliver safe & effective language models

2K 557 49
aiverify-foundation
aiverify-moonshot

AI Verify advances Gen AI testing with Project Moonshot.

2K 322 62
edadaltocg
detectors

Python package to accelerate research on generalized out-of-distribution (OOD) detection.

2K 15 1
JohnSnowLabs
nlptest

Deliver safe & effective language models

2K 557 49
rhesis-ai
rhesis

The testing platform for AI teams. Bring engineers, PMs, and domain experts together to generate tests, simulate (adversarial) conversations, and trace every failure to its root cause.

2K 346 24
Trustifai
trustifai

TrustifAI: A Comprehensive Framework for AI Trustworthiness

1K 10 1
HowieHwong
trustllm

[ICML 2024] TrustLLM: Trustworthiness in Large Language Models

1K 625 67
Shepard2154
vexrag

A Red Team framework that evaluates RAG functional correctness when the retrieval backend contains poisoned passages.

1K 1 0
Principled-Evolution
aicertify

Compliance-as-code for AI systems: evaluate AI apps against EU AI Act, NIST AI RMF, and OPA/Rego policies.

1K 2 0
Khanz9664
trustlens

Open-source Python library for evaluating ML model reliability beyond accuracy — with calibration, failure, and fairness diagnostics for informed deployment decisions.

1K 12 12
pulkitj
groundguard

Verify LLM output against your source documents. Catch hallucinations in RAG pipelines and agentic workflows before they reach users.

1K 0 0
IRT-SystemX
dqm-ml-images

Python library designed to provide core dqm-ml metrics without huge dependencies, as well as a common API shared by metrics

1K 5 3
akios-ai
enforcecore

Lightweight runtime enforcement for agentic AI. PII masking, policy checks, and Merkle audit trails as a decorator.

940 5 3
THU-BPM
markdiffusion

MarkDiffusion: An Open-Source Toolkit for Generative Watermarking of Latent Diffusion Models

939 316 19
Vbj1808
dokis

Lightweight RAG provenance middleware. Verifies every claim in an LLM response is grounded in a retrieved source - without an LLM call.

911 36 0
IRT-SystemX
dqm-ml-pytorch

Python library designed to provide core dqml domain gap metrics, as well as a common API shared by metrics

841 5 3
Data from PyPI, GitHub, ClickHouse, and BigQuery