PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Ai Testing Python Packages

Python packages with the GitHub topic ai-testing. Sorted by relevance, with stars and monthly downloads.
langwatch
langwatch-scenario

Agentic testing for agentic codebases

65K 880 60
Giskard-AI
giskard

🐢 Open-Source Evaluation & Testing library for LLM Agents

36K 5K 458
jhd3197
prompture

Prompture is an API-first library for requesting structured JSON output from LLMs (or any structure), validating it against a schema, and running comparative tests between models.

16K 9 0
ksgisang
aat-devqa

AI-powered automated E2E testing. Just enter a URL — AI generates and runs test scenarios.

3K 5 1
Pacific-AI-Corp
langtest

Deliver safe & effective language models

2K 557 49
JohnSnowLabs
nlptest

Deliver safe & effective language models

2K 557 49
Chatbot-TRACER
chatbot-tracer

An automated approach for exploring and testing conversational agents using large language models. TRACER discovers chatbot functionalities, generates user profiles, and creates comprehensive test suites for conversational AI systems.

1K 2 0
Swanand33
llm-behave

Behavioral testing for LLM applications. pytest plugin with semantic assertions, multi-turn conversation testing, and drift detection. No LLM judge needed.

818 1 0
kdunee
intentguard

A Python library for verifying code properties using natural language assertions.

817 35 0
ENDEVSOLS
longprobe

Sub-second RAG regression testing. Define golden questions, detect lost chunks in CI. pytest for your RAG pipeline.

769 7 1
alepot55
agentrial

Statistical evaluation framework for AI agents - pytest for agent trajectories

727 16 2
justinGrosvenor
alignmenter

Check if your AI sounds like your brand, stays safe, and behaves consistently. Works with your custom GPTs, hosted APIs, and local models. Get detailed reports in minutes, not days.

655 5 0
Harshit-J004
py-toolguard

Cloudflare for AI Agents. 7-layer security interceptor and observability dashboard.

475 12 3
AetherLabCo
aetherlab

Open-source tools, SDKs, and resources for AetherLab AI quality control platform

472 1 0
tenro-ai
tenro

Open-source simulation harness for testing AI agents. Simulate LLM and tool calls to test edge cases, failure paths, and agent logic without live API calls.

461 6 0
radoslaw-sz
maia-test-framework

A pytest-based framework for testing multi AI agents systems. It provides a flexible and extensible platform for complex multi-agent simulations. Supports many integrations like LiteLLM, CrewAI, LangChain etc.

436 1 0
Rowusuduah
llm-sentry

Unified AI Reliability Platform. One install, 12 diagnostic engines. Zero-dependency LLM pipeline monitoring.

359 0 0
Addepto
ccheck

A human-friendly framework for testing and evaluating LLMs, RAGs, and chatbots.

345 95 11
ctoapplymatic
sharingan-autotest

Autonomous testing agent for Claude Code. Discovers, tests, diagnoses, and fixes your web app.

290 1 0
RahulMK22
pyllmtest

🚀 Comprehensive testing framework for LLM applications with semantic assertions, multi-provider support, RAG testing, and prompt optimization. Test AI the right way!

267 1 0
sazed5055
llmtest-framework

pytest for LLM apps - Test for grounding failures, prompt injection, safety violations, and regressions

212 3 0
chigwell
llmtestr

A new package that helps developers integration-test AI and LLM applications by validating structured outputs. It takes a user's test scenario or prompt as input, sends it to an LLM, and uses pattern

179 1 0
Forge-NC
forge-nc

Local-first AI coding assistant with 9-layer security, adversarial model testing, and cryptographic audit trails. Break your AI before it breaks your code.

113 2 0
awrshift
housemonkey

Chaos testing for AI apps. 18 extreme personas attack your AI to find edge cases before users do. OWASP LLM Top 10 coverage.

102 1 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery