PyRank

Regression Testing Python Packages

Python packages with the GitHub topic regression-testing. Sorted by relevance, with stars and monthly downloads.
manav8498
shadow-diff

Behavior contracts for AI agents

31K 9 0
max-sixty
pytest-accept

A pytest plugin for automatically updating doctest outputs

20K 78 7
reframe-hpc
reframe-hpc

A powerful Python framework for writing and running portable regression tests and benchmarks for HPC systems.

16K 275 122
blackwell-systems
mcp-assert

The deterministic testing standard for MCP servers. Connect over real stdio/SSE/HTTP transport, call tools with real arguments, assert results with 18 assertion types defined in YAML. Any language, any transport, no mocks. Single Go binary.

10K 8 1
blackwell-systems
pytest-mcp-assert

The deterministic testing standard for MCP servers. Connect over real stdio/SSE/HTTP transport, call tools with real arguments, assert results with 18 assertion types defined in YAML. Any language, any transport, no mocks. Single Go binary.

4K 8 1
hidai25
evalview

Regression testing for AI agents. Snapshot behavior, diff tool calls, catch regressions in CI. Works with LangGraph, CrewAI, OpenAI, Anthropic.

3K 105 20
softwareTestingResearch
pytest-ranking

A Pytest plugin for faster fault detection via regression test prioritization

2K 4 1
sagikimhi
socx-cli

Unified command-line tool for EDA development teams to streamline common tasks and tools, and unify them under a single configurable CLI menu to increase accessibility and transparency of tools and scripts in collaborative development environments.

2K 0 1
BAder82t
fhe-attack-replay

Unified attack-replay regression harness for FHE libraries (SEAL, OpenFHE, Lattigo, tfhe-rs).

2K 2 0
qualixar
agentassay

Token-efficient stochastic testing for AI agents. 5-20x cost reduction. 10 framework adapters. Paper: arXiv:2603.02601

948 5 1
trytouca
touca

Touca SDK for Python

942 511 25
ENDEVSOLS
longprobe

Sub-second RAG regression testing. Define golden questions, detect lost chunks in CI. pytest for your RAG pipeline.

777 7 1
stef41
modeldiffx

Behavioral regression testing for LLMs. Capture outputs, diff behavior, detect drift — pytest for model upgrades.

722 1 0
ENDEVSOLS
langchain-longprobe

LangChain integration for LongProbe — sub-second RAG retrieval regression testing with chunk-level diffing

695 2 0
slxiao
conport

Generate continuous testing report

564 14 4
damies13
testdatatable

A shared data table store for use by testing applications

502 11 0
MigoXLab
webqa-agent

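Autonomous web browser agent that audits performance, functionality & UX for engineers and vibe-coding creators. An autonomous website evaluation and testing agent that supports one-click GUI/CLI testing and evaluation of performance, functionality, and interaction experience.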

444 212 16
trytouca
touca-fbs

Auto-generated Python implementation of Touca FlatBuffers schema

395 511 25
mozilla
tlscanary

TLS/SSL Test Suite for Mozilla Firefox

370 19 10
MPrazeres-1983
promptforge-llmops

Open-source LLMOps framework for prompt versioning, evaluation and regression testing in CI/CD pipelines.

351 0 0
davidchall
nrtest

Numerical regression testing

344 5 3
faivlex
tddf

Behaviour regression tests for AI agents — deterministic, local-first, no LLM-as-judge

340 2 0
locomotive-lib
locomotive

Load and regression performance testing for CI/CD pipelines

326 3 1
ericckzhou
falsifyai

Falsification-first reliability testing for AI systems: perturb inputs, preserve replayable evidence, diff reliability across model changes.

284 0 0
Data from PyPI, GitHub, ClickHouse, and BigQuery