PyRank

Benchmark Python Packages

Python packages tagged with the GitHub topic "benchmark". Sorted by relevance, with stars and monthly downloads.
swe-bench
swebench

SWE-bench: Can Language Models Resolve Real-world GitHub Issues?

36.2M 5K 862
ionelmc
pytest-benchmark

pytest fixture for benchmarking code

14.2M 1K 132
embeddings-benchmark
mteb

MTEB: Massive Text Embedding Benchmark

2.8M 3K 614
smarie
pytest-harvest

Store data created during your `pytest` tests execution, and retrieve it at the end of the session, e.g. for applicative benchmarking purposes.

503K 76 10
airspeed-velocity
asv

Airspeed Velocity: A simple Python benchmarking tool with web-based reporting

388K 1K 207
MichaelGrupp
evo

Python package for the evaluation of odometry and SLAM

213K 4K 792
cheind
motmetrics

📊 Benchmark multiple object trackers (MOT) in Python

175K 1K 262
google
google-benchmark

A microbenchmark support library

83K 10K 2K
python
pyperformance

Python Performance Benchmark Suite

83K 1K 203
tarasko
picows

Ultra-fast websocket client and server for asyncio

58K 269 18
membrowse
membrowse

Track and analyze binary size and memory footprint in embedded firmware

52K 20 1
beir-cellar
beir

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

47K 2K 244
MedMNIST
medmnist

[pip install medmnist] 18x Standardized Datasets for 2D and 3D Biomedical Image Classification

46K 1K 207
open-mmlab
mmpose

OpenMMLab Pose Estimation Toolbox and Benchmark.

38K 8K 1K
optuna
optunahub

Python library to use and implement packages in OptunaHub

33K 57 15
NyanKiyoshi
pytest-django-queries

Generate performance reports from your django database performance tests.

32K 84 2
evalplus
evalplus

Rigorous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

26K 2K 198
bigcode-project
bigcodebench

[ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI

17K 500 72
ethz-spylab
agentdojo

A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.

17K 564 147
blooop
holobench

A package for benchmarking the characteristics of arbitrary functions

17K 4 3
kdeldycke
chessboard

🎲 CLI to solve combinatoric chess puzzles.

16K 8 4
Ceyron
apebench

[NeurIPS 2024] A benchmark suite for autoregressive neural emulation of PDEs. (≥46 PDEs in 1D, 2D, 3D; Differentiable Physics; Unrolled Training; Rollout Metrics)

16K 100 2
PyJobShop
fjsplib

Python package to read and write instances for the flexible job shop problem.

16K 8 0
huggingface
optimum-benchmark

🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes.

12K 336 58
    • Data from PyPI, GitHub, ClickHouse, and BigQuery