PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Pruning Python Packages

Python packages with the GitHub topic pruning. Sorted by relevance, with stars and monthly downloads.
openvinotoolkit
nncf

Neural Network Compression Framework for enhanced OpenVINO™ inference

482K 1K 294
tensorflow
tensorflow-model-optimization

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.

111K 2K 347
intel
neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

32K 3K 305
quic
aimet-torch

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

30K 3K 451
VainF
torch-pruning

[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.

25K 3K 382
quic
aimet-onnx

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

20K 3K 451
Ruya-AI
cozempic

Context cleaning for Claude Code — prune bloated sessions, protect Agent Teams from context loss, auto-guard with tiered pruning

20K 301 20
tensorflow
tf-model-optimization-nightly

A suite of tools that users, both novice and advanced can use to optimize machine learning models for deployment and execution.

10K 2K 347
neuralmagic
deepsparse

Sparsity-aware deep learning inference runtime for CPUs

4K 3K 192
neuralmagic
sparsezoo

Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes

3K 388 28
neuralmagic
sparseml

Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

3K 2K 156
intel
neural-compressor-pt

Repository of Intel® Neural Compressor

1K 3K 305
neuralmagic
deepsparse-ent

Sparsity-aware deep learning inference runtime for CPUs

1K 3K 192
satabios
sconce

E2E AutoML Model Compression Package

1K 45 4
delve-team
delve

PyTorch model training and layer saturation monitor

1K 83 13
intel
neural-compressor-tf

Repository of Intel® Neural Compressor

1K 3K 305
FasterAI-Labs
fasterai

FasterAI: Prune and Distill your models with FastAI and PyTorch

1K 261 19
666DZY666
micronet

micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference)、Low-Bit(≤2b)/Ternary and Binary(TWN/BNN/XNOR-Net); post-training-quantization(PTQ), 8-bit(tensorrt); 2、 pruning: normal、regular and group convolutional channel pruning; 3、 group convolution structure; 4、batch-normalization fuse for quantization. deploy: tensorrt, fp32/fp16/int8(ptq-calibration)、op-adapt(upsample)、dynamic_shape

938 2K 474
neuralmagic
sparsify

ML model optimization product to accelerate inference.

907 325 31
r-papso
torch-optim

PyTorch models optimization by neural network pruning

888 3 1
open-mmlab
mmrazor

OpenMMLab Model Compression Toolbox and Benchmark.

831 2K 244
EIDOSlab
torch-simplify

Simplification of pruned models for accelerated inference | SoftwareX https://doi.org/10.1016/j.softx.2021.100907

622 36 3
SforAiDL
kd-lib

A Pytorch Knowledge Distillation library for benchmarking and extending works in the domains of Knowledge Distillation, Pruning, and Quantization.

536 650 61
tianyic
only-train-once

Only Train Once (OTO): Automatic One-Shot General DNN Training and Compression Framework

477 311 47
    • Data from PyPI, GitHub, ClickHouse, and BigQuery