PyRank

Quantization Aware Training Python Packages

Python packages with the GitHub topic quantization-aware-training. Sorted by relevance, with stars and monthly downloads.
openvinotoolkit
nncf

Neural Network Compression Framework for enhanced OpenVINO™ inference

482K 1K 294
intel
neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

32K 3K 305
intel
neural-compressor-pt

Repository of Intel® Neural Compressor

1K 3K 305
intel
neural-compressor-tf

Repository of Intel® Neural Compressor

1K 3K 305
666DZY666
micronet

micronet, a model compression and deployment library. Compression: (1) quantization: quantization-aware training (QAT), high-bit (>2b) (DoReFa / Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference), low-bit (≤2b)/ternary and binary (TWN/BNN/XNOR-Net); post-training quantization (PTQ), 8-bit (TensorRT); (2) pruning: normal, regular, and group convolutional channel pruning; (3) group convolution structure; (4) batch-normalization fusion for quantization. Deployment: TensorRT, fp32/fp16/int8 (PTQ calibration), op-adapt (upsample), dynamic_shape.

938 2K 474
intel
neural-compressor-full

Repository of Intel® Neural Compressor

464 3K 305
balditommaso
pylandscape

This project proposes loss landscape analysis as an effective methodology for understanding the robustness of QNNs against natural perturbations.

457 3 1
intel
neural-solution

Repository of Intel® Neural Compressor

423 3K 305
intel
neural-insights

Repository of Intel® Neural Compressor

355 3K 305
intel
lpot

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

285 3K 306
mlzxy
qsparse

Train neural networks with joint quantization and pruning on both weights and activations using any PyTorch modules.

230 42 2
IST-DASLab
pyllmq

Quantized LLM training in pure CUDA/C++.

109 246 14
intel
ilit

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

105 3K 306
intel
neural-compressor-3x-pt

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

16 3K 305
intel
neural-compressor-3x-ort

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

3 3K 305
intel
neural-compressor-3x-tf

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

3 3K 305
openvinotoolkit
otxdet

OpenVINO Training Extensions Object Detection

1 93 30
    • Data from PyPI, GitHub, ClickHouse, and BigQuery