PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Auto Tuning Python Packages

Python packages with the GitHub topic auto-tuning. Sorted by relevance, with stars and monthly downloads.
intel
neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

32K 3K 305
SID-Devu
isat-tuner

Inference Stack Auto-Tuner — one command to find the fastest ONNX inference config for any model on any GPU. Bayesian optimization, Pareto analysis, thermal-aware benchmarking, multi-provider (ROCm/CUDA/TensorRT/OpenVINO).

10K 0 0
intel
neural-compressor-pt

Repository of Intel® Neural Compressor

1K 3K 305
intel
neural-compressor-tf

Repository of Intel® Neural Compressor

1K 3K 305
KernelTuner
kernel-tuner

Kernel Tuner

957 395 67
HAL-42
alchemy-cat

Alchemy Cat —— 🔥Config System for SOTA

925 113 7
LexicHQ
smartloop

AI orchestration on your device

607 4 0
fjwillemsen
autotuning-methodology

This software package accompanies the paper "A Methodology for Comparing Auto-Tuning Optimization Algorithms" (https://doi.org/10.1016/j.future.2024.05.021), making the guidelines in the methodology easy to apply.

534 7 3
intel
neural-compressor-full

Repository of Intel® Neural Compressor

454 3K 305
intel
neural-solution

Repository of Intel® Neural Compressor

416 3K 305
intel
neural-insights

Repository of Intel® Neural Compressor

349 3K 305
intel
lpot

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

273 3K 306
isazi
tuning-metrics

Library to compute auto-tuning and performance metrics.

147 1 0
intel
ilit

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

98 3K 306
intel
neural-compressor-3x-pt

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

17 3K 305
intel
neural-compressor-3x-ort

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

3 3K 305
intel
neural-compressor-3x-tf

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

3 3K 305
    • Data from PyPI, GitHub, ClickHouse, and BigQuery