VRAM Optimization Python Packages

dynabatch

PyTorch/Hugging Face batching utility that sorts variable-length text by difficulty, then dynamically increases the batch size on easier samples. A pre-trained VRAM predictor guides the sizing to improve GPU utilization and throughput, and fallback handling reduces the risk of out-of-memory (OOM) errors.

653 1 0
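
The batching idea in the dynabatch description can be illustrated with a minimal sketch. This is not dynabatch's API: `budgeted_batches`, `padded_chars`, and `run_with_fallback` are hypothetical names, the cost function is a toy stand-in for a trained VRAM predictor, and string length stands in for "difficulty".

```python
from typing import Callable, List, Sequence


def budgeted_batches(
    texts: Sequence[str],
    cost: Callable[[int, int], float],
    budget: float,
) -> List[List[str]]:
    """Group texts into batches whose estimated cost stays within budget.

    cost(batch_size, max_len) is a hypothetical stand-in for a VRAM predictor.
    """
    # Hardest (longest) samples first, so batches can grow as samples get easier.
    ordered = sorted(texts, key=len, reverse=True)
    batches: List[List[str]] = []
    current: List[str] = []
    for text in ordered:
        # Sorted descending, so the first item of a batch is its longest.
        max_len = len(current[0]) if current else len(text)
        if current and cost(len(current) + 1, max_len) > budget:
            batches.append(current)
            current = []
        current.append(text)
    if current:
        batches.append(current)
    return batches


def padded_chars(batch_size: int, max_len: int) -> float:
    """Toy cost model (assumption): padded batch area in characters."""
    return float(batch_size * max_len)


def run_with_fallback(batch: List[str], run: Callable[[List[str]], list]) -> list:
    """Run a batch; on MemoryError, split it in half and retry each half."""
    try:
        return run(batch)
    except MemoryError:
        if len(batch) == 1:
            raise  # a single sample that does not fit cannot be split further
        mid = len(batch) // 2
        return run_with_fallback(batch[:mid], run) + run_with_fallback(batch[mid:], run)


texts = ["a" * 8, "b" * 4, "c" * 4, "d" * 2, "e" * 2, "f" * 2, "g" * 2]
batches = budgeted_batches(texts, padded_chars, budget=8)
batch_sizes = [len(b) for b in batches]
```

With this toy budget the batch size grows as the samples get shorter. In a PyTorch setting the fallback would typically catch `torch.cuda.OutOfMemoryError` rather than `MemoryError`.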

gemma4-adaptive-router

Complexity- and VRAM-aware routing for local dual-tier LLM deployments

307 1 1
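
As a rough illustration of what complexity- and VRAM-aware routing between two model tiers might look like, here is a minimal sketch. It does not reflect the gemma4-adaptive-router API: `Tier`, `complexity_score`, `route`, the keyword list, and the threshold are all hypothetical choices made for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Tier:
    name: str
    vram_gb: float  # hypothetical memory needed to serve this tier


def complexity_score(prompt: str) -> float:
    """Toy heuristic in [0, 1]: longer prompts and reasoning keywords score higher."""
    keywords = ("prove", "derive", "step by step", "refactor")
    length_part = min(len(prompt.split()) / 200.0, 1.0)
    keyword_part = 0.5 if any(k in prompt.lower() for k in keywords) else 0.0
    return min(length_part + keyword_part, 1.0)


def route(
    prompt: str,
    free_vram_gb: float,
    small: Tier,
    large: Tier,
    threshold: float = 0.4,
) -> Tier:
    """Pick the large tier only when the prompt looks complex AND it fits in VRAM."""
    wants_large = complexity_score(prompt) >= threshold
    if wants_large and free_vram_gb >= large.vram_gb:
        return large
    return small


small = Tier("small", 4.0)
large = Tier("large", 16.0)
easy = route("What is 2+2?", 24.0, small, large)
hard = route("Prove that sqrt(2) is irrational step by step", 24.0, small, large)
constrained = route("Prove that sqrt(2) is irrational step by step", 8.0, small, large)
```

The design choice here is that memory acts as a hard constraint and complexity as a preference: a complex prompt still falls back to the small tier when the large one would not fit.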

sparsemma

INT8 Sparse Tensor Core GEMM for PyTorch — built for Windows

69 1 0