svdquant
A PyTorch-native inference engine with cache, parallelism, quantization for Diffusion Transformers.