gradient-clipping
Official implementation of the paper: "ZClip: Adaptive Spike Mitigation for LLM Pre-Training".
Object Classification Training/Inferring Framework
Clip gradient norm automatically
PyTorch extension for alternative backward rules and gradient transforms (STE, gradient jamming, non-standard activations).