gemma4
🚀 PyTorch Distributed native training library for LLMs/VLMs with out-of-the-box Hugging Face support
🔥 Python / Mojo Interface for Google Gemma 4
Adaptive dual-tier serving for Gemma 4 on consumer 16 GB GPUs. Routes requests between vLLM (E4B) and llama.cpp (27B) based on request complexity and real-time VRAM availability. Production stack with OpenWebUI, monitoring, and more.
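
As a rough illustration of the dual-tier routing idea in the last entry, here is a minimal sketch of a router that picks a backend from a complexity score and the currently free VRAM. It is not taken from that project: the tier labels, the thresholds, the keyword heuristic, and the choice to fall back to the smaller tier when VRAM is tight are all assumptions made for the example.

```python
from dataclasses import dataclass

# Hypothetical tier labels; the description only names "vLLM E4B" and "llama.cpp 27B".
FAST_TIER = "vllm-e4b"
LARGE_TIER = "llamacpp-27b"


@dataclass(frozen=True)
class RouterConfig:
    # Assumed values, chosen only to make the example concrete.
    complexity_threshold: float = 0.5  # score at or above this goes to the large tier
    min_free_vram_gib: float = 2.0     # headroom required before using the large tier


def estimate_complexity(prompt: str) -> float:
    """Toy heuristic returning a score in [0, 1].

    Longer prompts and prompts containing reasoning-style keywords score higher.
    A real router would likely use something more robust.
    """
    keywords = ("prove", "explain", "step by step", "analyze", "refactor")
    length_score = min(len(prompt.split()) / 200.0, 1.0)
    keyword_score = 0.5 if any(k in prompt.lower() for k in keywords) else 0.0
    return min(length_score + keyword_score, 1.0)


def choose_tier(prompt: str, free_vram_gib: float,
                config: RouterConfig = RouterConfig()) -> str:
    """Pick a backend tier for one request."""
    # Assumption: when VRAM headroom is low, stay on the smaller model.
    if free_vram_gib < config.min_free_vram_gib:
        return FAST_TIER
    if estimate_complexity(prompt) >= config.complexity_threshold:
        return LARGE_TIER
    return FAST_TIER


if __name__ == "__main__":
    print(choose_tier("hi", free_vram_gib=8.0))
    print(choose_tier("Explain step by step why the sky is blue", free_vram_gib=8.0))
    print(choose_tier("Explain step by step why the sky is blue", free_vram_gib=1.0))
```

The free-VRAM figure is passed in as a plain number so the sketch runs without a GPU. In a PyTorch environment it could be read from `torch.cuda.mem_get_info()`, which returns free and total memory in bytes.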