huggingface-spaces
NUMA-aware GPU provisioning and orchestration for stateless MoE workloads of all sizes