conditional-computation
Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch