Related items
AIHugging Face Blog
Welcome Llama 4 Maverick & Scout on Hugging Face
AIarXiv cs.AI
MobileMoE: Scaling On-Device Mixture of Experts
Mixture-of-Experts (MoE) has become the de facto architecture for hundred-billion-parameter language models, yet its advantages at sub-billion scales for on-device deployment remain largely unexplored. To close this gap, we present MobileMoE, a family of on-device MoE language models with sub-billion active paramete...
AIHugging Face Blog
Welcome spaCy to the Hugging Face Hub
AIHugging Face Blog
Welcome fastai to the Hugging Face Hub
AIHugging Face Blog
Mixture of Experts Explained
AIHugging Face Blog