Mixture of Experts (MoEs) in Transformers

Hugging Face Blog · 2026-02-26

Open source

Related items

AIHugging Face Blog2023-12-11

Mixture of Experts Explained

AIHugging Face Blog2024-02-03

SegMoE: Segmind Mixture of Diffusion Experts

AIarXiv cs.AI2026-05-26

MobileMoE: Scaling On-Device Mixture of Experts

Mixture-of-Experts (MoE) has become the de facto architecture for hundred-billion-parameter language models, yet its advantages at sub-billion scales for on-device deployment remain largely unexplored. To close this gap, we present MobileMoE, a family of on-device MoE language models with sub-billion active paramete...

AIHugging Face Blog2023-12-11

Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face

AIHugging Face Blog2026-04-23

How to Use Transformers.js in a Chrome Extension

AIHugging Face Blog2025-06-23