Scalable and Interpretable Mixture of Experts Models in Machine Learning: Foundations, Applications, and Challenges
Abstract
Mixture of Experts (MoE) models have emerged as a powerful framework in machine learning, combining multiple specialized expert networks through a gating mechanism to enable scalable, efficient, and adaptive computation. This survey provides a comprehensive and mathematically rigorous overview of efficient and explainable MoE architectures, encompassing their theoretical foundations, optimization properties, and generalization guarantees. We explore a broad range of applications across natural language processing, computer vision, reinforcement learning, healthcare, and industrial domains, illustrating the versatility and empirical effectiveness of MoE models. A central focus is placed on explainability: we formalize attribution methods that leverage the modular structure of MoE, discuss quantitative metrics for interpretability, and examine strategies to enhance transparency and trustworthiness. Finally, we identify key open challenges and promising research directions, aiming to bridge the gap between scalable model design and human-centric interpretability. This survey serves as a foundational resource for advancing the development of efficient, explainable, and robust Mixture of Experts models in modern machine learning.
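To make the core mechanism concrete, the following is a minimal sketch of the MoE computation the abstract describes: a softmax gate produces a distribution over experts, and the model output is the gate-weighted combination of the expert outputs. All dimensions, weight initializations, and linear experts here are illustrative assumptions, not details taken from the survey.

```python
import numpy as np

# Hypothetical sizes for illustration only.
rng = np.random.default_rng(0)
d_in, d_out, n_experts = 4, 3, 2

# Each expert is a simple linear map; the gate scores experts per input.
expert_W = rng.standard_normal((n_experts, d_in, d_out))
gate_W = rng.standard_normal((d_in, n_experts))

def softmax(z):
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def moe_forward(x):
    # Gating weights g(x): a probability distribution over experts.
    g = softmax(x @ gate_W)                       # (batch, n_experts)
    # Expert outputs f_i(x) for every expert i.
    outs = np.einsum('bd,edo->beo', x, expert_W)  # (batch, n_experts, d_out)
    # MoE output: sum_i g_i(x) * f_i(x).
    return np.einsum('be,beo->bo', g, outs)

x = rng.standard_normal((5, d_in))
y = moe_forward(x)
print(y.shape)  # (5, 3)
```

The gate values double as a natural attribution signal: because each prediction is an explicit convex combination of expert outputs, inspecting `g` reveals which expert was responsible for a given input, which is the modular structure the survey's explainability discussion builds on.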