Theory-Trained Deep Neural Networks: Insights, Techniques, and Future Directions

Abstract

Deep neural networks have achieved remarkable success across numerous domains, yet a comprehensive theoretical understanding of their training dynamics and generalization remains a fundamental challenge. Theory-Trained Deep Neural Networks (TT-DNNs) are an emerging paradigm that integrates rigorous theoretical insights into the design and training of neural architectures. This survey provides a systematic overview of TT-DNNs, categorizing key approaches by their theoretical foundations: optimization theory, statistical learning theory, approximation theory, and information theory. We discuss theory-informed training paradigms that improve convergence, robustness, and interpretability, and highlight notable applications in computer vision, natural language processing, scientific computing, and healthcare. Furthermore, we identify open challenges and future directions for bridging the gap between theory and practice. Our aim is to offer a comprehensive resource that fosters deeper understanding and innovation at the intersection of deep learning theory and practice.