Symmetry Breaking in Neural Network Optimization: Insights from Input Dimension Expansion
Abstract
Understanding how neural networks learn and optimize remains a central question in machine learning, with implications for designing better models. While techniques like dropout and batch normalization are widely used, the underlying principles driving their success, such as symmetry breaking (a concept borrowed from physics), remain underexplored. We propose the symmetry breaking hypothesis, showing that breaking symmetries during training (e.g., via input expansion) substantially improves performance across tasks. We develop a metric to quantify symmetry breaking in networks, revealing its role in common optimization methods and its connection to properties like equivariance. This metric offers a practical tool to evaluate architectures without exhaustive training or full datasets, enabling more efficient design choices. Our work positions symmetry breaking as a unifying principle behind optimization techniques, bridging theoretical gaps and providing actionable insights for improving model efficiency.
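The abstract does not specify how input dimension expansion is implemented; the sketch below is one plausible, illustrative instantiation (not the paper's method), in which each input is padded with a small set of extra learned feature dimensions before reaching the base network. The class name `InputExpansion` and the choice of a learned pad vector are assumptions made for illustration only.

```python
import torch
import torch.nn as nn

class InputExpansion(nn.Module):
    """Illustrative input-dimension expansion: append extra feature
    dimensions to each input before the base network sees it.
    The exact expansion scheme is not given in the abstract; a small
    learned pad vector is one plausible (hypothetical) choice."""

    def __init__(self, in_dim: int, extra_dim: int):
        super().__init__()
        self.extra_dim = extra_dim
        # Learned values for the appended dimensions (hypothetical choice;
        # zeros or random noise would be alternative instantiations).
        self.extra = nn.Parameter(torch.randn(extra_dim) * 0.01)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x has shape (batch, in_dim); output has shape (batch, in_dim + extra_dim).
        batch = x.shape[0]
        pad = self.extra.expand(batch, self.extra_dim)
        return torch.cat([x, pad], dim=1)

# Usage: wrap a base MLP so it operates on the expanded input.
expand = InputExpansion(in_dim=784, extra_dim=16)
model = nn.Sequential(
    expand,
    nn.Linear(784 + 16, 256),
    nn.ReLU(),
    nn.Linear(256, 10),
)
```

The intuition, under this reading, is that the extra dimensions perturb otherwise interchangeable units and weights, breaking symmetries in the loss landscape that the optimizer would otherwise have to resolve on its own.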