Renormalization-Group Principles for Deep Neural Architectures

Abstract

Deep learning achieves remarkable success across diverse domains, yet a fundamental theoretical framework explaining why depth enables effective multi-scale representation learning remains incomplete. Here we establish a formal correspondence between neural network depth and renormalization-group (RG) scale transformations from statistical physics. We derive this correspondence through representation dynamics: each layer implements a learned coarse-graining operator that contracts the Fisher information geometry of the data manifold. This framework yields three falsifiable hypotheses with operational definitions: (H1) layer-k representations correspond to effective theories at correlation scale ξ_k, measurable through the Jacobian spectral distribution; (H2) the required network depth scales logarithmically with the intrinsic correlation length of the data distribution; and (H3) RG-inspired architectures with explicit scale structure exhibit improved multi-scale generalization. We validate these predictions on both controlled hierarchical datasets and standard computer vision benchmarks, demonstrating that the depth-correlation scaling follows the theoretically derived exponential decay. This work provides a mathematically disciplined framework for understanding hierarchy in deep learning, offering both conceptual insights and principled architectural guidance.
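
As a concrete illustration of the operational definition in H1, the sketch below (not drawn from the paper's code) estimates the input-to-layer-k Jacobian singular-value spectrum of a small, randomly initialized MLP in JAX. The network widths, the tanh activations, and the use of the spectrum's extremes as a crude proxy for the effective scale ξ_k are all illustrative assumptions, not the authors' protocol.

    # Hypothetical sketch: per-layer Jacobian spectra as a probe of coarse-graining.
    import jax
    import jax.numpy as jnp

    def init_mlp(key, widths):
        """Random MLP weights; widths = [d_in, d_1, ..., d_out]."""
        params = []
        for i in range(len(widths) - 1):
            key, sub = jax.random.split(key)
            w = jax.random.normal(sub, (widths[i], widths[i + 1])) / jnp.sqrt(widths[i])
            params.append(w)
        return params

    def forward_to_layer(params, x, k):
        """Representation at layer k (tanh activations)."""
        h = x
        for w in params[:k]:
            h = jnp.tanh(h @ w)
        return h

    def layer_jacobian_spectrum(params, x, k):
        """Singular values of the input-to-layer-k Jacobian at point x."""
        jac = jax.jacfwd(lambda v: forward_to_layer(params, v, k))(x)
        return jnp.linalg.svd(jac, compute_uv=False)

    key = jax.random.PRNGKey(0)
    params = init_mlp(key, [32, 64, 64, 64, 16])
    x = jax.random.normal(key, (32,))
    for k in range(1, len(params) + 1):
        sv = layer_jacobian_spectrum(params, x, k)
        # Crude proxy for spectral contraction at depth k.
        print(k, float(sv.max()), float(sv.min()))

If the abstract's contraction picture holds, these spectra should contract progressively with k, and the contraction rate could serve as one estimate of the effective scale ξ_k; how that contraction compounds with depth is what H2's logarithmic depth requirement is meant to capture.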