Phase Structure in Continuous Wave Fields Enables Speech Classification Without Backpropagation

Jeremy Slater
Gardar Thorvardsson

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Physical neural networks promise energy-efficient computation by exploiting the intrinsic dynamics of physical substrates, but existing implementations rely on discrete elements or restrict learning to digital readout layers. Here we demonstrate that a continuous Landau-Ginzburg wave field, operating in the underdamped regime, generates phase structure sufficient for classifying spoken words at 74.1% accuracy on a 10-class spoken command recognition task (Google Speech Commands V2) — using only a linear readout and no backpropagation through the physical dynamics. Systematic ablation across ten conditions reveals a three-tier hierarchy of contributions. First, operating in the underdamped regime accounts for ~20 percentage points (pp): a from-scratch baseline trained with EP but initialized in the overdamped regime (γ/ω = 1.0) reaches only 53%, while theoretically-motivated underdamped initialization (γ/ω < 0.05) yields 74.1% (EP fine-tuning contributed an additional 0.81 pp; ablation 7). Second, within the architecture, readout design is decisive: explicit phase extraction — cosine and sine of the phase angle, amplitude, and amplitude gradient — contributes 7.8 points over implicit complex-component encoding, revealing that linear readouts cannot exploit phase information unless it is explicitly projected into trigonometric form. Third, individual physics components — cross-phase modulation, spatial parameter grids, evanescent coupling, and EP fine-tuning of material parameters — each contribute less than 1 percentage point individually, indicating that the underdamped LG regime is robust to specific parameter choices once the architecture is correctly designed. Notably, Equilibrium Propagation drives the lateral inhibition strength toward opposite optima depending on the readout — higher under amplitude measurement (D → 0.27), lower under phase-sensitive measurement (D → 0.025) — demonstrating that EP co-adapts the physical substrate to the measurement apparatus. Every learned parameter in principle maps directly to fabrication specifications for photonic or acoustic hardware.

Version published to 10.21203/rs.3.rs-9205518/v1 on Research Square
Mar 30, 2026

Equilibrium Propagation Discovers Top-Down Feedback for Audio-Visual Binding in Continuous Wave Fields

This article has 2 authors:
1. Jeremy Slater
2. Gardar Thorvardsson
This article has no evaluationsLatest version Apr 15, 2026
Contactless optical decoding of cortical language responses via region-transferable speckle dynamics

This article has 8 authors:
1. Natalya Segal
2. Moshe Bar
3. Daniel Rubinstein
4. Yehor Krapovnytskyi
5. Sergey Agdarov
6. Yevgeny Beiderman
7. Yafim Beiderman
8. Zeev Zalevsky
This article has no evaluationsLatest version Mar 26, 2026
Nonlinear-Feature K-R Receiver for LiFi: Physics-Driven Residual Correction with Closed-Form Per-Slot Training

This article has 1 author:
1. Ramakrishna Pasupuleti
This article has no evaluationsLatest version Apr 2, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Equilibrium Propagation Discovers Top-Down Feedback for Audio-Visual Binding in Continuous Wave Fields

Contactless optical decoding of cortical language responses via region-transferable speckle dynamics

Nonlinear-Feature K-R Receiver for LiFi: Physics-Driven Residual Correction with Closed-Form Per-Slot Training