Streaming Propagation Through Time: A New Computational Paradigm for Recurrent Neural Networks
Abstract
Recurrent Neural Networks (RNNs) are foundational to numerous advances in artificial intelligence, yet their training has for decades relied predominantly on Backpropagation Through Time (BPTT), a paradigm that imposes substantial computational and memory demands on long sequences. The inherently batch-oriented nature of BPTT further constrains RNNs' ability to learn from streaming or online data. Earlier efforts toward online RNN training have been hindered by prohibitive costs in both computation and memory. Here, we introduce Streaming Propagation Through Time (SPTT), a new computational paradigm for RNN training. SPTT employs a streaming low-rank matrix decomposition to decouple gradient computation into two independent components: an optimization direction and an update magnitude. This incremental exploration of the gradient landscape enables efficient long-sequence processing while maintaining learning continuity. Across diverse sequence modeling benchmarks, SPTT outperforms BPTT, with stronger generalization and improved computational efficiency, thereby opening new possibilities for real-time and resource-constrained RNN applications.
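The abstract does not specify SPTT's exact decomposition, so the following is only a minimal rank-1 sketch of the general idea it describes: maintaining a streaming low-rank estimate of the recurrent gradient and decoupling the weight update into a direction (from the smoothed low-rank estimate) and a magnitude (from the current error). The toy task, variable names, and one-step gradient truncation are all illustrative assumptions, not the paper's method.

```python
import numpy as np

rng = np.random.default_rng(0)
n_in, n_h = 4, 8
W_x = rng.normal(0, 0.3, (n_h, n_in))   # input weights (held fixed here)
W_h = rng.normal(0, 0.3, (n_h, n_h))    # recurrent weights, trained online
w_out = rng.normal(0, 0.3, n_h)         # readout weights (held fixed here)

h = np.zeros(n_h)
u = np.zeros(n_h)                       # streaming rank-1 factor (left)
v = np.zeros(n_h)                       # streaming rank-1 factor (right)
beta, lr = 0.9, 0.05
losses = []

for t in range(200):
    x = rng.normal(size=n_in)
    target = x.sum()                    # toy streaming regression target
    h_prev = h
    h = np.tanh(W_x @ x + W_h @ h_prev)
    err = w_out @ h - target
    losses.append(err ** 2)

    # One-step-truncated instantaneous gradient of the squared error w.r.t.
    # W_h is rank-1: outer(delta, h_prev), with delta backpropped through tanh.
    delta = err * w_out * (1.0 - h ** 2)
    u = beta * u + (1.0 - beta) * delta  # exponential streaming factor updates
    v = beta * v + (1.0 - beta) * h_prev

    G = np.outer(u, v)                  # low-rank (rank-1) gradient estimate
    g_norm = np.linalg.norm(G)
    if g_norm > 1e-12:
        direction = G / g_norm          # optimization direction (unit norm)
        magnitude = abs(err)            # update magnitude, decoupled from direction
        W_h -= lr * magnitude * direction

print("final loss:", losses[-1])
```

Because the factors `u` and `v` are vectors, the memory cost of the gradient state is O(n_h) rather than the O(n_h * T) of storing hidden states for BPTT over a length-T window, which is the efficiency property the abstract emphasizes.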