GRAIN: Gated Recurrent Adaptive Integration Network
Abstract
In deep learning, recurrent architectures such as the GRU and LSTM are widely used for sequential data processing, but they can suffer from overfitting, poor generalisation, and unstable hidden-state transitions during training. In this paper, we introduce a modified GRU architecture that incorporates a dynamic exponentially weighted moving average (EWMA) of previous hidden states in order to stabilise the evolution of hidden-state transitions, improve generalisation, and reduce abrupt fluctuations in the hidden state, thereby increasing performance. Experiments on benchmark datasets show that the proposed architecture achieves significant accuracy improvements over existing architectures such as the vanilla LSTM, LSTM with dropout, and vanilla GRU, indicating that incorporating adaptive temporal smoothing within recurrent updates can enhance the robustness and stability of deep sequence models without significant computational overhead.
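To make the idea concrete, the following is a minimal sketch of a GRU cell whose recurrent input is an EWMA of past hidden states rather than the raw previous state. The class name, weight layout, and the smoothing factor `beta` are illustrative assumptions for exposition; the paper's exact gating and smoothing formulation may differ.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class EWMASmoothedGRUCell:
    """Sketch of a GRU cell that feeds an EWMA of past hidden states
    into the gate computations (hypothetical formulation)."""

    def __init__(self, input_size, hidden_size, beta=0.9, seed=0):
        rng = np.random.default_rng(seed)
        s = 1.0 / np.sqrt(hidden_size)
        # One weight matrix per gate: update (z), reset (r), candidate (n).
        # Each acts on the concatenation [x_t ; h_smoothed].
        self.Wz = rng.uniform(-s, s, (hidden_size, input_size + hidden_size))
        self.Wr = rng.uniform(-s, s, (hidden_size, input_size + hidden_size))
        self.Wn = rng.uniform(-s, s, (hidden_size, input_size + hidden_size))
        self.beta = beta  # EWMA smoothing factor (assumed fixed here)

    def forward(self, xs):
        hidden_size = self.Wz.shape[0]
        h = np.zeros(hidden_size)
        h_ewma = np.zeros(hidden_size)  # running EWMA of hidden states
        for x in xs:
            # Smooth the recurrent input with the EWMA of all past states,
            # damping abrupt jumps in the hidden-state trajectory.
            h_ewma = self.beta * h_ewma + (1.0 - self.beta) * h
            xh = np.concatenate([x, h_ewma])
            z = sigmoid(self.Wz @ xh)  # update gate
            r = sigmoid(self.Wr @ xh)  # reset gate
            n = np.tanh(self.Wn @ np.concatenate([x, r * h_ewma]))  # candidate
            h = (1.0 - z) * h_ewma + z * n  # smoothed state transition
        return h
```

Because the EWMA is a running average, the extra cost per step is one elementwise blend of two vectors, consistent with the claim that the smoothing adds no significant computational overhead.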