An Adaptive Attention-Based GRU Framework with Reinforcement Learning for Cold-Start Prediction and Mitigation in Serverless Computing

Saravana Kumar N
Selvakumara Samy S

Abstract

Serverless computing, exemplified by AWS Lambda, suffers from cold-start latencies that impair performance under dynamic workloads. This study proposes an Adaptive Attention-GRU framework integrated with a Deep Deterministic Policy Gradient (DDPG) reinforcement learning agent to predict and preempt cold starts by dynamically adjusting provisioned concurrency. The Adaptive Attention-GRU uses hierarchical attention and multi-scale feature encoding to capture short- and long-term patterns in Lambda metrics collected from AWS CloudWatch, while a dynamic thresholding mechanism triggers pre-warming through Step Functions. The implementation integrates PyTorch Lightning with Ray RLlib. Evaluated on 90 days of real-world traces across 127 functions, the model achieves 93.63% accuracy, 98.61% ROC-AUC, and 92.46% recall, reducing cold starts by 94.7% with only a 23.1% cost increase over baseline scaling. Key innovations include adaptive instantaneous feature weighting, multi-scale temporal modeling, and cost-aware RL autoscaling, offering a validated tool for optimizing latency, cost, and resource efficiency in production serverless environments.
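
The abstract does not specify how the dynamic thresholding mechanism is computed. As a rough illustration only, the sketch below assumes one simple possibility: the threshold is a rolling mean plus a multiple of the rolling standard deviation of recent predicted cold-start probabilities, clipped to a fixed range. The class name `DynamicThreshold`, its parameters (`window`, `k`, `lo`, `hi`), and the sample scores are all hypothetical and are not taken from the preprint.

```python
from collections import deque
from statistics import mean, pstdev


class DynamicThreshold:
    """Hypothetical rolling threshold: mean + k * std of recent scores, clipped to [lo, hi].

    This is an illustrative assumption, not the mechanism described in the preprint.
    """

    def __init__(self, window=5, k=1.0, lo=0.2, hi=0.9):
        self.scores = deque(maxlen=window)  # most recent predicted probabilities
        self.k, self.lo, self.hi = k, lo, hi

    def current(self):
        """Return the threshold implied by the scores seen so far."""
        if len(self.scores) < 2:
            return self.hi  # stay conservative until some history exists
        raw = mean(self.scores) + self.k * pstdev(self.scores)
        return min(self.hi, max(self.lo, raw))

    def should_prewarm(self, score):
        """Compare a new score with the threshold from earlier scores, then record it."""
        trigger = score >= self.current()
        self.scores.append(score)
        return trigger


# Made-up predicted cold-start probabilities for one function.
threshold = DynamicThreshold()
predicted = [0.10, 0.12, 0.11, 0.13, 0.60, 0.15]
decisions = [threshold.should_prewarm(p) for p in predicted]
print(decisions)
```

With these made-up scores, only the spike to 0.60 crosses the threshold. In the framework the abstract describes, such a trigger would lead to a pre-warming action, and the amount of provisioned concurrency would be chosen by the RL agent rather than by a fixed rule like this one.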

Version published to 10.21203/rs.3.rs-8663212/v1 on Research Square
Feb 4, 2026

Related articles

Characterizing Attention-Based Sequence Models for Wireless Edge Cache Replacement: A Short-Horizon Steady-State Baseline

This article has 2 authors:
1. Lalfakzuala Hmar
2. Lalhruaizela Chhangte
This article has no evaluations
Latest version Jan 14, 2026

Multimodal Interest-shifting Sequence Recommendation Algorithm Based on Reinforcement Learning

This article has 10 authors:
1. Changcheng Shao
2. Cheng Zeng
3. Xiaogang Ye
4. Lili Chen
5. Qianyu Zou
6. Hongzhen Zhu
7. Zhouqiang Qiu
8. Yunhua Chen
9. Pinghua Chen
10. Hongsong Zheng
This article has no evaluations
Latest version Jan 29, 2026

A Proactive Virtual Machine Consolidation Framework Based on Multi-Dimensional Workload Awareness and Deep Reinforcement Learning

This article has 4 authors:
1. Guanghao Yang
2. Biying Zhang
3. Yanping Chen
4. Youbo Lyu
This article has no evaluations
Latest version Jan 27, 2026
