Weakly Coupled MDP for Load Balancing in Containerized Cloud: A Scalable Control Design

Adam Houmairi
El Mehdi Kandoussi
Yassine Maleh
Soufyane Mounir

Abstract

Load balancing is a core challenge in containerized cloud systems. We formulate the dispatcher’s decision problem as a weakly coupled Markov Decision Process (MDP) and derive a scalable policy via a Lagrangian linear-programming relaxation that decouples per-VM control. A toy-scale study is first used to expose structural behavior and guide design; we then run full-scale simulations to assess performance and robustness. We prove that the optimal value function is non-decreasing in total backlog and establish a stochastic-dominance corollary. Several appealing conjectures (e.g., per-VM monotonicity) hold at low demand but fail near saturation, clarifying when “balance everything” heuristics become suboptimal. Motivated by these insights, we propose a deployable load-aware dispatcher that blends JSQ-like (join-the-shortest-queue) behavior at low load with advantage-based routing at high load. Across scaled experiments, the dispatcher consistently reduces blocking and tail delay versus standard baselines and competes favorably with a state-of-the-art heuristic, while incurring modest control overhead. The study bridges stochastic control and deployable cloud scheduling, offering both analytical insights and a practical policy design.
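The load-aware dispatcher described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' policy: the abstract does not specify the advantage function, the load measure, or how the two regimes are blended, so everything here is an assumption made for illustration. In particular, `dispatch`, `load_threshold`, and the caller-supplied `advantage` score are hypothetical names, and the hard threshold switch is a simplification of whatever blending the paper actually uses.

```python
from typing import Callable, Sequence


def dispatch(
    queue_lengths: Sequence[int],
    capacities: Sequence[int],
    advantage: Callable[[int, int], float],
    load_threshold: float = 0.7,  # hypothetical switch point, not from the paper
) -> int:
    """Return the index of the VM that receives the next job, or -1 if blocked."""
    # Only VMs with free buffer space can accept a job.
    eligible = [
        i for i, (q, c) in enumerate(zip(queue_lengths, capacities)) if q < c
    ]
    if not eligible:
        return -1  # every buffer is full, so the job is blocked

    # Assumed load measure: total backlog divided by total buffer capacity.
    load = sum(queue_lengths) / sum(capacities)

    if load < load_threshold:
        # Low load: JSQ-like rule, pick the eligible VM with the shortest queue.
        return min(eligible, key=lambda i: queue_lengths[i])

    # High load: pick the eligible VM with the highest advantage score.
    # The score function is a placeholder supplied by the caller.
    return max(eligible, key=lambda i: advantage(i, queue_lengths[i]))


# Illustrative score (an assumption): service rate divided by backlog plus one.
rates = [1.0, 4.0, 2.0]


def rate_score(vm: int, backlog: int) -> float:
    return rates[vm] / (backlog + 1)


low_load_choice = dispatch([1, 3, 2], [10, 10, 10], rate_score)
high_load_choice = dispatch([8, 9, 9], [10, 10, 10], rate_score)
blocked_choice = dispatch([10, 10], [10, 10], rate_score)
```

In the high-load call, plain JSQ would route to VM 0 (the shortest queue), whereas the score-based rule prefers VM 1 because its assumed service rate is higher. This mirrors, in a simplified way, the abstract's observation that "balance everything" heuristics can become suboptimal near saturation.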

Version published to 10.21203/rs.3.rs-8051212/v1 on Research Square
Nov 28, 2025

Related articles

Latency-Aware Service Placement on the Fog–Edge–Cloud Continuum via Integer Programming

This article has 1 author:
1. Deo Shankar
This article has no evaluations. Latest version Feb 4, 2026

Research on OTA Task Scheduling and Adaptive Fault-Tolerance Algorithm Under Cloud-Edge Collaboration

This article has 2 authors:
1. Wei Zhang
2. Michael R. Lewis
This article has no evaluations. Latest version Jan 16, 2026

Near-Optimal Universal Scheduling for Moldable Tasks: The Fair Algorithm

This article has 3 authors:
1. Lucas Perotin
2. Thomas Verrecchia
3. Padma Raghavan
This article has no evaluations. Latest version Jan 14, 2026