Multi-Agent Federated Reinforcement Learning for Distributed MPTCP Agents
Abstract
The integration of Multi-access Edge Computing (MEC) with low Earth orbit (LEO) satellite constellations is a promising paradigm for global, low-latency connectivity. However, the dynamic topology and heterogeneous link qualities of satellite networks pose significant challenges for efficient Multipath TCP (MPTCP) scheduling. Traditional schedulers, often based on heuristics or designed for fixed-size inputs, struggle to adapt to a variable number of available paths. We propose a novel Multi-Agent Federated Reinforcement Learning (MAFRL) framework that leverages Set Transformers for permutation-invariant encoding of variable path sets. Each agent learns a local policy using Proximal Policy Optimization (PPO), augmented with a soft fairness constraint to ensure equitable performance. A federated learning scheme using FedProx aggregation enables collaborative training across distributed agents without sharing raw data, preserving privacy and improving robustness to non-IID data. Extensive emulation experiments show that our approach outperforms heuristic and learning-based baselines in aggregate throughput, latency, and fairness, particularly under path variability. This work demonstrates the viability of set-based learning and federated optimization for intelligent resource management in next-generation satellite-terrestrial networks.
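The FedProx aggregation mentioned in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the least-squares objective, client data, learning rate, and proximal weight `mu` are illustrative assumptions, and the RL policy is replaced by a linear model purely to show the mechanics. The key FedProx idea is that each client optimizes its local loss plus a proximal term `(mu/2) * ||w - w_global||^2`, which limits drift from the shared model under non-IID data, and the server then averages the client updates.

```python
import numpy as np

def fedprox_local_update(w_global, X, y, mu=0.1, lr=0.05, steps=50):
    """One client's local training on a toy least-squares loss,
    plus the FedProx proximal term (mu/2)*||w - w_global||^2
    that keeps local weights near the shared model."""
    w = w_global.copy()
    for _ in range(steps):
        grad = X.T @ (X @ w - y) / len(y)   # gradient of local loss
        grad += mu * (w - w_global)         # gradient of proximal term
        w -= lr * grad
    return w

def fedprox_round(w_global, client_data, mu=0.1):
    """One federated round: each client updates locally,
    the server averages the resulting models."""
    updates = [fedprox_local_update(w_global, X, y, mu) for X, y in client_data]
    return np.mean(updates, axis=0)

rng = np.random.default_rng(0)
w_true = np.array([1.0, -2.0])

# Two clients with differently scaled (non-IID) feature distributions.
clients = []
for scale in (1.0, 3.0):
    X = rng.normal(scale=scale, size=(100, 2))
    y = X @ w_true + rng.normal(scale=0.1, size=100)
    clients.append((X, y))

w = np.zeros(2)
for _ in range(20):
    w = fedprox_round(w, clients)
```

After 20 rounds, `w` converges close to `w_true` even though the clients' data distributions differ; in the paper's setting the same aggregation would be applied to the agents' policy parameters rather than a linear model.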