Multi-Attention Meets Pareto Optimization: A Reinforcement Learning Method for Adaptive UAV Formation Control
Abstract
Autonomous multi-UAV formation control in cluttered urban environments remains challenging due to partial observability, dense and dynamic obstacles, and conflicting objectives (task efficiency, energy use, and safety). Yet many MARL-based approaches still collapse these vector-valued objectives into a single hand-tuned reward and lack selective information fusion, leading to brittle trade-offs and poor scalability in urban clutter. We introduce a model-agnostic MARL framework, instantiated on MADDPG for concreteness, that augments a CTDE backbone with three lightweight attention modules (self, inter-agent, and entity) for selective information fusion, plus a Pareto optimization module that maintains a compact archive of non-dominated policies to adaptively guide objective trade-offs using simple, interpretable rewards rather than fragile weightings. On city-scale navigation tasks, the approach improves final team success by 13-27 percentage points for N = 2-5 agents while simultaneously reducing collisions, tightening formation, and lowering control effort. These gains require no algorithm-specific tuning and scale smoothly beyond two agents, underscoring a stronger safety-efficiency trade-off and robust applicability in cluttered, partially observable settings.
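The abstract does not spell out how the three attention branches combine an agent's own features with teammate and obstacle information. The following is a minimal sketch of one plausible realization, assuming PyTorch and assuming each branch is standard multi-head attention with the agent's own tokens as queries; the class name, dimensions, and mean-pooling choice are illustrative assumptions, not the authors' code.

```python
import torch
import torch.nn as nn

class AttentionFusion(nn.Module):
    """Hypothetical three-branch fusion: self-attention over the agent's
    own feature tokens, cross-attention over teammate embeddings, and
    cross-attention over entity (obstacle) embeddings, concatenated into
    one fused observation vector for the decentralized actor."""

    def __init__(self, dim: int = 64, heads: int = 4):
        super().__init__()
        self.self_attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.agent_attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.entity_attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.out = nn.Linear(3 * dim, dim)

    def forward(self, own, teammates, entities):
        # own:       (B, T, dim) agent's own feature tokens
        # teammates: (B, N-1, dim) embeddings of the other agents
        # entities:  (B, E, dim) embeddings of nearby obstacles/entities
        s, _ = self.self_attn(own, own, own)
        a, _ = self.agent_attn(own, teammates, teammates)
        e, _ = self.entity_attn(own, entities, entities)
        # Pool each branch over its token axis, then fuse linearly.
        fused = torch.cat([s.mean(1), a.mean(1), e.mean(1)], dim=-1)
        return self.out(fused)  # (B, dim) fused input to the policy head
```

Using the agent's own tokens as queries in the two cross-attention branches lets each UAV weight teammates and obstacles by relevance to its own state, which is the "selective information fusion" the abstract refers to.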
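Similarly, the abstract only states that the Pareto module keeps a compact archive of non-dominated policies. A minimal sketch of such an archive, assuming all objectives are scaled to be maximized and assuming nearest-neighbor crowding as the pruning rule (both assumptions of this sketch, not details from the paper):

```python
import numpy as np

def dominates(a: np.ndarray, b: np.ndarray) -> bool:
    """True if objective vector `a` Pareto-dominates `b`
    (all objectives assumed to be maximized)."""
    return bool(np.all(a >= b) and np.any(a > b))

class ParetoArchive:
    """Compact archive of non-dominated (objectives, policy) pairs."""

    def __init__(self, capacity: int = 20):
        self.capacity = capacity
        self.entries: list[tuple[np.ndarray, object]] = []

    def insert(self, objectives: np.ndarray, policy: object) -> bool:
        # Reject candidates dominated by any archived entry.
        if any(dominates(obj, objectives) for obj, _ in self.entries):
            return False
        # Drop archived entries the candidate dominates.
        self.entries = [(obj, pol) for obj, pol in self.entries
                        if not dominates(objectives, obj)]
        self.entries.append((objectives.copy(), policy))
        # Keep the archive compact by pruning the most crowded entry.
        if len(self.entries) > self.capacity:
            self._prune_most_crowded()
        return True

    def _prune_most_crowded(self):
        objs = np.stack([obj for obj, _ in self.entries])
        # Crowding = distance to the nearest neighbor in objective space;
        # the entry with the smallest such distance is the most redundant.
        d = np.linalg.norm(objs[:, None] - objs[None, :], axis=-1)
        np.fill_diagonal(d, np.inf)
        del self.entries[int(np.argmin(d.min(axis=1)))]
```

An archive like this can steer training without a single hand-tuned scalarization: each objective (success, collisions, formation error, control effort) stays a separate reward channel, and the non-dominated set records which trade-offs are currently achievable.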