Autonomous navigation with minimal sensors in dynamic warehouse environments: a multi-agent reinforcement learning approach with curriculum learning enhancement

Abstract

This paper introduces an approach to augmenting the capabilities of autonomous mobile robots (AMRs) in dynamic, multi-agent warehouse environments. We propose an integrated system that combines Multi-Agent Deep Deterministic Policy Gradient (MADDPG), a state-of-the-art multi-agent reinforcement learning (MARL) algorithm, with Curriculum Learning (CL). This hybrid method is tailored to the complex demands of warehouse operations. Our approach is distinguished by the development of decentralized policies within a centralized training framework, with complexity introduced progressively through a carefully structured curriculum. This strategy enables autonomous agents to master sophisticated navigation and coordination tactics, adapting to the variable requirements of real-time warehouse scenarios. Through comprehensive experiments, we demonstrate significant improvements in autonomous decision-making and operational efficiency, validating the effectiveness of merging MARL with CL in complex multi-agent settings. Our findings align with recent advancements in MARL applications for robotics [1] and underscore the potential of curriculum-based learning strategies in complex environments.
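To make the curriculum mechanism concrete, the following is an illustrative sketch (not the authors' implementation; all class names, stage parameters, and thresholds are hypothetical) of a scheduler that advances warehouse-environment difficulty, such as agent count and obstacle density, once the agents' rolling success rate clears a threshold:

```python
# Illustrative curriculum-learning sketch for a multi-agent warehouse task.
# All names and parameter values are hypothetical, chosen for demonstration.

from dataclasses import dataclass


@dataclass
class CurriculumStage:
    n_agents: int            # number of AMRs in the episode
    obstacle_density: float  # fraction of grid cells blocked by shelves
    dynamic_obstacles: bool  # whether moving obstacles (e.g. workers) appear


# Hypothetical curriculum: complexity grows stage by stage.
STAGES = [
    CurriculumStage(n_agents=2, obstacle_density=0.05, dynamic_obstacles=False),
    CurriculumStage(n_agents=4, obstacle_density=0.15, dynamic_obstacles=False),
    CurriculumStage(n_agents=8, obstacle_density=0.25, dynamic_obstacles=True),
]


class CurriculumScheduler:
    """Advance to the next stage when the rolling success rate clears a threshold."""

    def __init__(self, stages, threshold=0.9, window=100):
        self.stages = stages
        self.threshold = threshold  # success rate required to advance
        self.window = window        # number of recent episodes considered
        self.idx = 0
        self.results = []           # 1 = episode succeeded, 0 = failed

    @property
    def stage(self):
        return self.stages[self.idx]

    def record(self, success: bool):
        """Log an episode outcome and advance the stage if warranted."""
        self.results.append(1 if success else 0)
        recent = self.results[-self.window:]
        if (len(recent) == self.window
                and sum(recent) / self.window >= self.threshold
                and self.idx < len(self.stages) - 1):
            self.idx += 1
            self.results.clear()  # reset statistics for the new stage
```

In a MADDPG training loop, each episode would be configured from `scheduler.stage` before rollout and reported back via `scheduler.record(...)` afterward; the centralized critic sees all agents' observations during training, while execution remains decentralized.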
