Multi-AGV Path Planning Using Deep Reinforcement Learning with Internal Curiosity

Huilin Yin
Shengkai Su
Yinjia Lin
Karin Festl
Jun Yan
Daniel Watzenig

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Deep Reinforcement Learning (DRL) is promising for multi-agent path planning problems in which sparse external environmental rewards may cause the agent group to make overly conservative decisions and explore the environment inefficiently. In general, the reward shaping mechanism is used to mitigate the above problems with the additional reward function setting. However, it requires specific domain knowledge, which limits the general applicability, and the added reward functions are not necessarily applicable to all environments. This paper aims to improve the path planning efficiency of single agents and groups of agents with the Internal Curiosity Module (ICM) mechanism to boost the generalization abilities of the agents in different environments. To this end, we incorporate the internal curiosity mechanism into the soft actor-critic model for enhancing exploration strategies, adapting to environmental changes, and improving learning effectiveness. Then, we propose a multi-agent path planning method in which the curiosity mechanism is integrated with the Multi-Agent POsthumous Credit Assignment (MA-POCA) algorithm. The neural networks can automatically calculate the additional intrinsic rewards based on observed information about the environment and the actions taken by the group of agents. Based on our experiments, we make qualitative and quantitative analyses of the performance of the proposed methods and the baseline DRL methods. The experimental results show that our proposed methods can decline the number of learning episodes and the training time of path planning, so the proposed algorithms can accelerate the exploration of single agents or agent groups in the sparse reward environment.

Version published to 10.21203/rs.3.rs-4453111/v1 on Research Square
Jun 17, 2024

Robotic pursuit evasion problem in a constrained game area using deep reinforcement learning and self-play training

This article has 4 authors:
1. Chiraz BEN JABEUR
2. Hassene SEDDIK
3. Khaled KHNISSI
4. Ahmad HABLY
This article has no evaluationsLatest version Apr 1, 2025
Transformer-Enhanced Deep Q-Learning for AdaptiveRobot Path Planning in Dynamic Environments

This article has 4 authors:
1. Harish Sharma HS
2. Ritu Tiwari RT
3. Shubham Shukla SS
4. Sushant Kumar SK
This article has no evaluationsLatest version Apr 28, 2025
CCL: Collaborative Curriculum Learning for Sparse-Reward Multi-Agent Reinforcement Learning via Co-evolutionary Task Evolution

This article has 7 authors:
1. Yufei Lin
2. Chengwei Ye
3. Huanzhen Zhang
4. Kangsheng Wang
5. Linuo Xu
6. Shuyan Liu
7. Zeyu Zhang
This article has no evaluationsLatest version May 7, 2025

Listed in

Abstract

Article activity feed

Related articles

Robotic pursuit evasion problem in a constrained game area using deep reinforcement learning and self-play training

Transformer-Enhanced Deep Q-Learning for AdaptiveRobot Path Planning in Dynamic Environments

CCL: Collaborative Curriculum Learning for Sparse-Reward Multi-Agent Reinforcement Learning via Co-evolutionary Task Evolution