Deep reinforcement learning based coverage path planning in unknown environments
Abstract
The Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm offers a robust solution for the coverage path planning problem, where a robot must cover a designated area with minimal redundancy and maximum coverage. Traditional path planning methods often lack the adaptability required for dynamic and unstructured environments. In contrast, TD3 uses twin Q-networks to reduce overestimation bias, delayed policy updates for increased training stability, and target policy smoothing to keep the robot's path transitions smooth. These features allow the robot to learn an optimal path strategy in real time, effectively balancing exploration and exploitation. This paper explores the application of TD3 to coverage path planning, demonstrating that it enables a robot to adaptively and efficiently navigate complex coverage tasks, with significant advantages over conventional methods in coverage rate, total path length, and adaptability.
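The two critic-side mechanisms named above (twin Q-networks with a clipped double-Q target, and target policy smoothing) can be sketched in a few lines. This is a minimal illustrative sketch, not the paper's implementation; the function names, noise parameters, and action bounds are assumptions chosen to match the standard TD3 formulation.

```python
import numpy as np

rng = np.random.default_rng(0)

def smoothed_target_action(policy_action, noise_std=0.2, noise_clip=0.5,
                           action_low=-1.0, action_high=1.0):
    """Target policy smoothing: perturb the target policy's action with
    clipped Gaussian noise before evaluating the target Q-values, so the
    value estimate is smoothed over nearby actions.
    (noise_std, noise_clip, and the action bounds are assumed defaults.)"""
    noise = np.clip(rng.normal(0.0, noise_std, size=np.shape(policy_action)),
                    -noise_clip, noise_clip)
    return np.clip(policy_action + noise, action_low, action_high)

def td3_critic_target(reward, done, q1_next, q2_next, gamma=0.99):
    """Clipped double-Q target: taking the minimum of the two target
    critics' estimates reduces the overestimation bias that a single
    Q-network would accumulate."""
    return reward + gamma * (1.0 - done) * min(q1_next, q2_next)
```

The third mechanism, delayed policy updates, simply means the actor (and the target networks) are updated once every few critic updates, e.g. every second gradient step, which stabilizes learning by letting the value estimates settle before the policy chases them.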