Trust Guided Reinforcement Learning for Safe Robot Navigation with Dynamic Window Approach
Abstract
End-to-end deep reinforcement learning (DRL) policies offer flexible navigation capabilities but often suffer from poor generalization and unsafe behaviors in unseen or complex environments. In contrast, classical local planners such as the Dynamic Window Approach (DWA) provide strong short-term safety guarantees, yet frequently fail in cluttered static scenes due to limited-horizon reasoning. To bridge this gap, we propose Trust-SAC, a novel trust-aware reinforcement learning framework that enables an agent to dynamically assess the reliability of its own actions by comparing them against a DWA expert—without executing the expert’s commands. The policy learns to output both control actions $(v, \omega)$ and a scalar trust weight $\tau$, which modulates a trust-based reward derived from the critic’s evaluation of the policy’s action versus the expert’s. This mechanism allows the agent to adaptively balance exploration, efficiency, and safety based on real-time environmental risk. Evaluated across four diverse Gazebo environments of increasing complexity—including one in which DWA fails entirely—Trust-SAC achieves significantly higher task success rates than SAC, PPO, and DWA, while maintaining competitive path efficiency. Our results highlight that embedding a learnable self-assessment mechanism grounded in expert comparison can enhance the robustness and generalization of end-to-end navigation policies without compromising their autonomy.
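The abstract describes the trust weight $\tau$ modulating a reward derived from the critic's comparison of the policy's action against the DWA expert's. The exact formulation is not given here; the sketch below is only one plausible reading, in which $\tau$ scales the critic's advantage of the policy action over the expert action. The function name and the additive form are assumptions, not the paper's definition.

```python
def trust_bonus(q_policy: float, q_expert: float, tau: float) -> float:
    """Hypothetical trust-based reward term: the learned trust weight tau
    scales the critic's advantage of the policy's action over the DWA
    expert's action. Illustrative only; the paper's exact reward may differ."""
    return tau * (q_policy - q_expert)

# Toy example: critic values for the policy's and the expert's actions.
# With tau = 0 the expert comparison is ignored; with tau = 1 the full
# advantage is added to the task reward.
q_pi, q_dwa = 1.2, 0.9
for tau in (0.0, 0.5, 1.0):
    print(f"tau={tau:.1f} -> bonus={trust_bonus(q_pi, q_dwa, tau):+.2f}")
```

Under this reading, a high trust weight rewards the agent when its own action is valued above the expert's, while a low trust weight lets the agent defer to task reward alone in risky scenes.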