Deep Reinforcement Learning for Personalized Clinical Decision Support

Abstract

Clinical decision making is inherently sequential and must consider the complex, high-dimensional, and rapidly changing physiological state of each patient. Traditional supervised models provide static risk predictions, but they cannot suggest how to act over time. Deep learning (DL) excels at extracting concise representations from multimodal electronic health record (EHR) data, while reinforcement learning (RL) optimizes sequences of actions to maximize long-term rewards. Their combination—deep reinforcement learning (DRL)—has become a natural approach for personalized, data-driven treatment planning. This paper presents a comprehensive DRL study in health informatics, written with full methodological transparency to ensure reproducibility. After an extensive review of the literature covering critical care, oncology, hospital operations, and safe-RL techniques, we detail the construction of two benchmark tasks: (i) fluid–vasopressor titration for septic shock patients in the publicly available MIMIC-IV v3.1 ICU database, and (ii) bed-capacity management in a high-fidelity hospital-operations simulator. A multimodal transformer encoder feeds weighted-dueling–double-DQN (WD-DDQN) and DQN agents trained with Conservative Q-Learning (CQL) regularization. Rigorous offline policy evaluation (OPE) using doubly robust and importance sampling estimators, with bootstrapped confidence intervals, demonstrates statistically significant improvements: the learned policies reduce estimated 90-day mortality by 10.3% and average length of stay by 10.5% compared to historical clinician behavior, without violating predefined safety constraints. We conclude by discussing interpretability, fairness, regulatory pathways, and open technical challenges such as causal RL, continual learning under distribution drift, and benchmark ecosystems.
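The Conservative Q-Learning regularization mentioned above can be illustrated with a minimal sketch. For discrete actions, the CQL penalty pushes down a log-sum-exp over all Q-values while pushing up the Q-value of the action actually logged in the data, discouraging overestimation of out-of-distribution actions. The function below is a hypothetical numpy illustration, not the authors' implementation; array shapes and names are assumptions.

```python
import numpy as np

def cql_penalty(q_values, logged_actions):
    """Conservative Q-Learning regularizer (sketch for discrete actions).

    q_values:       (batch, n_actions) array of Q estimates.
    logged_actions: (batch,) integer indices of actions observed in the data.
    Returns the mean of logsumexp_a Q(s, a) - Q(s, a_logged), which is added
    to the ordinary TD loss with a trade-off weight alpha.
    """
    # Numerically stable log-sum-exp over the action dimension.
    q_max = q_values.max(axis=1, keepdims=True)
    logsumexp = np.log(np.exp(q_values - q_max).sum(axis=1)) + q_max[:, 0]
    # Q-value of the action the clinician actually took.
    q_logged = q_values[np.arange(len(logged_actions)), logged_actions]
    return np.mean(logsumexp - q_logged)
```

When all actions share the same Q-value, the penalty reduces to log(n_actions), its minimum over behavior-consistent policies; it grows as the agent inflates Q-values for actions absent from the logged data.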
Our findings reinforce the potential of DRL as a key component of the next generation of clinical decision-support systems.
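For readers unfamiliar with the doubly robust estimator used in the offline policy evaluation, the sketch below shows its one-step (bandit) form: a model-based value estimate corrected by an importance-weighted residual, so the estimate stays consistent if either the importance weights or the reward model is accurate. This is an illustrative assumption-laden simplification, not the paper's multi-step implementation; all names are hypothetical.

```python
import numpy as np

def doubly_robust_value(rewards, behavior_probs, target_probs, q_hat, v_hat):
    """One-step doubly robust off-policy value estimate.

    rewards:        observed rewards under the behavior (clinician) policy.
    behavior_probs: probability the behavior policy assigned to each logged action.
    target_probs:   probability the evaluated policy assigns to that action.
    q_hat:          model estimate of the reward at each logged (state, action).
    v_hat:          model estimate of the evaluated policy's value at each state.
    """
    rho = target_probs / behavior_probs          # per-sample importance weight
    # Model baseline plus importance-weighted correction of the model's error.
    return np.mean(v_hat + rho * (rewards - q_hat))
```

With a perfect reward model (q_hat equal to the observed rewards) the correction term vanishes and the estimate is purely model-based; with a zero model it collapses to plain importance sampling, which is the sense in which the estimator is "doubly" robust.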