Deep Diffusion Reinforcement Learning for Options Hedging
Abstract
Options hedging in real-world markets is complicated by strong asset correlations, partial observability, and nonstationary price dynamics. These conditions limit the effectiveness of traditional strategies and challenge reinforcement learning (RL) methods, which often overfit or become unstable in high-dimensional environments. Existing RL hedging approaches rely on static risk measures and inflexible policy classes, which undermines their stability, adaptability, and generalization. To address these challenges, we propose Deep Diffusion Reinforcement Learning (DDRL), a new RL framework that integrates the Soft Actor-Critic (SAC) algorithm with diffusion-based generative policy networks for dynamic hedging. By modeling the policy distribution through a denoising diffusion process, DDRL captures nonlinear dependencies and generates more robust actions. To improve training stability, DDRL also incorporates double critic networks, entropy regularization, and soft target updates, among other technical enhancements. We evaluate DDRL in a simulated trading environment built on historical market data. Empirical results show that DDRL reduces hedging losses and transaction costs relative to baseline RL methods while maintaining stable performance across varied portfolio configurations and market conditions. These results highlight the potential of generative diffusion policies to improve the robustness and reliability of RL-based financial decision-making.
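To make the core mechanism concrete, the following is a minimal sketch (not the authors' code) of a diffusion-based policy head of the kind the abstract describes: a DDPM-style sampler that produces a hedging action by iteratively denoising Gaussian noise conditioned on the market state. The class name `DiffusionPolicy`, the linear beta schedule, the simple timestep embedding, and the `tanh` action squashing are all illustrative assumptions; the paper's actual architecture and hyperparameters may differ.

```python
# Minimal sketch of a diffusion policy for hedging, assuming PyTorch and a
# DDPM-style reverse process. Illustrative only; not the paper's implementation.
import torch
import torch.nn as nn


class DiffusionPolicy(nn.Module):
    """Samples actions by denoising Gaussian noise, conditioned on the state."""

    def __init__(self, state_dim, action_dim, hidden=256, n_steps=10):
        super().__init__()
        self.n_steps = n_steps
        self.action_dim = action_dim
        # Linear beta schedule (a hypothetical choice).
        betas = torch.linspace(1e-4, 0.02, n_steps)
        alphas = 1.0 - betas
        self.register_buffer("betas", betas)
        self.register_buffer("alphas", alphas)
        self.register_buffer("alpha_bar", torch.cumprod(alphas, dim=0))
        # Noise-prediction network eps_theta(a_t, s, t).
        self.eps_net = nn.Sequential(
            nn.Linear(state_dim + action_dim + 1, hidden), nn.SiLU(),
            nn.Linear(hidden, hidden), nn.SiLU(),
            nn.Linear(hidden, action_dim),
        )

    def predict_noise(self, a_t, state, t):
        # Normalize the integer timestep to [0, 1] as a crude embedding.
        t_emb = t.float().unsqueeze(-1) / self.n_steps
        return self.eps_net(torch.cat([a_t, state, t_emb], dim=-1))

    @torch.no_grad()
    def sample(self, state):
        """Reverse diffusion: start from pure noise and denoise step by step."""
        a = torch.randn(state.shape[0], self.action_dim, device=state.device)
        for t in reversed(range(self.n_steps)):
            t_batch = torch.full((state.shape[0],), t, device=state.device)
            eps = self.predict_noise(a, state, t_batch)
            alpha, alpha_bar, beta = self.alphas[t], self.alpha_bar[t], self.betas[t]
            # Standard DDPM posterior mean for the previous step.
            a = (a - beta / torch.sqrt(1.0 - alpha_bar) * eps) / torch.sqrt(alpha)
            if t > 0:
                a = a + torch.sqrt(beta) * torch.randn_like(a)
        return torch.tanh(a)  # squash to a bounded hedge-ratio range


if __name__ == "__main__":
    policy = DiffusionPolicy(state_dim=8, action_dim=1)
    states = torch.randn(4, 8)       # batch of 4 hypothetical market states
    actions = policy.sample(states)  # hedge ratios in (-1, 1)
    print(actions.shape)             # torch.Size([4, 1])
```

In an SAC-style training loop of the kind the abstract lists, this policy would be paired with twin critics, an entropy bonus, and Polyak-averaged target networks; note that the `@torch.no_grad()` wrapper here is for rollout sampling only, since training the noise network against the critics requires gradients through (or a surrogate for) the denoising chain, which this sketch omits.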