Comparative Analysis of Reinforcement Learning Approaches for Dynamic Pricing of Perishable Goods


Abstract

Dynamic pricing of perishable products is a challenging optimization problem involving limited shelf life, stochastic demand, and inventory capacity constraints. Fixed or rule-based pricing policies fail to adapt to market movements and therefore leave revenue unrealized. In this research, we study the use of reinforcement learning (RL) techniques for learning adaptive pricing policies that maximize profitability and inventory utilization. We train and compare four leading RL methods: Deep Q-Networks (DQN), Double DQN (DDQN), Proximal Policy Optimization (PPO), and Quantile Regression DQN (QR-DQN), in a simulated retail setting with price- and age-sensitive demand. We benchmark the RL agents against fixed-price policies on revenue, inventory loss, and pricing behavior. Our findings show that PPO attains the highest revenue with minimal waste, outperforming both the baselines and the other learning-based methods.
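The simulated retail setting described above could be sketched as a simple episodic environment. The following is a minimal, illustrative implementation, not the article's actual simulator: the linear price sensitivity, multiplicative age decay, Gaussian demand noise, and all parameter values are assumptions chosen for clarity.

```python
import random

class PerishablePricingEnv:
    """Illustrative perishable-goods pricing environment (all parameters hypothetical).

    One batch of stock with a fixed shelf life; expected demand falls
    linearly in price and decays multiplicatively as the stock ages.
    """

    def __init__(self, shelf_life=10, initial_stock=100, base_demand=20.0,
                 price_sensitivity=2.0, age_sensitivity=0.05, seed=0):
        self.shelf_life = shelf_life
        self.initial_stock = initial_stock
        self.base_demand = base_demand
        self.price_sensitivity = price_sensitivity
        self.age_sensitivity = age_sensitivity
        self.rng = random.Random(seed)
        self.reset()

    def reset(self):
        self.stock = self.initial_stock
        self.age = 0
        return (self.stock, self.age)

    def step(self, price):
        # Expected demand: linear in price, decaying with product age.
        mean = max(0.0, self.base_demand - self.price_sensitivity * price)
        mean *= max(0.0, 1.0 - self.age_sensitivity * self.age)
        demand = max(0, round(self.rng.gauss(mean, 2.0)))
        sold = min(self.stock, demand)
        self.stock -= sold
        self.age += 1
        done = self.age >= self.shelf_life or self.stock == 0
        waste = self.stock if done else 0  # unsold units at expiry are discarded
        return (self.stock, self.age), sold * price, done, waste


# Rolling out a fixed-price baseline of the kind the RL agents are compared to:
env = PerishablePricingEnv()
state, total_revenue, done = env.reset(), 0.0, False
while not done:
    state, reward, done, waste = env.step(price=5.0)
    total_revenue += reward
```

An RL agent replaces the constant `price=5.0` with an action chosen from the state `(stock, age)`, trading off higher margins early in the shelf life against clearing inventory before it expires.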
