Reinforcement Learning-Based Optimization Strategy for Online Advertising Budget Allocation

Mengfei Yang
Qiong Cao
Lingyun Tong
Jiawen Shi

Abstract

This paper proposes a reinforcement learning-based optimization framework that defines a structured state space (real-time conversion rates, channel ROI, historical CTR), action space (budget-compliant allocations), and reward function (balancing revenue, cost, and placement effectiveness). To enhance adaptability, we introduce a multi-channel synergy mechanism using behavioral correlation matrices and a time-sequence update model for predictive, real-time budget adjustment. Trained with Proximal Policy Optimization (PPO) in a high-fidelity simulation, the model outperforms traditional rule-based and DQN baselines in CTR (+8.7%), ROI (+12.4%), and policy stability, while reducing latency and memory usage.
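The abstract names the components of the decision process but does not give their exact form, so the following is only a minimal sketch of how such a state, action, and reward might be wired together. The softmax projection used to keep allocations within budget, the linear reward, and the weights `w_rev`, `w_cost`, and `w_eff` are assumptions made for illustration, not the authors' formulation.

```python
import numpy as np


def build_state(conversion_rates, channel_roi, historical_ctr):
    """Concatenate per-channel features into one flat state vector."""
    return np.concatenate([
        np.asarray(conversion_rates, dtype=float),
        np.asarray(channel_roi, dtype=float),
        np.asarray(historical_ctr, dtype=float),
    ])


def allocate_budget(policy_logits, total_budget):
    """Map unconstrained policy outputs to a budget-compliant allocation.

    Assumption: a softmax turns the logits into non-negative shares that
    sum to 1, so the allocation always spends exactly `total_budget`.
    """
    z = np.asarray(policy_logits, dtype=float)
    z = z - z.max()  # subtract the max for numerical stability
    shares = np.exp(z) / np.exp(z).sum()
    return total_budget * shares


def reward(revenue, cost, effectiveness, w_rev=1.0, w_cost=1.0, w_eff=0.1):
    """Hypothetical linear trade-off between revenue, cost, and placement effectiveness."""
    return w_rev * revenue - w_cost * cost + w_eff * effectiveness


# Usage with three hypothetical channels.
state = build_state([0.02, 0.05, 0.01], [1.4, 2.1, 0.9], [0.03, 0.04, 0.02])
allocation = allocate_budget([0.2, 1.0, -0.5], total_budget=1000.0)
```

In a PPO setup, the policy network would emit the logits passed to `allocate_budget`, and the environment would return `reward(...)` after each allocation step. A softmax is one simple way to make every action feasible by construction; a projection or penalty term would be alternatives.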

Version published to 10.20944/preprints202505.2063.v1
May 28, 2025

Related articles

Evaluating Reinforcement Learning algorithms for LunarLander-v2: A Comparative Analysis
Author: Anurag Awasthi
No evaluations. Latest version: Apr 15, 2025

Reinforcement Learning Based Multi-Stage Ad Sorting and Personalized Recommendation System Design
Authors: Mengfei Yang, Yunyang Wang, Jiawen Shi, Lingyun Tong
No evaluations. Latest version: May 26, 2025

Reinforcement Learning for Optimal Replenishment in Stochastic Assembly Systems
Authors: Lativa Sid Ahmed Abdellahi, Zeinebou Zoubeir, Yahya Mohamed, Ahmedou Haouba, Sidi Hmetty
No evaluations. Latest version: May 27, 2025
