Cost-Optimal Coordination for Peak Demand Reduction in Saudi Residential Buildings Using Physics-Informed Deep Reinforcement Learning

Hamzah Faraj

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

When multiple On/Off split air-conditioning units in Saudi residential buildings ac-tivate simultaneously, the resulting peak demand spike stresses the electrical gridand inflates monthly bills under the kingdom’s two-tier tariff (0.18 SAR/kWh ≤6,000 kWh; 0.30 SAR/kWh above). This paper proposes a Physics-Informed ProximalPolicy Optimization (PI-PPO) framework that learns a stationary scheduling policy—applicable over an infinite time horizon without re-solving any optimization—to co-ordinate 18,500 BTU On/Off split units (1.8 kW input, EER 10.25) across multiplezones. Each zone is abstracted as a scheduling task with formally analyzed minimumutilization and feasibility conditions. The model incorporates inter-zone thermal cou-pling, enabling the scheduler to exploit thermal buffering through shared walls. PI-PPO embeds heat balance equations directly into the reinforcement learning reward,yielding a controller that maintains thermal comfort within the specified bounds at alltimes—a guarantee absent from standard deep reinforcement learning methods. Wefurther show that extending the comfort range by ±1◦C (from 23–25◦C to 22–26◦C)reduces each zone’s minimum utilization by 36.9%. Simulations using EnergyPluswith Jeddah weather data across four months (January, April, July, October) showthat PI-PPO reduces peak demand by 40–60% and July cost by 22.5% for a 5-zonevilla, rising to 47.0% for a 20-zone compound with comfort extension. Ablation stud-ies attribute 6.0 percentage points to physics-informed shaping, 4.5 pp to tiered-tariffawareness, 2.0 pp to inter-zone coupling, and 14.0 pp to comfort extension.

Version published to 10.21203/rs.3.rs-9127190/v1 on Research Square
Mar 30, 2026

Deep Reinforcement Learning–Driven Energy Management for Electric Vehicles in 6G-Connected Smart Grids

This article has 5 authors:
1. Al Imran
2. Tan Chee Pin
3. ASM Bakibillah
4. Changbiao Li
5. Yanpeng Zhang
This article has no evaluationsLatest version Apr 17, 2026
Benchmarking Adaptive EV Charging Recommendation Under Feasibility Constraints and Non-Stationary Infrastructure

This article has 1 author:
1. vibhor joshi
This article has no evaluationsLatest version Apr 15, 2026
PEGA-EV: A Data-Driven Policy Learning Method for Energy-Efficient Electric Vehicle Routing with Time Windows

This article has 2 authors:
1. Abdelkader Kaddour
2. Lamri Sayad
This article has no evaluationsLatest version Apr 1, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Deep Reinforcement Learning–Driven Energy Management for Electric Vehicles in 6G-Connected Smart Grids

Benchmarking Adaptive EV Charging Recommendation Under Feasibility Constraints and Non-Stationary Infrastructure

PEGA-EV: A Data-Driven Policy Learning Method for Energy-Efficient Electric Vehicle Routing with Time Windows