A Reinforcement Learning Approach Based on Group Relative Policy Optimization for Economic Dispatch in Smart Grids

Adil Rizki
Achraf Touil
Abdelwahed Echchatbi
Rachid Oucheikh

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

The Economic Dispatch Problem (EDP) plays a critical role in power system operations by trying to allocate power generation across multiple units at minimal cost while satisfying complex operational constraints. Traditional optimization techniques struggle with the non-convexities introduced by factors such as valve-point effects, prohibited operating zones, and spinning reserve requirements. While metaheuristics methods have shown promise, they often suffer from convergence issues and constraint-handling limitations. In this study, we introduce a novel application of Group Relative Policy Optimization (GRPO), a reinforcement learning framework that extends Proximal Policy Optimization by integrating group-based learning and relative performance assessments. The proposed GRPO approach incorporates smart initialization, adaptive exploration, and elite-guided updates tailored to the EDP’s structure. Our method consistently produces high-quality, feasible solutions with faster convergence compared to state-of-the-art metaheuristics and learning-based methods. For instance, in the case of the 15-unit system, GRPO achieved the best cost of USD 32,421.67/h with full constraint satisfaction in just 4.24 s, surpassing many previous solutions. The algorithm also demonstrates excellent scalability, generalizability, and stability across larger-scale systems without requiring parameter retuning. These results highlight GRPO’s potential as a robust and efficient tool for real-time energy scheduling in smart grid environments.

Version published to 10.3390/electricity6030049
Sep 1, 2025
Version published to 10.20944/preprints202507.1958.v1
Jul 23, 2025

Machine Learning Based Optimization of a Distributed Generation Power System

This article has 3 authors:
1. Denis Nasasira
2. Masoud Masih Tehrani
3. M Hakimollahi
This article has no evaluationsLatest version Jan 3, 2026
Adaptive Parameter Setting for Genetic Algorithms Using Reinforcement Learning: A Case Study on the Capacitated Vehicle Routing Problem

This article has 1 author:
1. Vipul Razdan
This article has no evaluationsLatest version Jan 7, 2026
AI Driven Virtual Power Plants: A Comprehensive Review

This article has 3 authors:
1. Jian Li
2. Chenxi Wang
3. Yonghe Liu
This article has no evaluationsLatest version Jan 22, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Machine Learning Based Optimization of a Distributed Generation Power System

Adaptive Parameter Setting for Genetic Algorithms Using Reinforcement Learning: A Case Study on the Capacitated Vehicle Routing Problem

AI Driven Virtual Power Plants: A Comprehensive Review