Smart Pricing for Smart Charging: A Deep Reinforcement Learning Framework for Residential EV Infrastructure

Abstract

The growing adoption of electric vehicles in the residential building sector presents new challenges for managing charging infrastructure, especially in pricing its use to balance operator revenue and user satisfaction with grid stability. Traditional pricing methods such as fixed rates and time-of-use tariffs cannot accommodate the dynamic nature of charging demand, which fluctuates with temporal patterns, weather conditions, and user behavior. As a result, resources are used suboptimally and revenue opportunities are lost during periods of high demand. To address this, we propose a reinforcement learning framework for dynamic pricing at residential electric vehicle charging stations. The framework models the pricing problem as a Markov Decision Process and uses Proximal Policy Optimization to learn a policy that sets optimal prices for private and shared charging stations according to real-time conditions. The state is represented by ten features, including temporal indicators, current grid load, grid status, traffic volume, and weather data. A multi-objective reward function balances four goals: revenue maximization, station utilization, grid stability, and user satisfaction. The system is trained on real charging data from a residential complex in Trondheim, Norway, comprising 6,878 charging sessions over a 13-month period. We compare the learned policy with three baseline strategies: fixed pricing, time-of-use pricing, and rule-based pricing. Experimental results show that the proposed approach achieves an overall score of 0.569, an improvement of 32.9% over fixed pricing and 48.9% over time-of-use pricing. The learned policy successfully adapts prices to varying conditions and maintains balanced performance across all objectives.
The main contributions include a custom reinforcement learning environment for residential EV charging pricing, a multi-objective reward formulation, and empirical evidence that learned policies outperform traditional pricing approaches.
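To make the setup concrete, the abstract's environment and reward design can be sketched as a minimal episodic pricing loop. This is an illustrative assumption, not the authors' implementation: the feature semantics, the toy demand model, the price range, and the equal objective weights are all hypothetical placeholders.

```python
import numpy as np

class EVPricingEnv:
    """Hypothetical sketch: agent sets a charging price each step;
    reward is a weighted sum of the four objectives from the abstract."""

    N_FEATURES = 10  # temporal indicators, grid load, grid status, traffic, weather

    def __init__(self, seed=0):
        self.rng = np.random.default_rng(seed)
        # Assumed equal weights for the four objectives (illustrative only).
        self.w = dict(revenue=0.25, utilization=0.25,
                      grid=0.25, satisfaction=0.25)

    def reset(self):
        self.state = self.rng.random(self.N_FEATURES)
        return self.state

    def step(self, price):
        # Toy demand model: utilization falls linearly as price rises.
        utilization = max(0.0, 1.0 - price / 2.0)
        revenue = price * utilization           # normalized revenue proxy
        grid_load = self.state[1]               # assumed "current grid load" feature
        grid_stability = 1.0 - grid_load * utilization
        satisfaction = max(0.0, 1.0 - price / 2.0)  # cheaper -> happier users

        reward = (self.w["revenue"] * revenue
                  + self.w["utilization"] * utilization
                  + self.w["grid"] * grid_stability
                  + self.w["satisfaction"] * satisfaction)

        self.state = self.rng.random(self.N_FEATURES)
        return self.state, reward

env = EVPricingEnv()
state = env.reset()
state, reward = env.step(price=1.0)
```

In practice such an environment would be wrapped in a Gymnasium-style interface and the pricing policy trained with an off-the-shelf PPO implementation; the sketch only shows how a ten-feature state and a four-term weighted reward could fit together.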
