Reinforcement Learning-based Optimizing Dynamic Pricing algorithm in smart grid

CAO Jun; SUN Yingying; ZHAO Hang

doi:10.11805/tkyda2020178

Journal of Terahertz Science and Electronic Information Technology , Volume. 21, Issue 1, 112(2023)

Reinforcement Learning-based Optimizing Dynamic Pricing algorithm in smart grid

CAO Jun^*, SUN Yingying, and ZHAO Hang

Author Affiliations

[in Chinese]

show less

Dynamic pricing is one of the most effective ways to encourage customers to change their consumption pattern. Therefore, Reinforcement Learning-based Optimizing Dynamic Pricing(RLODP) algorithm is proposed for energy management in a hierarchical electricity market by considering both service provider's profit and customers' costs. Using Reinforcement Learning, the SP can adaptively determine the retail electricity price. Dynamic pricing problem is formulated as a discrete finite Markov Decision Process(MDP), and Q-learning is adopted to solve this decision-making problem. Simulation results show that the RLODP algorithm can reduce energy costs for customers, balance the energy supply and the demands in the electricity market.

Keywords

demand response discrete finite Markov Decision Process electricity price Reinforcement Learning smart grid

Tools

Get Citation

Copy Citation Text

CAO Jun, SUN Yingying, ZHAO Hang. Reinforcement Learning-based Optimizing Dynamic Pricing algorithm in smart grid[J]. Journal of Terahertz Science and Electronic Information Technology , 2023, 21(1): 112

Download Citation

EndNote(RIS)BibTex Plain Text

Set citation alerts for article

Save article for my favorites

Paper Information

Category:

Received: Apr. 28, 2020

Accepted: --

Published Online: Mar. 14, 2023

The Author Email: Jun CAO (huxyu_82@sohu.com)

DOI:10.11805/tkyda2020178

Topics

laser devices and laser physics

Lasers and Laser Optics

Laser physics

laser manufacturing

Instrumentation, Measurement and Metrology