Abstract : This paper proposes a Deep Reinforcement Learning approach for optimally managing multi-energy systems in smart grids. The optimal control problem of the production and storage units within the smart grid is formulated as a Partially Observable Markov Decision Process (POMDP), and is solved using an actor-critic Deep Reinforcement Learning algorithm. The framework is tested on a novel multi-energy residential microgrid model that encompasses electrical, heating and cooling storage as well as thermal production systems and renewable energy generation. One of the main challenges faced when dealing with real-time optimal control of such multi-energy systems is the need to take multiple continuous actions simultaneously. The proposed Deep Deterministic Policy Gradient (DDPG) agent has shown to handle well the continuous state and action spaces and learned to simultaneously take multiple actions on the production and storage systems that allow to jointly optimize the electrical, heating and cooling usages within the smart grid. This allows the approach to be applied for the real-time optimal energy management of larger scale multi-energy Smart Grids like eco-distrits and smart cities where multiple continuous actions need to be taken simultaneously.
https://hal-mines-paristech.archives-ouvertes.fr/hal-03587262 Contributor : Dhekra BousninaConnect in order to contact the contributor Submitted on : Thursday, February 24, 2022 - 2:08:16 PM Last modification on : Saturday, March 5, 2022 - 3:12:10 AM Long-term archiving on: : Wednesday, May 25, 2022 - 7:55:57 PM
File
Restricted access
To satisfy the distribution rights of the publisher, the document is embargoed
until : 2023-02-24