Deep Reinforcement Learning for Optimal Hydropower Reservoir Operation

Xu, W; Meng, F; Guo, W; Li, X; Fu, G

dc.contributor.author	Xu, W
dc.contributor.author	Meng, F
dc.contributor.author	Guo, W
dc.contributor.author	Li, X
dc.contributor.author	Fu, G
dc.date.accessioned	2021-02-22T11:10:43Z
dc.date.issued	2021-05-21
dc.description.abstract	Optimal operation of hydropower reservoir systems is a classical optimization problem of high dimensionality and stochastic nature. A key challenge lies in improving the interpretability of operation strategies, i.e., the cause-effect relationship between system outputs (or actions) and contributing variables such as states and inputs. Here we report for the first time a new Deep Reinforcement Learning (DRL) framework for optimal operation of reservoir systems based on Deep Q-Networks (DQN), which provides a significant advance in understanding the performance of optimal operations. DQN combines Q-learning with two deep artificial neural networks (ANNs) and acts as the agent that interacts with the reservoir system by learning its states and providing actions. Three knowledge forms of learning considering the states, actions and rewards are constructed to improve the interpretability of operation strategies. The impacts of these knowledge forms and DRL learning parameters on operation performance are analysed. The DRL framework is tested on the Huanren hydropower system in China, using 400-year synthetic flow data for training and 30-year observed flow data for verification. The discretization levels of reservoir water level and energy output yield contrasting effects: finer discretization of water level improves performance in terms of annual hydropower generated and hydropower production reliability; however, finer discretization of hydropower production can reduce search efficiency and thus the resulting DRL performance. Compared with benchmark algorithms including dynamic programming, stochastic dynamic programming, and decision trees, the proposed DRL approach can effectively factor in future inflow uncertainties when deciding optimal operations and generate markedly higher hydropower. This study provides new knowledge on the performance of DRL in the context of hydropower system characteristics and data input features, and shows promise for implementation in practice to derive operation policies that can be automatically updated by learning on new data.	en_GB
dc.description.sponsorship	National Natural Science Foundation of China (NSFC)	en_GB
dc.description.sponsorship	Royal Society	en_GB
dc.description.sponsorship	Engineering and Physical Sciences Research Council (EPSRC)	en_GB
dc.identifier.citation	Vol. 147 (8), article 04021045	en_GB
dc.identifier.doi	10.1061/(ASCE)WR.1943-5452.0001409
dc.identifier.grantnumber	51609025	en_GB
dc.identifier.grantnumber	IF160108	en_GB
dc.identifier.grantnumber	EP/N510129/1	en_GB
dc.identifier.uri	http://hdl.handle.net/10871/124834
dc.language.iso	en	en_GB
dc.publisher	American Society of Civil Engineers (ASCE)	en_GB
dc.rights	© 2021 American Society of Civil Engineers
dc.subject	Artificial Intelligence	en_GB
dc.subject	Deep Q-Network	en_GB
dc.subject	Deep Reinforcement Learning	en_GB
dc.subject	Hydropower System	en_GB
dc.subject	Reservoir Operation	en_GB
dc.title	Deep Reinforcement Learning for Optimal Hydropower Reservoir Operation	en_GB
dc.type	Article	en_GB
dc.date.available	2021-02-22T11:10:43Z
dc.identifier.issn	0733-9496
dc.description	This is the author accepted manuscript. The final version is available from ASCE via the DOI in this record	en_GB
dc.description	Data Availability Statement: Some or all data, models, or code that support the findings of this study are available from the corresponding author upon reasonable request. Data include the synthetic and observed flow time series. The code that has been used for the deep reinforcement learning is also available.	en_GB
dc.identifier.journal	Journal of Water Resources Planning and Management	en_GB
dc.rights.uri	http://www.rioxx.net/licenses/all-rights-reserved	en_GB
dcterms.dateAccepted	2021-02-21
exeter.funder	::Royal Society (Government)	en_GB
exeter.funder	::Alan Turing Institute	en_GB
rioxxterms.version	AM	en_GB
rioxxterms.licenseref.startdate	2021-02-21
rioxxterms.type	Journal Article/Review	en_GB
refterms.dateFCD	2021-02-22T09:12:56Z
refterms.versionFCD	AM
refterms.dateFOA	2021-07-05T14:56:40Z
refterms.panel	B	en_GB

Files in this item

Name:: WRENG-4801_R2 with figures.pdf
Size:: 3.554Mb
Format:: PDF

This item appears in the following Collection(s)

Engineering
