Intraday Multireservoir Hydropower Optimization with Alternative Deep Reinforcement Learning Configurations

Castro Freibott, Rodrigo; García Sánchez, Álvaro; Espiga-Fernández, Francisco; González-Santander de la Cruz, Guillermo

doi:10.1007/978-3-031-78238-1_34

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 14778)

Included in the following conference series:

Decision Science Alliance International Summer Conference

Abstract

Managing multireservoir hydropower systems in an intraday context poses unique challenges due to the need for frequent decisions in response to fluctuating energy prices. While Reinforcement Learning (RL) methods have been applied to long-term management, this paper addresses the gap in short-term planning within a single day. We use an alternative RL algorithm and investigate various modeling approaches for the intraday multireservoir optimization problem. Through extensive experiments using real hydropower system data, we analyze the performance of different RL agents and benchmark them against random and greedy policies. Results demonstrate that modeling choices, including reward adjustment, sufficient forecast information, and grouping of actions, significantly impact performance. Moreover, our findings suggest that Soft Actor-Critic, an RL algorithm that has not been applied before in this domain, is a viable alternative to methods such as Q-learning. Overall, this study contributes to the understanding of RL techniques in hydropower optimization and provides valuable insights for practical implementation in real-world scenarios.
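
To make the benchmarking setup in the abstract more concrete, the following is a minimal sketch of how random and greedy baseline policies could be evaluated on an intraday episode. It is not the paper's model: the single-reservoir environment `ToyReservoirEnv`, its parameters, the price series, and the reading of "greedy" as "always release as much water as possible" are all assumptions made purely for illustration. An RL agent such as Soft Actor-Critic would be evaluated through the same `run_episode` loop by supplying its own policy function.

```python
import random
from dataclasses import dataclass, field


@dataclass
class ToyReservoirEnv:
    """Hypothetical single-reservoir intraday environment (not the paper's model)."""
    prices: list                   # energy price for each step of the day
    inflow: float = 2.0            # water entering per step (arbitrary units)
    max_release: float = 5.0       # turbine flow limit per step
    capacity: float = 40.0         # maximum stored volume; excess is spilled
    initial_volume: float = 20.0
    energy_per_unit: float = 1.0   # energy produced per unit of water released
    volume: float = field(init=False)
    t: int = field(init=False)

    def __post_init__(self):
        self.reset()

    def reset(self):
        self.volume = self.initial_volume
        self.t = 0
        return (self.volume, self.prices[0])

    def step(self, action):
        """action in [0, 1]: fraction of max_release sent through the turbine."""
        action = min(max(action, 0.0), 1.0)
        # The reservoir cannot release more water than it currently stores.
        release = min(action * self.max_release, self.volume)
        self.volume = min(self.volume - release + self.inflow, self.capacity)
        reward = self.prices[self.t] * release * self.energy_per_unit
        self.t += 1
        done = self.t >= len(self.prices)
        obs = (self.volume, self.prices[self.t] if not done else 0.0)
        return obs, reward, done


def random_policy(obs, rng):
    # Baseline: uniformly random release fraction.
    return rng.random()


def greedy_policy(obs, rng):
    # Baseline (assumed definition): always release the maximum, ignoring future prices.
    return 1.0


def run_episode(env, policy, seed=0):
    """Run one full day and return the total income collected."""
    rng = random.Random(seed)
    obs = env.reset()
    total, done = 0.0, False
    while not done:
        obs, reward, done = env.step(policy(obs, rng))
        total += reward
    return total


# Hypothetical 24-hour price profile, for illustration only.
HYPOTHETICAL_PRICES = [42, 38, 35, 33, 32, 34, 40, 55, 68, 72, 65, 58,
                       52, 48, 46, 47, 53, 66, 80, 85, 78, 64, 52, 45]

env = ToyReservoirEnv(prices=HYPOTHETICAL_PRICES)
print("random:", run_episode(env, random_policy, seed=0))
print("greedy:", run_episode(env, greedy_policy))
```

Under these assumptions the greedy baseline empties the reservoir in the first hours regardless of price, which is the kind of myopic behavior a learned policy with forecast information would be expected to improve on.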

This work was supported both by Project IA4TES (Advanced Intelligent Technologies for Sustainable Energy Transition), file number TSI-100408-2021, from the 2021 AI R&D Missions Program, within the framework of the Spain Digital Agenda 2025 and the National Artificial Intelligence Strategy, funded by the Recovery, Transformation, and Resilience Plan and co-financed with European funds from the Recovery and Resilience Facility (RRF), Next Generation EU; and by the Spanish Agencia Estatal de Investigación, through the support provided by the Ministerio de Ciencia e Innovación of Spain (Grant Ref. PID2022-137748OB-C31 funded by MCIN/AEI/10.13039/501100011033) and “ERDF A way of making Europe”.

Author information

Authors and Affiliations

Industrial Engineering, Business Administration and Statistics Department, ETSII, Universidad Politécnica de Madrid, José Gutiérrez Abascal 2, 28006, Madrid, Spain
Rodrigo Castro Freibott, Álvaro García Sánchez & Francisco Espiga-Fernández
baobab soluciones, José Abascal 55, 28003, Madrid, Spain
Guillermo González-Santander de la Cruz

Corresponding author

Correspondence to Álvaro García Sánchez.

Editor information

Editors and Affiliations

Universidad Politécnica de Valencia, Valencia, Spain
Angel A. Juan
Public University of Navarre, Pamplona, Spain
Javier Faulin
ESADE Business School, Sant Cugat, Spain
David Lopez-Lopez

About this paper

Cite this paper

Castro Freibott, R., García Sánchez, Á., Espiga-Fernández, F., González-Santander de la Cruz, G. (2025). Intraday Multireservoir Hydropower Optimization with Alternative Deep Reinforcement Learning Configurations. In: Juan, A.A., Faulin, J., Lopez-Lopez, D. (eds) Decision Sciences. DSA ISC 2024. Lecture Notes in Computer Science, vol 14778. Springer, Cham. https://doi.org/10.1007/978-3-031-78238-1_34

DOI: https://doi.org/10.1007/978-3-031-78238-1_34
Published: 31 January 2025
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-78237-4
Online ISBN: 978-3-031-78238-1
eBook Packages: Computer Science, Computer Science (R0)
