Abstract
Managing multireservoir hydropower systems in an intraday context poses unique challenges due to the need for frequent decisions in response to fluctuating energy prices. While Reinforcement Learning (RL) methods have been applied to long-term management, this paper addresses the gap in short-term planning within a single day. We use an alternative RL algorithm and investigate various modeling approaches for the intraday multireservoir optimization problem. Through extensive experiments using real hydropower system data, we analyze the performance of different RL agents and benchmark them against random and greedy policies. Results demonstrate that optimal modeling choices, including reward adjustment, sufficient forecast information, and grouping of actions, significantly impact performance. Moreover, our findings suggest that Soft Actor-Critic, a RL algorithm that has not been applied before in this domain, is a viable alternative to methods such as Q-learning. Overall, this study contributes to the understanding of RL techniques in hydropower optimization and provides valuable insights for practical implementation in real-world scenarios.
This work was supported both by the project Project IA4TES (Advanced Intelligent Technologies for Sustainable Energy Transition) with file number TSI-100408-2021 from the 2021 AI R&D Missions Program, within the framework of the Spain Digital Agenda 2025 and the National Artificial Intelligence Strategy, funded by the Recovery, Transformation, and Resilience Plan and co-financed with European funds from the Recovery and Resilience Facility (RRF), Next Generation EU and the Spanish Agencia Estatal de Investigación for the support provided by the Ministerio de Ciencia e Innovación of Spain (Grant Ref. PID2022-137748OB-C31 funded by MCIN/AEI/10.13039/501100011033) and “ERDF A way of making Europe”.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Pearson, K.: On lines and planes of closest fit to systems of points in space. Philos. Mag. 2(11), 559–572 (1901). https://doi.org/10.1080/14786440109462720
Lee, J.-H., Labadie, J.W.: Stochastic optimization of multireservoir systems via reinforcement learning. Water Resour. Res. 43(11), W11408 (2007). https://doi.org/10.1029/2006WR005627
Castelletti, A., et al.: Tree-based reinforcement learning for optimal water reservoir operation. Water Resour. Res. 46(9), W09507 (2010). https://doi.org/10.1029/2009WR008898
Dariane, A.B., Moradi, A.M.: Comparative analysis of evolving artificial neural network and reinforcement learning in stochastic optimization of multireservoir systems. Hydrol. Sci. J. 61(6), 1141–1156 (2016). https://doi.org/10.1080/02626667.2014.986485
Zarghami, M.: Short term management of hydro-power system using reinforcement learning. École de technologie supérieure (2018). https://books.google.es/books?id=xHscvwEACAAJ
Haarnoja, T., et al.: Soft actor-critic: off-policy maximum entropy deep reinforcement learning with a stochastic actor. arXiv preprint arXiv:1801.01290 (2018)
Matheussen, B. V. et al.: Hydropower optimization using deep learning. In: International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems (2019). https://api.semanticscholar.org/CorpusID:195755546
Xu, W., et al.: Deep RL for cascaded hydropower reservoirs considering inflow forecasts. Water Resour. Manage. 34(9), 3003–3018 (2020). https://doi.org/10.1007/s11269-020-02600-w
Xu, W., et al.: Deep RL for optimal hydropower reservoir operation. J. Water Resour. Plann. Manage. 147(8), 04021045 (2021). https://doi.org/10.1061/(ASCE)WR.1943-5452.0001409
Raffin, A. et al.: Stable-baselines3: reliable reinforcement learning implementations. J. Mach. Learn. Res. 22(268), 1–8 (2021). http://jmlr.org/papers/v22/20-1364.html
Towers, M., et al.: Gymnasium. Zenodo (2023). https://doi.org/10.5281/zenodo.8127026, https://zenodo.org/record/8127025
OMIE: Operador del Mercado Ibérico de Energía. OMIGROUP. https://www.omie.es/es/file-access-list?parents%5B0%5D=/&parents%5B1%5D=Mercado%20Diario &parents%5B2%5D=1.%20Precios &dir=Precios%20horarios%20del%20mercado%20diario%20en%20Espa%C3%B1a &realdir=marginalpdbc. Accessed 6 Feb 2023
Acknowledgements
This work was supported both by the project Project IA4TES (Advanced Intelligent Technologies for Sustainable Energy Transition) with file number TSI-100408-2021 from the 2021 AI R &D Missions Program, within the framework of the Spain Digital Agenda 2025 and the National Artificial Intelligence Strategy, funded by the Recovery, Transformation, and Resilience Plan and co-financed with European funds from the Recovery and Resilience Facility (RRF), Next Generation EU and the Spanish Agencia Estatal de Investigación for the support provided by the Ministerio de Ciencia e Innovación of Spain (Grant Ref. PID2022-137748OB-C31 funded by MCIN/AEI/10.13039/501100011033) and “ERDF A way of making Europe”.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2025 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Castro Freibott, R., García Sánchez, Á., Espiga-Fernández, F., González-Santander de la Cruz, G. (2025). Intraday Multireservoir Hydropower Optimization with Alternative Deep Reinforcement Learning Configurations. In: Juan, A.A., Faulin, J., Lopez-Lopez, D. (eds) Decision Sciences. DSA ISC 2024. Lecture Notes in Computer Science, vol 14778. Springer, Cham. https://doi.org/10.1007/978-3-031-78238-1_34
Download citation
DOI: https://doi.org/10.1007/978-3-031-78238-1_34
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-78237-4
Online ISBN: 978-3-031-78238-1
eBook Packages: Computer ScienceComputer Science (R0)