Skip to main content

Intraday Multireservoir Hydropower Optimization with Alternative Deep Reinforcement Learning Configurations

  • Conference paper
  • First Online:
Decision Sciences (DSA ISC 2024)

Abstract

Managing multireservoir hydropower systems in an intraday context poses unique challenges due to the need for frequent decisions in response to fluctuating energy prices. While Reinforcement Learning (RL) methods have been applied to long-term management, this paper addresses the gap in short-term planning within a single day. We use an alternative RL algorithm and investigate various modeling approaches for the intraday multireservoir optimization problem. Through extensive experiments using real hydropower system data, we analyze the performance of different RL agents and benchmark them against random and greedy policies. Results demonstrate that optimal modeling choices, including reward adjustment, sufficient forecast information, and grouping of actions, significantly impact performance. Moreover, our findings suggest that Soft Actor-Critic, a RL algorithm that has not been applied before in this domain, is a viable alternative to methods such as Q-learning. Overall, this study contributes to the understanding of RL techniques in hydropower optimization and provides valuable insights for practical implementation in real-world scenarios.

This work was supported both by the project Project IA4TES (Advanced Intelligent Technologies for Sustainable Energy Transition) with file number TSI-100408-2021 from the 2021 AI R&D Missions Program, within the framework of the Spain Digital Agenda 2025 and the National Artificial Intelligence Strategy, funded by the Recovery, Transformation, and Resilience Plan and co-financed with European funds from the Recovery and Resilience Facility (RRF), Next Generation EU and the Spanish Agencia Estatal de Investigación for the support provided by the Ministerio de Ciencia e Innovación of Spain (Grant Ref. PID2022-137748OB-C31 funded by MCIN/AEI/10.13039/501100011033) and “ERDF A way of making Europe”.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Pearson, K.: On lines and planes of closest fit to systems of points in space. Philos. Mag. 2(11), 559–572 (1901). https://doi.org/10.1080/14786440109462720

    Article  MATH  Google Scholar 

  2. Lee, J.-H., Labadie, J.W.: Stochastic optimization of multireservoir systems via reinforcement learning. Water Resour. Res. 43(11), W11408 (2007). https://doi.org/10.1029/2006WR005627

    Article  MATH  Google Scholar 

  3. Castelletti, A., et al.: Tree-based reinforcement learning for optimal water reservoir operation. Water Resour. Res. 46(9), W09507 (2010). https://doi.org/10.1029/2009WR008898

    Article  MATH  Google Scholar 

  4. Dariane, A.B., Moradi, A.M.: Comparative analysis of evolving artificial neural network and reinforcement learning in stochastic optimization of multireservoir systems. Hydrol. Sci. J. 61(6), 1141–1156 (2016). https://doi.org/10.1080/02626667.2014.986485

  5. Zarghami, M.: Short term management of hydro-power system using reinforcement learning. École de technologie supérieure (2018). https://books.google.es/books?id=xHscvwEACAAJ

  6. Haarnoja, T., et al.: Soft actor-critic: off-policy maximum entropy deep reinforcement learning with a stochastic actor. arXiv preprint arXiv:1801.01290 (2018)

  7. Matheussen, B. V. et al.: Hydropower optimization using deep learning. In: International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems (2019). https://api.semanticscholar.org/CorpusID:195755546

  8. Xu, W., et al.: Deep RL for cascaded hydropower reservoirs considering inflow forecasts. Water Resour. Manage. 34(9), 3003–3018 (2020). https://doi.org/10.1007/s11269-020-02600-w

    Article  MATH  Google Scholar 

  9. Xu, W., et al.: Deep RL for optimal hydropower reservoir operation. J. Water Resour. Plann. Manage. 147(8), 04021045 (2021). https://doi.org/10.1061/(ASCE)WR.1943-5452.0001409

    Article  Google Scholar 

  10. Raffin, A. et al.: Stable-baselines3: reliable reinforcement learning implementations. J. Mach. Learn. Res. 22(268), 1–8 (2021). http://jmlr.org/papers/v22/20-1364.html

  11. Towers, M., et al.: Gymnasium. Zenodo (2023). https://doi.org/10.5281/zenodo.8127026, https://zenodo.org/record/8127025

  12. OMIE: Operador del Mercado Ibérico de Energía. OMIGROUP. https://www.omie.es/es/file-access-list?parents%5B0%5D=/&parents%5B1%5D=Mercado%20Diario &parents%5B2%5D=1.%20Precios &dir=Precios%20horarios%20del%20mercado%20diario%20en%20Espa%C3%B1a &realdir=marginalpdbc. Accessed 6 Feb 2023

Download references

Acknowledgements

This work was supported both by the project Project IA4TES (Advanced Intelligent Technologies for Sustainable Energy Transition) with file number TSI-100408-2021 from the 2021 AI R &D Missions Program, within the framework of the Spain Digital Agenda 2025 and the National Artificial Intelligence Strategy, funded by the Recovery, Transformation, and Resilience Plan and co-financed with European funds from the Recovery and Resilience Facility (RRF), Next Generation EU and the Spanish Agencia Estatal de Investigación for the support provided by the Ministerio de Ciencia e Innovación of Spain (Grant Ref. PID2022-137748OB-C31 funded by MCIN/AEI/10.13039/501100011033) and “ERDF A way of making Europe”.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Álvaro García Sánchez .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2025 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Castro Freibott, R., García Sánchez, Á., Espiga-Fernández, F., González-Santander de la Cruz, G. (2025). Intraday Multireservoir Hydropower Optimization with Alternative Deep Reinforcement Learning Configurations. In: Juan, A.A., Faulin, J., Lopez-Lopez, D. (eds) Decision Sciences. DSA ISC 2024. Lecture Notes in Computer Science, vol 14778. Springer, Cham. https://doi.org/10.1007/978-3-031-78238-1_34

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-78238-1_34

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-78237-4

  • Online ISBN: 978-3-031-78238-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics