Towards Understanding of Deep Reinforcement Learning Agents Used in Cloud Resource Management

Małota, Andrzej; Koperek, Paweł; Funika, Włodzimierz

doi:10.1007/978-3-031-36021-3_55

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14074))

Included in the following conference series:

International Conference on Computational Science

702 Accesses

Abstract

Cloud computing resource management is a critical component of the modern cloud computing platforms, aimed to manage computing resources for a given application by minimizing the cost of the infrastructure while maintaining a Quality-of-Service (QoS) conditions. This task is usually solved using rule-based policies. Due to their limitations more complex solutions, such as Deep Reinforcement Learning (DRL) agents are being researched. Unfortunately, deploying such agents in a production environment can be seen as risky because of the lack of transparency of DRL decision-making policies. There is no way to know why a certain decision is made. To foster the trust in DRL generated policies it is important to provide means of explaining why certain decisions were made given a specific input. In this paper we present a tool applying the Integrated Gradients (IG) method to Deep Neural Networks used by DRL algorithms. This allowed to obtain feature attributions that show the magnitude and direction of each feature’s influence on the agent’s decision. We verify the viability of the proposed solution by applying it to a number of sample use cases with different DRL agents.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Barredo Arrieta, A., et al.: Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI. Information Fusion 58, 82–115 (2020). https://doi.org/10.1016/j.inffus.2019.12.012
Article Google Scholar
Cheng, M., et al.: DRL-cloud: deep reinforcement learning-based resource provisioning and task scheduling for cloud service providers. In: 2018 23rd Asia and South Pacific Design Automation Conference (ASP-DAC), pp. 129–134 (2018)
Google Scholar
Cobbe, K., et al.: Quantifying Generalization in Reinforcement Learning (2018). https://doi.org/10.48550/ARXIV.1812.02341
Cuayáhuitl, H.: SimpleDS: A Simple Deep Reinforcement Learning Dialogue System (2016). https://doi.org/10.48550/ARXIV.1601.04574
Dutta, S., et al.: SmartScale: automatic application scaling in enterprise clouds. In: 2012 IEEE Fifth International Conference on Cloud Computing, pp. 221–228 (2012)
Google Scholar
Funika, W., Koperek, P.: Evaluating the use of policy gradient optimization approach for automatic cloud resource provisioning. In: Wyrzykowski, R., Deelman, E., Dongarra, J., Karczewski, K. (eds.) PPAM 2019. LNCS, vol. 12043, pp. 467–478. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-43229-4_40
Chapter Google Scholar
Funika, W., Koperek, P., Kitowski, J.: Automatic management of cloud applications with use of proximal policy optimization. In: Krzhizhanovskaya, V.V., et al. (eds.) ICCS 2020. LNCS, vol. 12137, pp. 73–87. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-50371-0_6
Chapter Google Scholar
Gregurić, M., et al.: Application of deep reinforcement learning in traffic signal control: an overview and impact of open traffic data. Appl. Sci. 10(11) (2020)
Google Scholar
Greydanus, S., et al.: Visualizing and Understanding Atari Agents (2017). https://doi.org/10.48550/ARXIV.1711.00138
Guo, W., et al.: EDGE: Explaining Deep Reinforcement Learning Policies. In: Ranzato, M., Beygelzimer, A., Dauphin, Y., Liang, P., Vaughan, J.W. (eds.) Advances in Neural Information Processing Systems, vol. 34, pp. 12222–12236. Curran Associates, Inc. (2021)
Google Scholar
Heuillet, A., et al.: Explainability in Deep Reinforcement Learning. CoRR abs/2008.06693 (2020)
Google Scholar
Hilton, J., et al.: Understanding RL Vision. Distill (2020). https://doi.org/10.23915/distill.00029
Małota, A., et al.: Trainloop-driver (2023). https://github.com/andrzejmalota/trainloop-driver/tree/master/examples
Milani, S., et al.: A Survey of Explainable Reinforcement Learning (2022). https://doi.org/10.48550/ARXIV.2202.08434
Mnih, V., et al.: Playing Atari with deep reinforcement learning. In: NIPS Deep Learning Workshop (2013). http://arxiv.org/abs/1312.5602
Mott, A., et al.: Towards Interpretable Reinforcement Learning Using Attention Augmented Agents (2019). https://doi.org/10.48550/ARXIV.1906.02500
Olah, C., et al.: The Building Blocks of Interpretability. Distill (2018). https://doi.org/10.23915/distill.00010
OpenAI, et al.: Solving Rubik’s Cube with a Robot Hand (2019). https://doi.org/10.48550/ARXIV.1910.07113
Ribeiro, M.T., et al.: “Why Should I Trust You?”: explaining the predictions of any classifier. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. KDD 2016, New York, NY, USA, pp. 1135–1144. Association for Computing Machinery (2016)
Google Scholar
Schulman, J., et al.: Proximal Policy Optimization Algorithms (2017). https://doi.org/10.48550/ARXIV.1707.06347
Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., Batra, D.: Grad-CAM: visual explanations from deep networks via gradient-based localization. Int. J. Comput. Vision 128(2), 336–359 (2019). https://doi.org/10.1007/s11263-019-01228-7
Article Google Scholar
Campos da Silva Filho, M., et al.: CloudSim plus: a cloud computing simulation framework pursuing software engineering principles for improved modularity, extensibility and correctness. In: 2017 IFIP/IEEE Symposium on Integrated Network and Service Management (IM), pp. 400–406 (2017)
Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction, 2nd edn. The MIT Press (2018)
Google Scholar
Tighe, M., Bauer, M.: Integrating cloud application autoscaling with dynamic VM allocation. In: 2014 IEEE Network Operations and Management Symposium (NOMS), pp. 1–9 (2014)
Google Scholar
Wang, Z., et al.: Automated Cloud Provisioning on AWS using Deep Reinforcement Learning (2017). https://doi.org/10.48550/ARXIV.1709.04305
Zhang, Y., et al.: Intelligent cloud resource management with deep reinforcement learning. IEEE Cloud Comput. 4(6), 60–69 (2017). https://doi.org/10.1109/MCC.2018.1081063
Article Google Scholar

Download references

Acknowledgements

The research presented in this paper was supported by the funds assigned to AGH University of Krakow by the Polish Ministry of Education and Science.

Author information

Authors and Affiliations

Faculty of Computer Science, Electronics and Telecommunication, Institute of Computer Science, AGH University of Krakow, al. Mickiewicza 30, 30-059, Kraków, Poland
Andrzej Małota, Paweł Koperek & Włodzimierz Funika

Authors

Andrzej Małota
View author publications
You can also search for this author in PubMed Google Scholar
Paweł Koperek
View author publications
You can also search for this author in PubMed Google Scholar
Włodzimierz Funika
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Paweł Koperek .

Editor information

Editors and Affiliations

Czech Technical University in Prague, Prague, Czech Republic
Jiří Mikyška
University of Amsterdam, Amsterdam, The Netherlands
Clélia de Mulatier
AGH University of Science and Technology, Krakow, Poland
Maciej Paszynski
University of Amsterdam, Amsterdam, The Netherlands
Valeria V. Krzhizhanovskaya
University of Tennessee at Knoxville, Knoxville, TN, USA
Jack J. Dongarra
University of Amsterdam, Amsterdam, The Netherlands
Peter M.A. Sloot

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Małota, A., Koperek, P., Funika, W. (2023). Towards Understanding of Deep Reinforcement Learning Agents Used in Cloud Resource Management. In: Mikyška, J., de Mulatier, C., Paszynski, M., Krzhizhanovskaya, V.V., Dongarra, J.J., Sloot, P.M. (eds) Computational Science – ICCS 2023. ICCS 2023. Lecture Notes in Computer Science, vol 14074. Springer, Cham. https://doi.org/10.1007/978-3-031-36021-3_55

Download citation

DOI: https://doi.org/10.1007/978-3-031-36021-3_55
Published: 26 June 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-36020-6
Online ISBN: 978-3-031-36021-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Towards Understanding of Deep Reinforcement Learning Agents Used in Cloud Resource Management