Management of Heterogeneous Cloud Resources with Use of the PPO

Funika, Włodzimierz; Koperek, Paweł; Kitowski, Jacek

doi:10.1007/978-3-030-71593-9_12

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 12480))

Included in the following conference series:

European Conference on Parallel Processing

913 Accesses

Abstract

Reinforcement learning has been recently a very active field of research. Thanks to combining it with Deep Learning, many newly designed algorithms improve the state of the art. In this paper we present the results of our attempt to use the recent advancements in Reinforcement Learning to automate the management of heterogeneous resources in an environment which hosts a compute-intensive evolutionary process. We describe the architecture of our system and present evaluation results. The experiments include autonomous management of a sample workload and a comparison of its performance to the traditional automatic management approach. We also provide the details of training of the management policy using the Proximal Policy Optimization algorithm. Finally, we discuss the feasibility to extend the presented approach to other scenarios.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Automated cloud resources provisioning with the use of the proximal policy optimization

Article Open access 10 November 2022

Automatic Management of Cloud Applications with Use of Proximal Policy Optimization

Continuous Self-adaptation of Control Policies in Automatic Cloud Management

References

Amazon Web Services Elastic Compute Cloud. https://aws.amazon.com/ec2/. Accessed 30 Dec 2019
Graphite Project. https://graphiteapp.org/. Accessed 28 Nov 2019
PyTorch DNN Evolution. https://gitlab.com/pkoperek/pytorch-dnn-evolution. Accessed 01 Dec 2019
Ashraf, A., et al.: CRAMP: cost-efficient resource allocation for multiple web applications with proactive scaling. In: 4th IEEE International Conference on Cloud Computing Technology and Science Proceedings, pp. 581–586, December 2012
Google Scholar
Brockman, G., et al.: OpenAI Gym (2016). http://arxiv.org/abs/1606.01540
Chen, L., et al.: AuTO: scaling deep reinforcement learning for datacenter-scale automatic traffic optimization. In: SIGCOMM 2018, New York, USA, pp. 191–205 (2018)
Google Scholar
Ferretti, S., et al.: Qos-aware clouds. In: 2010 IEEE 3rd International Conference on Cloud Computing, pp. 321–328, July 2010
Google Scholar
Filho, M.C.S., et al.: Cloudsim plus: a cloud computing simulation framework pursuing software engineering principles for improved modularity, extensibility and correctness. In: IFIP/IEEE Symposium on Integrated Network and Service Management, pp. 400–406, May 2017
Google Scholar
Funika, W., et al.: Towards autonomic semantic-based management of distributed applications. Comput. Sci. 11, 51–64 (2010)
Google Scholar
Funika, W., Koperek, P.: Evaluating the use of policy gradient optimization approach for automatic cloud resource provisioning. In: Wyrzykowski, R., Deelman, E., Dongarra, J., Karczewski, K. (eds.) PPAM 2019. LNCS, vol. 12043, pp. 467–478. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-43229-4_40
Chapter Google Scholar
Gu, S., et al.: Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates. In: Proceedings 2017 IEEE International Conference on Robotics and Automation (ICRA), Piscataway, NJ, USA. IEEE, May 2017
Google Scholar
Kaelbling, L.P., et al.: Reinforcement learning: a survey. CoRR cs.AI/9605103 (1996). http://arxiv.org/abs/cs.AI/9605103
Kitowski, J., et al.: Computer simulation of heuristic reinforcement learning system for nuclear plant load changes control. Comput. Phys. Commun. 18, 339–352 (1979)
Article Google Scholar
Koperek, P., Funika, W.: Dynamic business metrics-driven resource provisioning in cloud environments. In: Wyrzykowski, R., Dongarra, J., Karczewski, K., Waśniewski, J. (eds.) PPAM 2011. LNCS, vol. 7204, pp. 171–180. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-31500-8_18
Chapter Google Scholar
LeCun, Y., Cortes, C.: MNIST handwritten digit database (2010). http://yann.lecun.com/exdb/mnist/
Minarolli, D., Freisleben, B.: Distributed resource allocation to virtual machines via artificial neural networks. In: 2014 Proceedings of the 22nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, PDP 2014, pp. 490–499. IEEE Computer Society, Washington, DC (2014)
Google Scholar
Mnih, V., et al.: Playing Atari with deep reinforcement learning (2013). http://arxiv.org/abs/1312.5602
Mnih, V., et al.: Human-level control through deep reinforcement learning. Nature 518(7540), 529–533 (2015)
Article Google Scholar
Mnih, V., et al.: Asynchronous methods for deep reinforcement learning. In: Balcan, M.F., Weinberger, K.Q. (eds.) Proceedings of the 33rd International Conference on Machine Learning, vol. 48, pp. 1928–1937. PMLR, 20–22 June 2016
Google Scholar
Schulman, J., et al.: Proximal policy optimization algorithms. CoRR abs/1707.06347 (2017). http://arxiv.org/abs/1707.06347
Silver, D., et al.: Mastering the game of go without human knowledge. Nature 550, 354–359 (2017)
Article Google Scholar
Sutton, R.S.: Temporal credit assignment in reinforcement learning. Ph.D. thesis (1984)
Google Scholar
Wang, Z., et al.: Automated cloud provisioning on AWS using deep reinforcement learning. CoRR abs/1709.04305 (2017). http://arxiv.org/abs/1709.04305
Xiong, P., et al.: SmartSLA: cost-sensitive management of virtualized resources for CPU-bound database services. IEEE Trans. Parallel Distrib. Syst. 26, 1441–1451 (2014)
Article Google Scholar

Download references

Acknowledgements

The research presented in this paper was supported by the funds assigned to AGH University of Science and Technology by the Polish Ministry of Science and Higher Education. The experiments have been carried out on the PL-Grid infrastructure resources of ACC Cyfronet AGH and on the Amazon Web Services Elastic Compute Cloud.

Author information

Authors and Affiliations

Faculty of Computer Science, Electronics and Telecommunication, Department of Computer Science, AGH, al. Mickiewicza 30, 30-059, Kraków, Poland
Włodzimierz Funika, Paweł Koperek & Jacek Kitowski
AGH, ACC CYFRONET AGH, ul. Nawojki 11, 30-950, Kraków, Poland
Jacek Kitowski

Authors

Włodzimierz Funika
View author publications
You can also search for this author in PubMed Google Scholar
Paweł Koperek
View author publications
You can also search for this author in PubMed Google Scholar
Jacek Kitowski
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Włodzimierz Funika .

Editor information

Editors and Affiliations

AGH University of Science and Technology, Krakow, Poland
Bartosz Balis
CiTIUS, Santiago de Compostela, Spain
Dora B. Heras
ICAR-CNR, Naples, Italy
Laura Antonelli
University of Stirling, Stirling, UK
Andrea Bracciali
Friedrich-Alexander-Universität, Erlangen, Germany
Thomas Gruber
Konkuk University, Seoul, Korea (Republic of)
Jin Hyun-Wook
Otto von Guericke University Magdeburg, Magdeburg, Germany
Michael Kuhn
Tennessee Tech University, Cookeville, TN, USA
Stephen L. Scott
Koç University, Istanbul, Turkey
Didem Unat
Czestochowa University of Technology, Czestochowa, Poland
Roman Wyrzykowski

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Funika, W., Koperek, P., Kitowski, J. (2021). Management of Heterogeneous Cloud Resources with Use of the PPO. In: Balis, B., et al. Euro-Par 2020: Parallel Processing Workshops. Euro-Par 2020. Lecture Notes in Computer Science(), vol 12480. Springer, Cham. https://doi.org/10.1007/978-3-030-71593-9_12

Download citation

DOI: https://doi.org/10.1007/978-3-030-71593-9_12
Published: 14 March 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-71592-2
Online ISBN: 978-3-030-71593-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics