Predictive Auto-scaling: LSTM-Based Multi-step Cloud Workload Prediction

Suleiman, Basem; Alibasa, Muhammad Johan; Chang, Ya-Yuan; Anaissi, Ali

doi:10.1007/978-981-97-0989-2_1

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 14518)

Included in the following conference series:

International Conference on Service-Oriented Computing

Abstract

Auto-scaling, also known as elasticity, allocates computing resources efficiently on demand, which benefits a wide range of applications, particularly web-based ones. However, the dynamic and unpredictable nature of web application workloads makes it difficult to design effective cloud auto-scaling strategies. Existing research relies primarily on single-step prediction methods or forecasts only request arrival rates, overlooking the workload characteristics and system dynamics that significantly affect resource demands in the cloud. In this study, we address this limitation with a multi-step workload prediction method based on the Long Short-Term Memory (LSTM) model. By considering workload attributes over a specific time frame, our approach enables accurate predictions of future workloads over designated time intervals through multi-step forecasting. Using two real-world web workload datasets, our experiments compare the performance of single-step and multi-step predictions and underscore the significance of real-world data in that comparison. The results demonstrate that our proposed multi-step prediction model outperforms single-step predictions and other baseline models.
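To make the distinction between single-step and multi-step forecasting concrete, the following is a minimal sketch of how a workload series might be framed as supervised training pairs. It is not the authors' implementation: the abstract does not specify window lengths, features, or network architecture, so the function name `make_windows`, the parameters `n_in` and `n_out`, and the sample request counts are all hypothetical. The LSTM itself is omitted; only the data framing that such a model would consume is shown.

```python
from typing import List, Sequence, Tuple


def make_windows(
    series: Sequence[float], n_in: int, n_out: int
) -> Tuple[List[List[float]], List[List[float]]]:
    """Slice a series into (input window, target horizon) pairs.

    n_in  : number of past observations fed to the model (assumed name)
    n_out : number of future steps to predict; 1 gives single-step,
            values above 1 give multi-step forecasting
    """
    if n_in < 1 or n_out < 1:
        raise ValueError("n_in and n_out must both be at least 1")
    inputs: List[List[float]] = []
    targets: List[List[float]] = []
    # Last start index that still leaves room for a full input and target.
    last_start = len(series) - n_in - n_out
    for start in range(last_start + 1):
        split = start + n_in
        inputs.append(list(series[start:split]))
        targets.append(list(series[split:split + n_out]))
    return inputs, targets


# Hypothetical request counts per interval, for illustration only.
requests_per_interval = [120, 135, 150, 160, 155, 140, 130, 125]

# Single-step framing: three past values predict the next one.
X_single, y_single = make_windows(requests_per_interval, n_in=3, n_out=1)

# Multi-step framing: the same three past values predict the next three.
X_multi, y_multi = make_windows(requests_per_interval, n_in=3, n_out=3)

print(X_single[0], y_single[0])
print(X_multi[0], y_multi[0])
```

In the multi-step framing each target is a vector covering the whole horizon, so a model trained on these pairs emits several future values at once, which would give an auto-scaler more lead time than a one-step-ahead forecast. The trade-off visible here is that a longer horizon yields fewer training pairs from the same series.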

Author information

Authors and Affiliations

University of New South Wales, Sydney, Australia
Basem Suleiman, Ya-Yuan Chang & Ali Anaissi
The University of Sydney, Sydney, Australia
Basem Suleiman
Telkom University, Bandung, Indonesia
Muhammad Johan Alibasa

Corresponding author

Correspondence to Basem Suleiman.

Editor information

Editors and Affiliations

Sapienza University of Rome, Rome, Italy
Flavia Monti
Politecnico di Milano, Milan, Italy
Pierluigi Plebani
École de Technologie Supérieure, Montréal, QC, Canada
Naouel Moha
University of New South Wales, Sydney, NSW, Australia
Hye-young Paik
University of Stuttgart, Stuttgart, Germany
Johanna Barzen
Queensland University of Technology, Brisbane, QLD, Australia
Gowri Ramachandran
University of Brescia, Brescia, Italy
Devis Bianchini
TU/e – JADS, Politecnico di Milano, Milan, Italy
Damian A. Tamburri
Sapienza University of Rome, Rome, Italy
Massimo Mecella

Cite this paper

Suleiman, B., Alibasa, M.J., Chang, YY., Anaissi, A. (2024). Predictive Auto-scaling: LSTM-Based Multi-step Cloud Workload Prediction. In: Monti, F., et al. Service-Oriented Computing – ICSOC 2023 Workshops. ICSOC 2023. Lecture Notes in Computer Science, vol 14518. Springer, Singapore. https://doi.org/10.1007/978-981-97-0989-2_1

DOI: https://doi.org/10.1007/978-981-97-0989-2_1
Published: 16 March 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-0988-5
Online ISBN: 978-981-97-0989-2
eBook Packages: Computer Science, Computer Science (R0)
