Abstract
Auto-scaling, also known as elasticity, provides the capacity to efficiently allocate computing resources on demand, rendering it beneficial for a wide array of applications, particularly web-based ones. However, the dynamic and unpredictable nature of workloads in web applications poses considerable challenges in designing effective strategies for cloud auto-scaling. Existing research primarily relies on single-step prediction methods or focuses solely on forecasting request arrival rates, thus overlooking the intricate nature of workload characteristics and system dynamics, which significantly affect resource demands in the cloud. In this study, we propose an innovative approach to address this limitation by introducing a multi-step workload prediction method using the Long Short-Term Memory (LSTM) model. By considering workload attributes over a specific time frame, our approach enables accurate predictions of future workloads over designated time intervals through multi-step forecasting. By utilising two real-world web workload datasets, our experiments aim to underscore the significance of using real-world data in delivering a comparative performance analysis between single-step and multi-step predictions. The results demonstrate that our proposed multi-step prediction model outperforms single-step predictions and other baseline models.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Arlitt, M., Jin, T.: A workload characterization study of the 1998 world cup web site. IEEE Netw. 14(3), 30–37 (2000). https://doi.org/10.1109/65.844498
Cao, Q., Ewing, B.T., Thompson, M.A.: Forecasting wind speed with recurrent neural networks. Eur. J. Oper. Res. 221(1), 148–154 (2012)
Chai, T., Draxler, R.: Root mean square error (RMSE) or mean absolute error (MAE)? Geosci. Model Dev. 7(3), 1247–1250 (2014). https://doi.org/10.5194/gmdd-7-1525-2014
Coutinho, E.F., de Carvalho Sousa, F.R., Rego, P.A.L., Gomes, D.G., de Souza, J.N.: Elasticity in cloud computing: a survey. Ann. Telecommun. 70(7–8), 289–309 (2014). https://doi.org/10.1007/s12243-014-0450-7
Fu, Y., Hu, W., Tang, M., Yu, R., Liu, B.: Multi-step ahead wind power forecasting based on recurrent neural networks. In: 2018 IEEE PES Asia-Pacific Power and Energy Engineering Conference (APPEEC), pp. 217–222 (2018)
Gao, J., Wang, H., Shen, H.: Task failure prediction in cloud data centers using deep learning. IEEE Trans. Serv. Comput. 15(3), 1411–1422 (2020)
Gomez-Perez, A., Fernández-López, M., Corcho, O.: Ontological Engineering: With Examples from the Areas of Knowledge Management, E-Commerce and the Semantic Web (2004)
Halpin, T.: Metaschemas for ER, ORM and UML data models: a comparison. J. Database Manag. 13, 20–30 (2002). https://doi.org/10.4018/jdm.2002040102
Hyndman, R., Athanasopoulos, G.: Forecasting: Principles and Practice, 2nd edn. OTexts (2018)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization (2017)
Kirchoff, D.F., Xavier, M., Mastella, J., De Rose, C.A.F.: A preliminary study of machine learning workload prediction techniques for cloud applications. In: 2019 27th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP) (2019)
Lorido-Botran, T., Miguel-Alonso, J., Lozano, J.A.: A review of auto-scaling techniques for elastic applications in cloud environments. J. Grid Comput. 12(4), 559–592 (2014). https://doi.org/10.1007/s10723-014-9314-7
Masdari, M., Khoshnevis, A.: A survey and classification of the workload forecasting methods in cloud computing. Cluster Comput. 23(4), 2399–2424 (2019). https://doi.org/10.1007/s10586-019-03010-3
Pan, Y., Xiao, Z., Wang, X., Yang, D.: A multiple support vector machine approach to stock index forecasting with mixed frequency sampling. KBS 122, 90–102 (2017)
Sahoo, D., Sood, N., Rani, U., Abraham, G., Dutt, V., Dileep, A.: Comparative analysis of multi-step time-series forecasting for network load dataset (2020)
Samarawickrama, A., Fernando, T.: 2019 14th Conference on Industrial and Information Systems (ICIIS) (2019). https://doi.org/10.1109/iciis47346.2019.9063310
Sutcliffe, A.: Time-series forecasting using fractional differencing. J. Forecast. 13(4), 383–393 (1994)
Taieb, S., Bontempi, G.: Recursive multi-step time series forecasting by perturbing data. In: Cook, D., Pei, J., Wang, W., Zaiane, O., Wu, X. (eds.) 11th IEEE International Conference on Data Mining, ICDM 2011, pp. 695–704. IEEE Computer Society (2011). https://doi.org/10.1109/ICDM.2011.123
Yang, J., Liu, C., Shang, Y., Mao, Z., Junliang, C.: Workload predicting-based automatic scaling in service clouds, pp. 810–815 (2013)
Zhang, L., Zhang, Y., Jamshidi, P., Xu, L., Pahl, C.: Workload patterns for quality-driven dynamic cloud service configuration and auto-scaling. In: 2014 IEEE/ACM 7th International Conference on Utility and Cloud Computing, UCC 2014, pp. 156–165. IEEE Computer Society (2014)
Zhu, Y., Zhang, W., Chen, Y., Gao, H.: A novel approach to workload prediction using attention-based LSTM encoder-decoder network in cloud environment. EURASIP J. Wirel. Commun. Netw. 2019, 1–18 (2019). https://doi.org/10.1186/s13638-019-1605-z
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Suleiman, B., Alibasa, M.J., Chang, YY., Anaissi, A. (2024). Predictive Auto-scaling: LSTM-Based Multi-step Cloud Workload Prediction. In: Monti, F., et al. Service-Oriented Computing – ICSOC 2023 Workshops. ICSOC 2023. Lecture Notes in Computer Science, vol 14518. Springer, Singapore. https://doi.org/10.1007/978-981-97-0989-2_1
Download citation
DOI: https://doi.org/10.1007/978-981-97-0989-2_1
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-0988-5
Online ISBN: 978-981-97-0989-2
eBook Packages: Computer ScienceComputer Science (R0)