Predictive Auto-scaling: LSTM-Based Multi-step Cloud Workload Prediction

  • Conference paper
Service-Oriented Computing – ICSOC 2023 Workshops (ICSOC 2023)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 14518)

Abstract

Auto-scaling, also known as elasticity, provides the capacity to efficiently allocate computing resources on demand, rendering it beneficial for a wide array of applications, particularly web-based ones. However, the dynamic and unpredictable nature of workloads in web applications poses considerable challenges in designing effective strategies for cloud auto-scaling. Existing research primarily relies on single-step prediction methods or focuses solely on forecasting request arrival rates, thus overlooking the intricate nature of workload characteristics and system dynamics, which significantly affect resource demands in the cloud. In this study, we propose an innovative approach to address this limitation by introducing a multi-step workload prediction method using the Long Short-Term Memory (LSTM) model. By considering workload attributes over a specific time frame, our approach enables accurate predictions of future workloads over designated time intervals through multi-step forecasting. By utilising two real-world web workload datasets, our experiments aim to underscore the significance of using real-world data in delivering a comparative performance analysis between single-step and multi-step predictions. The results demonstrate that our proposed multi-step prediction model outperforms single-step predictions and other baseline models.
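
To make the single-step versus multi-step distinction concrete, the following is a minimal sketch and not the authors' implementation. It only shows how a workload series could be sliced into training pairs: with a horizon of 1 each target is the next value (single-step), while a larger horizon yields several future values per input window (multi-step). The function name `make_windows`, the parameters `lookback` and `horizon`, and the sample request counts are all hypothetical.

```python
from typing import List, Sequence, Tuple


def make_windows(
    series: Sequence[float], lookback: int, horizon: int
) -> Tuple[List[List[float]], List[List[float]]]:
    """Slice a series into (input window, future target) pairs.

    lookback: number of past observations fed to the model (assumed name).
    horizon:  number of future steps to predict (1 = single-step).
    """
    if lookback < 1 or horizon < 1:
        raise ValueError("lookback and horizon must be positive")
    inputs: List[List[float]] = []
    targets: List[List[float]] = []
    # The last usable window must leave room for a full target of `horizon` steps.
    last_start = len(series) - lookback - horizon
    for start in range(last_start + 1):
        split = start + lookback
        inputs.append(list(series[start:split]))
        targets.append(list(series[split:split + horizon]))
    return inputs, targets


# Hypothetical requests-per-minute trace, for illustration only.
requests_per_minute = [120, 135, 150, 160, 155, 140, 130, 125]

single_X, single_y = make_windows(requests_per_minute, lookback=3, horizon=1)
multi_X, multi_y = make_windows(requests_per_minute, lookback=3, horizon=3)

print(single_X[0], "->", single_y[0])
print(multi_X[0], "->", multi_y[0])
```

In a setup like this, a sequence model such as an LSTM would be trained to map each input window to its target. A longer horizon gives an auto-scaler more lead time but leaves fewer training pairs from the same trace.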

Author information

Corresponding author

Correspondence to Basem Suleiman.

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Suleiman, B., Alibasa, M.J., Chang, YY., Anaissi, A. (2024). Predictive Auto-scaling: LSTM-Based Multi-step Cloud Workload Prediction. In: Monti, F., et al. Service-Oriented Computing – ICSOC 2023 Workshops. ICSOC 2023. Lecture Notes in Computer Science, vol 14518. Springer, Singapore. https://doi.org/10.1007/978-981-97-0989-2_1

  • DOI: https://doi.org/10.1007/978-981-97-0989-2_1

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-97-0988-5

  • Online ISBN: 978-981-97-0989-2

  • eBook Packages: Computer Science, Computer Science (R0)
