Skip to main content

Advertisement

Log in

A novel approach for CPU load prediction of cloud server combining denoising and error correction

  • Special Issue Article
  • Published:
Computing Aims and scope Submit manuscript

Abstract

Computer servers in cloud data centers are known to consume a huge amount of energy in their operations. For energy saving, load balancing has been used but it is only effective when CPU loads are predicted accurately. Noise in the energy consumption data is often a detrimental factor responsible for the CPU load prediction error. In prior arts, denoising has not been considered as an approach to subside the prediction error. Therefore, a novel prediction approach called CEEMDAN-RIDGE that is centered on denoising is proposed and reported in this paper. Firstly, CEEMDAN is applied to decompose the CPU consumption data which is in the form of a time series. The curvature similarity between a pair of the original series and its corresponding decomposed series is measured. By referencing to this similarity measure, an effective series is obtained from filtration of the noise series. The effective series after the noise series is filtered out is reconstructed to a new fitting curve for CPU load prediction. The prediction accuracy is further enhanced by doing some error correction called RIDGE, which is made possible by predicting the error a priori from the historical error data from the previous prediction. In order to validate CEEMDAN-RIDGE, a series of experiments are conducted with Google trace data. The experiment results show that the LSTM model using the proposed CPU load prediction approach outperforms other models significantly in three performance metrics: RMSE, MAE and MAPE.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  1. Guitart J (2017) Toward sustainable data centers: a comprehensive energy management strategy. Computing 99(6):597–615

    Article  MathSciNet  Google Scholar 

  2. Duggan M, Mason K, Duggan J, Howley E, Barrett E (2017) Predicting host CPU utilization in cloud computing using recurrent neural networks. In: 2017 12th international conference for internet technology and secured transactions (ICITST), IEEE, pp 67–72

  3. Chang Y, Chang R, Chuang F (2014) A predictive method for workload forecasting in the cloud environment. In: Advanced technologies, embedded and multimedia for human-centric computing. Springer, Dordrecht, pp 577–585

  4. Gupta S, Singh V, Mittal AP, Rani A (2016) Weekly load prediction using wavelet neural network approach. In: 2016 second international conference on computational intelligence & communication technology (CICT), IEEE, pp 174–179

  5. Jia Y, Gong Q, Li J, Zhan J (2017) The power load combined forecasting based on CEEMDAN and QPSO-SVM. Electr Meas Instrum 1:3

    Google Scholar 

  6. Jakob Themeßl M, Gobiet A, Leuprecht A (2011) Empirical-statistical downscaling and error correction of daily precipitation from regional climate models. Int J Climatol 31(10):1530–1544

    Article  Google Scholar 

  7. Chen Y (2017) Fast dictionary learning for noise attenuation of multidimensional seismic data. Geophys J Int 209(1):21–31

    Article  Google Scholar 

  8. Qin Y, Song D, Chen H, Cheng W, Jiang G, Cottrell G (2017) A dual-stage attention-based recurrent neural network for time series prediction. Preprint arXiv:1704.02971

  9. Eiter T, Mannila H (1994) Computing discrete Fréchet distance. 64(3):636–637

  10. Greff K, Srivastava RK, Koutník J, Steunebrink BR, Schmidhuber J (2016) LSTM: a search space odyssey. IEEE Trans Neural Netw Learn Syst 28(10):2222–2232

    Article  MathSciNet  Google Scholar 

  11. Torres ME, Colominas MA, Schlotthauer G, Flandrin P (2011) A complete ensemble empirical mode decomposition with adaptive noise. In: 2011 IEEE international conference on acoustics, speech and signal processing (ICASSP), IEEE, pp 4144–4147

  12. Hoerl Arthur E, Kennard Robert W (1970) Ridge regression: biased estimation for nonorthogonal problems. Technometrics 12(1):55–67

    Article  MATH  Google Scholar 

  13. Huang NE, Shen Z, Long SR et al (1998) The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proc R Soc Lond Ser A Math Phys Eng Sci 454(1971):903–995

    Article  MathSciNet  MATH  Google Scholar 

  14. Wu Z, Huang NE (2009) Ensemble empirical mode decomposition: a noise-assisted data analysis method. Adv Adapt Data Anal 1(01):1–41

    Article  Google Scholar 

  15. Shehabi A, Smith S, Sartor D, et al (2016) United states data center energy usage report. In: Lawrence Berkeley National Lab.(LBNL), Berkeley, CA (United States)

  16. Gonçalves GE, Endo PT, Rodrigues M, Sadok DH, Kelner J, Curescu C (2020) Resource allocation based on redundancy models for high availability cloud. Computing 102(1):43–63

    Article  MathSciNet  MATH  Google Scholar 

  17. Sanjeevi P, Viswanathan P (2017) NUTS scheduling approach for cloud data centers to optimize energy consumption. Computing 99(12):1179–1205

    Article  Google Scholar 

  18. Lin W, Wu G, Wang X, Li K (2019) An artificial neural network approach to power consumption model construction for servers in cloud data centers. IEEE Trans Sustain Comput

  19. Liang J, Cao J, Wang J, Xu Y (2011) Long-term CPU load prediction. In: 2011 IEEE ninth international conference on dependable, autonomic and secure computing, IEEE, pp 23–26

  20. Gbaguidi FAR, Boumerdassi S, Milocco R, Ezin EC (2018) Adaptive ARMA based prediction of CPU consumption of servers into datacenters. In: International conference on mobile, secure, and programmable networking. Springer, Cham, pp 277–288

  21. Lin W, Peng G, Bian X, Xu S, Chang V, Li Y (2019) Scheduling algorithms for heterogeneous cloud environment: main resource load balancing algorithm and time balancing algorithm. J Grid Comput:699–726

  22. Zhou Y, Li T, Shi J, Qian Z (2019) A CEEMDAN and XGBOOST-based approach to forecast crude oil prices. Complexity 2019

  23. Zhang W, Qu Z, Zhang K, Mao W, Ma Y, Fan X (2017) A combined model based on CEEMDAN and modified flower pollination algorithm for wind speed forecasting. Energy Convers Manag 136:439–451

    Article  Google Scholar 

  24. Granger CWJ, Weiss AA (1983) Time series analysis of error-correction models. In: Studies in econometrics, time series, and multivariate statistics. Academic Press, pp 255–278

  25. Tang ZH, Zhao G, Cao S, Wang G, Xue W (2019) Wind speed prediction based on CEEMDAN-ESN and error correction strategy. In: 2019 Chinese automation congress (CAC), IEEE, pp 1568–1571

  26. Pang X, Zhou Y, Wang P, Lin W, Chang V (2020) An innovative neural network approach for stock market prediction. J Supercomput:2098–2118

  27. Aronov B, Har-Peled S, Knauer C, Wang Y, Wenk C (2006) Fréchet distance for curves, revisited. In: European symposium on algorithms. Springer, Berlin, Heidelberg, pp 52–63

  28. Niu S (2017) The research of denoising method based on empirical mode decomposition theory. Adv Appl Math 6(7):881–891

    Article  Google Scholar 

  29. Wilkes J (2011) More Google cluster data. Google research blog

  30. Nikolaenko V, Weinsberg U, Ioannidis S, Joye M, Taft N (2013) Privacy-preserving ridge regression on hundreds of millions of records. In: 2013 IEEE symposium on security and privacy, IEEE, pp 334–348

  31. Cai P, Wang Y, Lu G, Chen P, Ding C, Sun J (2016) A spatiotemporal correlative k-nearest neighbor model for short-term traffic multistep forecasting. Transp Res Part C Emerg Technol 62:21–34

    Article  Google Scholar 

  32. Biau G, Scornet E (2016) A random forest guided tour. Test 25(2):197–227

    Article  MathSciNet  MATH  Google Scholar 

  33. Ke G, Meng Q, Finley T, Wang T, Chen W, Ma W, Ye Q, Liu T (2017) Lightgbm: a highly efficient gradient boosting decision tree. In: Advances in neural information processing systems, pp 3146–3154

  34. Uddin MT, Uddiny MA (2015) Human activity recognition from wearable sensors using extremely randomized trees. In: 2015 international conference on electrical engineering and information communication technology (ICEEICT), IEEE, pp 1–6

Download references

Acknowledgements

This work is supported by National Natural Science Foundation of China (62072187, 61872084, 61772205), Guangzhou Science and Technology Program key projects (202007040002, 201902010040, 201907010001), Guangzhou Development Zone Science and Technology (Grant No. 2018GH17), Guangdong Major Project of Basic and Applied Basic Research (2019B030302002), Key-Area Research and Development Program of Guangdong Province (2020B010164003), and the Fundamental Research Funds for the Central Universities, SCUT (Grant No. 2019ZD26).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Weiwei Lin.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

You, D., Lin, W., Shi, F. et al. A novel approach for CPU load prediction of cloud server combining denoising and error correction. Computing 105, 577–594 (2023). https://doi.org/10.1007/s00607-020-00865-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00607-020-00865-y

Keywords

Mathematics Subject Classification

Navigation