ABSTRACT
In the cloud environment, application elastic scaling is very important. The number of copies can be dynamically adjusted according to load. A good elastic scaling scheme can not only ensure the stability of application, but also improve resource utilization of platform. The existing responsive scaling strategy of Kubernetes platform has many problems, which can not meet requirements of web system for service quality. This paper optimizes the default elastic scaling scheme in Kubernetes cluster, and proposes a container dynamic scaling scheme LP-HPA (load predict horizon pod autoscaling) based on load prediction. This scheme uses LSTM-GRU model to predict the application load, comprehensively considers predicted data and current data, realizes dynamic scaling of container, and ensures the service quality of application. Finally, by building Kubernetes cluster, this paper uses open source data set to verify the LP-HPA scheme. Experimental results show that our proposed scheme is better than Kubernetes' default scaling scheme in three scenarios: load rise, load drop and load jitter.
- YANG Y, ZHAO L, LI Z, Elax: Provisioning resource elastically for containerized online cloud services; proceedings of the 2019 IEEE 21st International Conference on High Performance Computing and Communications; IEEE 17th International Conference on Smart City; IEEE 5th International Conference on Data Science and Systems (HPCC/SmartCity/DSS), F, 2019 [C]. IEEE.Google Scholar
- QU C, CALHEIROS R N, BUYYA R. Auto-scaling web applications in clouds: A taxonomy and survey [J]. ACM Computing Surveys (CSUR), 2018, 51(4): 1-33.Google Scholar
- CALHEIROS R N, MASOUMI E, RANJAN R, Workload prediction using ARIMA model and its impact on cloud applications’ QoS [J]. IEEE transactions on cloud computing, 2014, 3(4): 449-58.Google Scholar
- SHI X, CHEN Z, WANG H, Convolutional LSTM network: A machine learning approach for precipitation nowcasting [J]. Advances in neural information processing systems, 2015, 28.Google Scholar
- CHO K, VAN MERRIëNBOER B, GULCEHRE C, Learning phrase representations using RNN encoder-decoder for statistical machine translation [J]. arXiv preprint arXiv:14061078, 2014.Google Scholar
- BAI S, KOLTER J Z, KOLTUN V. An empirical evaluation of generic convolutional and recurrent networks for sequence modeling [J]. arXiv preprint arXiv:180301271, 2018.Google Scholar
Index Terms
- LP-HPA:Load Predict-Horizontal Pod Autoscaler for Container Elastic Scaling
Recommendations
Elastic edge cloud resource management based on horizontal and vertical scaling
AbstractThe resources in the edge cloud are numerous and complex, and elastic scaling services can make efficient use of these resources. However, the elastic scaling services need to suspend the user’s application tasks forcibly when carrying out ...
Optimization enabled elastic scaling in cloud based on predicted load for resource management
Cloud computing epitomizes an important invention in the field of Information Technology, which presents users with a way of providing on-demand access to a pool of shared computing resources. A major challenge faced by the cloud system is to assign the ...
Container Scaling Strategy Based on Reinforcement Learning
Elasticity capability is one of the most important capabilities of cloud computing, which combines large-scale resource allocation capability to quickly achieve minute-level resource demand provisioning to meet the elasticity requirements of different ...
Comments