ABSTRACT
In cloud systems, an accurate capacity planning is very important for cloud provider to improve service availability. Traditional methods simply predicting "when the available resources is exhausted" are not effective due to customer demand fragmentation and platform allocation constraints. In this paper, we propose a novel prediction approach which proactively predicts the level of resource allocation failures from the perspective of low capacity status. By jointly considering the data from different sources in both time series form and static form, the proposed approach can make accurate LCS predictions in a complex and dynamic cloud environment, and thereby improve the service availability of cloud systems. The proposed approach is evaluated by real-world datasets collected from a large scale public cloud platform, and the results confirm its effectiveness.
- Ayman Amin, Lars Grunske, and Alan Colman. 2013. An approach to software reliability prediction based on time series modeling. Journal of Systems and Software, 86, 7 (2013), 1923–1932.Google ScholarDigital Library
- Mathias Björkqvist, Sebastiano Spicuglia, Lydia Y. Chen, and Walter Binder. 2013. QoS-Aware Service VM Provisioning in Clouds: Experiences, Models, and Cost Analysis. In Proceedings of ICSOC 2013. 69–83.Google ScholarDigital Library
- Rodrigo N Calheiros, Enayat Masoumi, Rajiv Ranjan, and Rajkumar Buyya. 2015. Workload prediction using ARIMA model and its impact on cloud applications’ QoS. IEEE Transactions on Cloud Computing, 3, 4 (2015), 449–458.Google ScholarDigital Library
- Martin Duggan, Karl Mason, Jim Duggan, Enda Howley, and Enda Barrett. 2017. Predicting host CPU utilization in cloud computing using recurrent neural networks. In Proceedings of ICITST 2017. 67–72.Google ScholarCross Ref
- Christos Faloutsos, Jan Gasthaus, Tim Januschowski, and Yuyang Wang. 2018. Forecasting big time series: Old and new. Proceedings of VLDB 2018, 11, 12 (2018), 2102–2105.Google ScholarDigital Library
- Anshul Gandhi, Parijat Dube, Alexei A. Karve, Andrzej Kochut, and Li Zhang. 2020. Providing Performance Guarantees for Cloud-Deployed Applications. IEEE Transactions on Cloud Computing, 8, 1 (2020), 269–281.Google ScholarCross Ref
- Yongqiang Gao, Haibing Guan, Zhengwei Qi, Yang Hou, and Liang Liu. 2013. A multi-objective ant colony system algorithm for virtual machine placement in cloud computing. J. Comput. System Sci., 79, 8 (2013), 1230–1242.Google ScholarDigital Library
- Albert Greenberg, James Hamilton, David A Maltz, and Parveen Patel. 2008. The cost of a cloud: Research problems in data center networks. ACM SIGCOMM computer communication review, 39, 1 (2008), 68–73.Google ScholarDigital Library
- Rohit Gupta, Sumit Kumar Bose, Srikanth Sundarrajan, Manogna Chebiyam, and Anirban Chakrabarti. 2008. A two stage heuristic algorithm for solving the server consolidation problem with item-item and bin-item incompatibility constraints. In Proceedings of SCC 2008. 2, 39–46.Google ScholarDigital Library
- Fang Hao, Murali Kodialam, TV Lakshman, and Sarit Mukherjee. 2016. Online allocation of virtual machines in a distributed cloud. IEEE/ACM Transactions on Networking, 25, 1 (2016), 238–249.Google ScholarDigital Library
- Sen He, Glenna Manns, John Saunders, Wei Wang, Lori L. Pollock, and Mary Lou Soffa. 2019. A statistics-based performance testing methodology for cloud applications. In Proceedings of ESEC/SIGSOFT FSE 2019. 188–199.Google ScholarDigital Library
- Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural Computation, 9, 8 (1997), 1735–1780.Google ScholarDigital Library
- Pooyan Jamshidi, Aakash Ahmad, and Claus Pahl. 2014. Autonomic resource provisioning for cloud-based software. In Proceedings of SEAMS 2014. 95–104.Google ScholarDigital Library
- Brendan Jennings and Rolf Stadler. 2015. Resource management in clouds: Survey and research challenges. Journal of Network and Systems Management, 23, 3 (2015), 567–619.Google ScholarDigital Library
- Yexi Jiang, Chang-Shing Perng, Tao Li, and Rong Chang. 2012. Intelligent cloud capacity management. In Proceedings of NOMS 2012. 502–505.Google Scholar
- Yexi Jiang, Chang-shing Perng, Tao Li, and Rong Chang. 2012. Self-adaptive cloud capacity planning. In Proceedings of SCC 2012. 73–80.Google ScholarDigital Library
- Yexi Jiang, Chang-Shing Perng, Tao Li, and Rong N Chang. 2013. Cloud analytics for capacity planning and instant vm provisioning. IEEE Transactions on Network and Service Management, 10, 3 (2013), 312–325.Google ScholarCross Ref
- Cristian Klein, Martina Maggio, Karl-Erik Årzén, and Francisco Hernández-Rodriguez. 2014. Brownout: Building more robust cloud applications. In Proceedings of ICSE 2014. 700–711.Google ScholarDigital Library
- Philipp Leitner, Branimir Wetzstein, Florian Rosenberg, Anton Michlmayr, Schahram Dustdar, and Frank Leymann. 2009. Runtime Prediction of Service Level Agreement Violations for Composite Services. In Proceedings of ICSOC/ServiceWave Workshops 2009. 176–186.Google Scholar
- Alexander Lenk, Markus Klems, Jens Nimis, Stefan Tai, and Thomas Sandholm. 2009. What’s inside the Cloud? An architectural map of the Cloud landscape. In Proceedings of ICSE workshop on software engineering challenges of cloud computing 2009. 23–31.Google ScholarDigital Library
- Roger J Lewis. 2000. An introduction to classification and regression tree (CART) analysis. In Proceedings of SAEM 2000.Google Scholar
- Qingwei Lin, Ken Hsieh, Yingnong Dang, Hongyu Zhang, Kaixin Sui, Yong Xu, Jian-Guang Lou, Chenggang Li, Youjiang Wu, Randolph Yao, Murali Chintalapati, and Dongmei Zhang. 2018. Predicting Node failure in cloud service systems. In Proceedings of ESEC/SIGSOFT FSE 2018. 480–490.Google ScholarDigital Library
- Chunhong Liu, Chuanchang Liu, Yanlei Shang, Shiping Chen, Bo Cheng, and Junliang Chen. 2017. An adaptive prediction approach based on workload pattern discrimination in the cloud. Journal of Network and Computer Applications, 80 (2017), 35–44.Google ScholarDigital Library
- Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of Machine Learning Research, 9 (2008), 2579–2605.Google Scholar
- Sunilkumar S. Manvi and Gopal Krishna Shyam. 2014. Resource management for Infrastructure as a Service (IaaS) in cloud computing: A survey. Journal of Network and Computer Applications, 41 (2014), 424–440.Google ScholarCross Ref
- Xiaoqiao Meng, Vasileios Pappas, and Li Zhang. 2010. Improving the scalability of data center networks with traffic-aware virtual machine placement. In Proceedings of INFOCOM 2010. 1–9.Google ScholarCross Ref
- Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S. Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In Proceedings of NeurIPS 2013. 3111–3119.Google Scholar
- Hidemoto Nakada, Takahiro Hirofuchi, Hirotaka Ogawa, and Satoshi Itoh. 2009. Toward virtual machine packing optimization based on genetic algorithm. In Proceedings of IWANN 2009. 651–654.Google ScholarDigital Library
- Jennifer Ortiz, Brendan Lee, Magdalena Balazinska, Johannes Gehrke, and Joseph L Hellerstein. 2018. SLAOrchestrator: Reducing the cost of performance SLAs for cloud data analytics. In Proceedings of ATC 2018. 547–560.Google Scholar
- Venkateshwar Rao and Sarika Rao. 2012. Application of artificial neural networks in capacity planning of cloud based IT infrastructure. In Proceedings of CCEM 2012. 1–4.Google ScholarCross Ref
- Christoph Rathfelder, Samuel Kounev, and David Evans. 2011. Capacity planning for event-based systems using automated performance predictions. In Proceedings of ASE 2011. 352–361.Google ScholarDigital Library
- Nilabja Roy, Abhishek Dubey, and Aniruddha Gokhale. 2011. Efficient autoscaling in the cloud using predictive models for workload forecasting. In Proceedings of CLOUD 2011. 500–507.Google ScholarDigital Library
- Sukhpal Singh, Inderveer Chana, and Rajkumar Buyya. 2017. STAR: SLA-aware autonomic management of cloud resources. IEEE Transactions on Cloud Computing.Google ScholarCross Ref
- Weijia Song, Zhen Xiao, Qi Chen, and Haipeng Luo. 2013. Adaptive resource provisioning for the cloud using online bin packing. IEEE Trans. Comput., 63, 11 (2013), 2647–2660.Google ScholarDigital Library
- Sean J Taylor and Benjamin Letham. 2018. Forecasting at scale. The American Statistician, 72, 1 (2018), 37–45.Google ScholarCross Ref
- Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Ł ukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Proceedings of NeurIPS 2017. 5998–6008.Google Scholar
- Jingjing Wang and Magdalena Balazinska. 2017. Elastic memory management for cloud data analytics. In Proceedings of ATC 2017. 745–758.Google Scholar
- Rafael Weingärtner, Gabriel Beims Bräscher, and Carlos Becker Westphall. 2015. Cloud resource management: A survey on forecasting and profiling models. Journal of Network and Computer Applications, 47 (2015), 99–106.Google ScholarDigital Library
- Ledell Yu Wu, Adam Fisch, Sumit Chopra, Keith Adams, Antoine Bordes, and Jason Weston. 2018. StarSpace: Embed All The Things!. In Proceedings of AAAI 2018. 5569–5577.Google ScholarCross Ref
- Bin Xia, Tao Li, Qi-Feng Zhou, Qianmu Li, and Hong Zhang. 2018. An Effective Classification-based Framework for Predicting Cloud Capacity Demand in Cloud Services. IEEE Transactions on Services Computing.Google Scholar
- Qi Zhang, Mohamed Faten Zhani, Shuo Zhang, Quanyan Zhu, Raouf Boutaba, and Joseph L Hellerstein. 2012. Dynamic energy-aware capacity provisioning for cloud computing environments. In Proceedings of ICAC 2012. 145–154.Google ScholarDigital Library
- Liming Zhu, Yan Liu, Ngoc Bao Bui, and Ian Gorton. 2007. Revel8or: Model Driven Capacity Planning Tool Suite. In Proceedings of ICSE 2007. 797–800.Google ScholarDigital Library
- Yi Zhu, Yan Liang, Qiong Zhang, Xi Wang, Paparao Palacharla, and Motoyoshi Sekiya. 2014. Reliable resource allocation for optically interconnected distributed clouds. In Proceedings of ICC 2014. 3301–3306.Google ScholarCross Ref
Index Terms
- Effective low capacity status prediction for cloud systems
Recommendations
Function points-based resource prediction in cloud computing
As a result of varying demands of computing resources by the users on cloud, resource provisioning in cloud computing has come out as a prominent topic of research. Many researchers have focused exclusively on the technical and security aspects of cloud ...
Cloud service engineering
ICSE '10: Proceedings of the 32nd ACM/IEEE International Conference on Software Engineering - Volume 2Building on compute and storage virtualization, Cloud Computing provides scalable, network-centric, abstracted IT infrastructure, platforms, and applications as on-demand services that are billed by consumption. Cloud Service Engineering is the ...
Resource Monitoring and Prediction in Cloud Computing Environments
ACIT-CSI '15: Proceedings of the 2015 3rd International Conference on Applied Computing and Information Technology/2nd International Conference on Computational Science and IntelligenceCloud computing provides elastic, scalable resource sharing service by resource management. Resource monitoring and prediction are the foundation to achieve resource automation, high performance management in cloud computing environment. This paper ...
Comments