research-article

Effective low capacity status prediction for cloud systems

Authors:
Hang Dong

Microsoft Research, China

Microsoft Research, China

0000-0001-6439-8183
View Profile

,
Si Qin

Microsoft Research, China

Microsoft Research, China
View Profile

,
Yong Xu

Microsoft Research, China

Microsoft Research, China
View Profile

,
Bo Qiao

Microsoft Research, China

Microsoft Research, China

0000-0002-8997-8317
View Profile

,
Shandan Zhou

Microsoft Azure, USA

Microsoft Azure, USA
View Profile

,
Xian Yang

Hong Kong Baptist University, China

Hong Kong Baptist University, China

0000-0002-1496-8923
View Profile

,
Chuan Luo

Microsoft Research, China

Microsoft Research, China

0000-0001-5028-1064
View Profile

,
Pu Zhao

Microsoft Research, China

Microsoft Research, China

0000-0002-4518-323X
View Profile

,
Qingwei Lin

Microsoft Research, China

Microsoft Research, China

0000-0003-2559-2383
View Profile

,
Hongyu Zhang

University of Newcastle, Australia

University of Newcastle, Australia

0000-0002-3063-9425
View Profile

,
Abulikemu Abuduweili

Microsoft Research, China

Microsoft Research, China

0000-0002-3186-1976
View Profile

,
Sanjay Ramanujan

Microsoft Azure, USA

Microsoft Azure, USA
View Profile

,
Karthikeyan Subramanian

Microsoft Azure, USA

Microsoft Azure, USA
View Profile

,
Andrew Zhou

Microsoft 365, China

Microsoft 365, China

0000-0002-6157-8936
View Profile

,
Saravanakumar Rajmohan

Microsoft 365, USA

Microsoft 365, USA
View Profile

,
Dongmei Zhang

Microsoft Research, China

Microsoft Research, China

0000-0002-9230-2799
View Profile

,
Thomas Moscibroda

Microsoft Azure, USA

Microsoft Azure, USA
View Profile

ESEC/FSE 2021: Proceedings of the 29th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software EngineeringAugust 2021Pages 1236–1241https://doi.org/10.1145/3468264.3473917

Published:18 August 2021Publication History

ESEC/FSE 2021: Proceedings of the 29th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering

Pages 1236–1241

ABSTRACT

In cloud systems, an accurate capacity planning is very important for cloud provider to improve service availability. Traditional methods simply predicting "when the available resources is exhausted" are not effective due to customer demand fragmentation and platform allocation constraints. In this paper, we propose a novel prediction approach which proactively predicts the level of resource allocation failures from the perspective of low capacity status. By jointly considering the data from different sources in both time series form and static form, the proposed approach can make accurate LCS predictions in a complex and dynamic cloud environment, and thereby improve the service availability of cloud systems. The proposed approach is evaluated by real-world datasets collected from a large scale public cloud platform, and the results confirm its effectiveness.

References

Ayman Amin, Lars Grunske, and Alan Colman. 2013. An approach to software reliability prediction based on time series modeling. Journal of Systems and Software, 86, 7 (2013), 1923–1932.Google ScholarDigital Library
Mathias Björkqvist, Sebastiano Spicuglia, Lydia Y. Chen, and Walter Binder. 2013. QoS-Aware Service VM Provisioning in Clouds: Experiences, Models, and Cost Analysis. In Proceedings of ICSOC 2013. 69–83.Google ScholarDigital Library
Rodrigo N Calheiros, Enayat Masoumi, Rajiv Ranjan, and Rajkumar Buyya. 2015. Workload prediction using ARIMA model and its impact on cloud applications’ QoS. IEEE Transactions on Cloud Computing, 3, 4 (2015), 449–458.Google ScholarDigital Library
Martin Duggan, Karl Mason, Jim Duggan, Enda Howley, and Enda Barrett. 2017. Predicting host CPU utilization in cloud computing using recurrent neural networks. In Proceedings of ICITST 2017. 67–72.Google ScholarCross Ref
Christos Faloutsos, Jan Gasthaus, Tim Januschowski, and Yuyang Wang. 2018. Forecasting big time series: Old and new. Proceedings of VLDB 2018, 11, 12 (2018), 2102–2105.Google ScholarDigital Library
Anshul Gandhi, Parijat Dube, Alexei A. Karve, Andrzej Kochut, and Li Zhang. 2020. Providing Performance Guarantees for Cloud-Deployed Applications. IEEE Transactions on Cloud Computing, 8, 1 (2020), 269–281.Google ScholarCross Ref
Yongqiang Gao, Haibing Guan, Zhengwei Qi, Yang Hou, and Liang Liu. 2013. A multi-objective ant colony system algorithm for virtual machine placement in cloud computing. J. Comput. System Sci., 79, 8 (2013), 1230–1242.Google ScholarDigital Library
Albert Greenberg, James Hamilton, David A Maltz, and Parveen Patel. 2008. The cost of a cloud: Research problems in data center networks. ACM SIGCOMM computer communication review, 39, 1 (2008), 68–73.Google ScholarDigital Library
Rohit Gupta, Sumit Kumar Bose, Srikanth Sundarrajan, Manogna Chebiyam, and Anirban Chakrabarti. 2008. A two stage heuristic algorithm for solving the server consolidation problem with item-item and bin-item incompatibility constraints. In Proceedings of SCC 2008. 2, 39–46.Google ScholarDigital Library
Fang Hao, Murali Kodialam, TV Lakshman, and Sarit Mukherjee. 2016. Online allocation of virtual machines in a distributed cloud. IEEE/ACM Transactions on Networking, 25, 1 (2016), 238–249.Google ScholarDigital Library
Sen He, Glenna Manns, John Saunders, Wei Wang, Lori L. Pollock, and Mary Lou Soffa. 2019. A statistics-based performance testing methodology for cloud applications. In Proceedings of ESEC/SIGSOFT FSE 2019. 188–199.Google ScholarDigital Library
Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural Computation, 9, 8 (1997), 1735–1780.Google ScholarDigital Library
Pooyan Jamshidi, Aakash Ahmad, and Claus Pahl. 2014. Autonomic resource provisioning for cloud-based software. In Proceedings of SEAMS 2014. 95–104.Google ScholarDigital Library
Brendan Jennings and Rolf Stadler. 2015. Resource management in clouds: Survey and research challenges. Journal of Network and Systems Management, 23, 3 (2015), 567–619.Google ScholarDigital Library
Yexi Jiang, Chang-Shing Perng, Tao Li, and Rong Chang. 2012. Intelligent cloud capacity management. In Proceedings of NOMS 2012. 502–505.Google Scholar
Yexi Jiang, Chang-shing Perng, Tao Li, and Rong Chang. 2012. Self-adaptive cloud capacity planning. In Proceedings of SCC 2012. 73–80.Google ScholarDigital Library
Yexi Jiang, Chang-Shing Perng, Tao Li, and Rong N Chang. 2013. Cloud analytics for capacity planning and instant vm provisioning. IEEE Transactions on Network and Service Management, 10, 3 (2013), 312–325.Google ScholarCross Ref
Cristian Klein, Martina Maggio, Karl-Erik Årzén, and Francisco Hernández-Rodriguez. 2014. Brownout: Building more robust cloud applications. In Proceedings of ICSE 2014. 700–711.Google ScholarDigital Library
Philipp Leitner, Branimir Wetzstein, Florian Rosenberg, Anton Michlmayr, Schahram Dustdar, and Frank Leymann. 2009. Runtime Prediction of Service Level Agreement Violations for Composite Services. In Proceedings of ICSOC/ServiceWave Workshops 2009. 176–186.Google Scholar
Alexander Lenk, Markus Klems, Jens Nimis, Stefan Tai, and Thomas Sandholm. 2009. What’s inside the Cloud? An architectural map of the Cloud landscape. In Proceedings of ICSE workshop on software engineering challenges of cloud computing 2009. 23–31.Google ScholarDigital Library
Roger J Lewis. 2000. An introduction to classification and regression tree (CART) analysis. In Proceedings of SAEM 2000.Google Scholar
Qingwei Lin, Ken Hsieh, Yingnong Dang, Hongyu Zhang, Kaixin Sui, Yong Xu, Jian-Guang Lou, Chenggang Li, Youjiang Wu, Randolph Yao, Murali Chintalapati, and Dongmei Zhang. 2018. Predicting Node failure in cloud service systems. In Proceedings of ESEC/SIGSOFT FSE 2018. 480–490.Google ScholarDigital Library
Chunhong Liu, Chuanchang Liu, Yanlei Shang, Shiping Chen, Bo Cheng, and Junliang Chen. 2017. An adaptive prediction approach based on workload pattern discrimination in the cloud. Journal of Network and Computer Applications, 80 (2017), 35–44.Google ScholarDigital Library
Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of Machine Learning Research, 9 (2008), 2579–2605.Google Scholar
Sunilkumar S. Manvi and Gopal Krishna Shyam. 2014. Resource management for Infrastructure as a Service (IaaS) in cloud computing: A survey. Journal of Network and Computer Applications, 41 (2014), 424–440.Google ScholarCross Ref
Xiaoqiao Meng, Vasileios Pappas, and Li Zhang. 2010. Improving the scalability of data center networks with traffic-aware virtual machine placement. In Proceedings of INFOCOM 2010. 1–9.Google ScholarCross Ref
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S. Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In Proceedings of NeurIPS 2013. 3111–3119.Google Scholar
Hidemoto Nakada, Takahiro Hirofuchi, Hirotaka Ogawa, and Satoshi Itoh. 2009. Toward virtual machine packing optimization based on genetic algorithm. In Proceedings of IWANN 2009. 651–654.Google ScholarDigital Library
Jennifer Ortiz, Brendan Lee, Magdalena Balazinska, Johannes Gehrke, and Joseph L Hellerstein. 2018. SLAOrchestrator: Reducing the cost of performance SLAs for cloud data analytics. In Proceedings of ATC 2018. 547–560.Google Scholar
Venkateshwar Rao and Sarika Rao. 2012. Application of artificial neural networks in capacity planning of cloud based IT infrastructure. In Proceedings of CCEM 2012. 1–4.Google ScholarCross Ref
Christoph Rathfelder, Samuel Kounev, and David Evans. 2011. Capacity planning for event-based systems using automated performance predictions. In Proceedings of ASE 2011. 352–361.Google ScholarDigital Library
Nilabja Roy, Abhishek Dubey, and Aniruddha Gokhale. 2011. Efficient autoscaling in the cloud using predictive models for workload forecasting. In Proceedings of CLOUD 2011. 500–507.Google ScholarDigital Library
Sukhpal Singh, Inderveer Chana, and Rajkumar Buyya. 2017. STAR: SLA-aware autonomic management of cloud resources. IEEE Transactions on Cloud Computing.Google ScholarCross Ref
Weijia Song, Zhen Xiao, Qi Chen, and Haipeng Luo. 2013. Adaptive resource provisioning for the cloud using online bin packing. IEEE Trans. Comput., 63, 11 (2013), 2647–2660.Google ScholarDigital Library
Sean J Taylor and Benjamin Letham. 2018. Forecasting at scale. The American Statistician, 72, 1 (2018), 37–45.Google ScholarCross Ref
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Ł ukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Proceedings of NeurIPS 2017. 5998–6008.Google Scholar
Jingjing Wang and Magdalena Balazinska. 2017. Elastic memory management for cloud data analytics. In Proceedings of ATC 2017. 745–758.Google Scholar
Rafael Weingärtner, Gabriel Beims Bräscher, and Carlos Becker Westphall. 2015. Cloud resource management: A survey on forecasting and profiling models. Journal of Network and Computer Applications, 47 (2015), 99–106.Google ScholarDigital Library
Ledell Yu Wu, Adam Fisch, Sumit Chopra, Keith Adams, Antoine Bordes, and Jason Weston. 2018. StarSpace: Embed All The Things!. In Proceedings of AAAI 2018. 5569–5577.Google ScholarCross Ref
Bin Xia, Tao Li, Qi-Feng Zhou, Qianmu Li, and Hong Zhang. 2018. An Effective Classification-based Framework for Predicting Cloud Capacity Demand in Cloud Services. IEEE Transactions on Services Computing.Google Scholar
Qi Zhang, Mohamed Faten Zhani, Shuo Zhang, Quanyan Zhu, Raouf Boutaba, and Joseph L Hellerstein. 2012. Dynamic energy-aware capacity provisioning for cloud computing environments. In Proceedings of ICAC 2012. 145–154.Google ScholarDigital Library
Liming Zhu, Yan Liu, Ngoc Bao Bui, and Ian Gorton. 2007. Revel8or: Model Driven Capacity Planning Tool Suite. In Proceedings of ICSE 2007. 797–800.Google ScholarDigital Library
Yi Zhu, Yan Liang, Qiong Zhang, Xi Wang, Paparao Palacharla, and Motoyoshi Sekiya. 2014. Reliable resource allocation for optically interconnected distributed clouds. In Proceedings of ICC 2014. 3301–3306.Google ScholarCross Ref

Index Terms

Effective low capacity status prediction for cloud systems
1. Computer systems organization
  1. Architectures
    1. Distributed architectures
      1. Cloud computing

Recommendations

Function points-based resource prediction in cloud computing

As a result of varying demands of computing resources by the users on cloud, resource provisioning in cloud computing has come out as a prominent topic of research. Many researchers have focused exclusively on the technical and security aspects of cloud ...
Read More
Cloud service engineering
ICSE '10: Proceedings of the 32nd ACM/IEEE International Conference on Software Engineering - Volume 2

Building on compute and storage virtualization, Cloud Computing provides scalable, network-centric, abstracted IT infrastructure, platforms, and applications as on-demand services that are billed by consumption. Cloud Service Engineering is the ...
Read More
Resource Monitoring and Prediction in Cloud Computing Environments
ACIT-CSI '15: Proceedings of the 2015 3rd International Conference on Applied Computing and Information Technology/2nd International Conference on Computational Science and Intelligence

Cloud computing provides elastic, scalable resource sharing service by resource management. Resource monitoring and prediction are the foundation to achieve resource automation, high performance management in cloud computing environment. This paper ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
ESEC/FSE 2021: Proceedings of the 29th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering
August 2021
1690 pages
ISBN:9781450385626
DOI:10.1145/3468264
General Chairs:
Diomidis Spinellis
Athens University of Economics and Business, Greece
,
Georgios Gousios
Facebook, Netherlands / Delft University of Technology, Netherlands
,
Program Chairs:
Marsha Chechik
University of Toronto, Canada
,
Massimiliano Di Penta
University of Sannio, Italy
Copyright © 2021 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 18 August 2021
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
capacity prediction
cloud computing
feature embedding
software reliability
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate112of543submissions,21%
Upcoming Conference
FSE '24

Sponsor:

sigsoft

32nd ACM International Conference on the Foundations of Software Engineering

July 15 - 19, 2024

Ipojuca (Pernambuco) , Brazil
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 1
  Total Citations
  View Citations
- 247
  Total Downloads
- Downloads (Last 12 months)43
- Downloads (Last 6 weeks)3
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Effective low capacity status prediction for cloud systems

ESEC/FSE 2021: Proceedings of the 29th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering

ABSTRACT

References

Cited By

Index Terms

Recommendations

Function points-based resource prediction in cloud computing

Cloud service engineering

Resource Monitoring and Prediction in Cloud Computing Environments