research-article

Is Sharing Caring? Analyzing the Incentives for Shared Cloud Clusters

Authors:

David IrwinAuthors Info & Claims

ICPE '23: Proceedings of the 2023 ACM/SPEC International Conference on Performance Engineering

Pages 7 - 16

https://doi.org/10.1145/3578244.3583730

Published: 15 April 2023 Publication History

Abstract

Many organizations maintain and operate large shared computing clusters, since they can substantially reduce computing costs by leveraging statistical multiplexing to amortize it across all users. Importantly, such shared clusters are generally not free to use, but have an internal pricing model that funds their operation. Since employees at many large organizations, especially Universities, have some budgetary autonomy over purchase decisions, internal shared clusters are increasingly competing for users with cloud platforms, which may offer lower costs and better performance. As a result, many organizations are shifting their shared clusters to operate on cloud resources. This paper empirically analyzes the user incentives for shared cloud clusters under two different pricing models using an 8-year job trace from a large shared cluster for a large University system.

Our analysis shows that, with either pricing model, a large fraction of users have little financial incentive to participate in a shared cloud cluster compared to directly acquiring resources from a cloud platform. While shared cloud clusters can provide some limited reductions in cost by leveraging reserved instances at a discount, due to bursty workloads, realizing these reductions generally requires imposing long job waiting times, which for many users are likely not worth the cost reduction. In particular, we show that, assuming users defect from the shared cluster if their wait time is greater than 15x their average job runtime, over 80% of the users would defect, which increases the price of the remaining users such that it eliminates any incentive to participate in a shared cluster. Thus, while shared cloud clusters may provide users other benefits, their financial incentives are weak.

References

[1]

2022. Amazon EC2 Spot Instances. https://aws.amazon.com/ec2/spot/.

[2]

2022. AWS - Discounts on Reserving Resources. https://aws.amazon.com/ec2/pricing/reserved-instances/pricing/.

[3]

2022. AWS ParallelCluster Auto Scaling. https://docs.aws.amazon.com/parallelcluster/latest/ug/autoscaling.html.

[4]

2022. Azure Spot Virtual Machines. https://azure.microsoft.com/en-us/products/virtual-machines/spot/.

[5]

2022. Cloud Cost Optimizer. https://research.redhat.com/blog/research_project/cloud-cost-optimizer/.

[6]

2022. Cloud Growth in Future. https://www.globenewswire.com/news-release/2022/05/06/2437934/0/en/Cloud-Computing-Market-to-Grow-at-a-CAGR-of-11-until-2028-BlueWeave-Consulting.html.

[7]

2022. Curator. https://github.com/operate-first/curator/.

[8]

2022. Google Preemptible Virtual Machines. https://cloud.google.com/compute/docs/instances/preemptible.

[9]

2022. Kubernetes on AWS. https://aws.amazon.com/kubernetes/.

[10]

2022. On-Prem Computing. https://www.techslang.com/definition/what-is-on-premises/.

[11]

2022. On-Prem Computing, Expensive than Cloud. https://www.executech.com/insights/the-cloud-vs-on-premise-cost-comparison/.

[12]

2022. Privacy and Regulatory on On-Prem Computing. https://www.cleo.com/blog/knowledge-base-on-premise-vs-cloud.

[13]

2022. Rapid Growth of Cloud. https://www.capacitymedia.com/article/2afswwuvis94wy12r320w/news/google-cloud-growing-45-a-year-with-azure-at-40-says-canalys.

[14]

2023. Job Simulator. https://github.com/sustainablecomputinglab/waitinggame/tree/master/simulator.

[15]

2023. University of Massachusetts Green High Performance Computing Cluster. http://wiki.umassrc.org/wiki/index.php/MainPage.

[16]

Abdullah Alzaqebah, Rizik Al-Sayyed, and Raja Masadeh. 2019. Task Scheduling Based on Modified Grey Wolf Optimizer in Cloud Computing Environment. In 2019 2nd International Conference on new Trends in Computing Sciences (ICTCS). IEEE.

[17]

Pradeep Ambati, Noman Bashir, David Irwin, and Prashant Shenoy. 2020. Waiting Game: Optimally Provisioning Fixed Resources for Cloud-Enabled Schedulers. In International Conference for High Performance Computing, Networking, Storage and Analysis (SC). IEEE.

[18]

Pradeep Ambati, Noman Bashir, David Irwin, and Prashant Shenoy. 2021. Good Things Come to Those Who Wait: Optimizing Job Waiting in the Cloud. In Proceedings of the ACM Symposium on Cloud Computing.

Digital Library

[19]

Tekin Bicer, David Chiu, and Gagan Agrawal. 2011. A Framework for Data-Intensive Computing with Cloud Bursting. In IEEE International Conference on Cluster Computing. IEEE.

Digital Library

[20]

Kavitha Chandra. 2003. Statistical Multiplexing. Wiley Encyclopedia of Telecommunications 5 (January 2003).

[21]

Tian Guo, Upendra Sharma, Prashant Shenoy, Timothy Wood, and Sambit Sahu. 2014. Cost-Aware Cloud Bursting for Enterprise Applications. ACM Transactions on Internet Technology (TOIT) (2014).

[22]

Tian Guo, Upendra Sharma, Timothy Wood, Sambit Sahu, and Prashant Shenoy. 2012. Seagull: Intelligent Cloud Bursting for Enterprise Applications. In USENIX Annual Technical Conference.

[23]

Yu-Ju Hong, Jiachen Xue, and Mithuna Thottethodi. 2011. Dynamic Server Provisioning to Minimize Cost in an IaaS Cloud. In Special Interest Group on Measurement and Evaluation (SIGMETRICS).

[24]

Menglan Hu, Jun Luo, and Bharadwaj Veeravalli. 2012. Optimal Provisioning for Scheduling Divisible Loads with Reserved Cloud Resources. In IEEE International Conference on Networks (ICON).

[25]

Sriram Kailasam, Nathan Gnanasambandam, Janakiram Dharanipragada, and Naveen Sharma. 2010. Optimizing Service Level Agreements for Autonomic Cloud Bursting Schedulers. In International Conference on Parallel Processing Workshops. IEEE.

[26]

Michael Kuchnik, Jun Woo Park, Chuck Cranor, Elisabeth Moore, Nathan DeBardeleben, and George Amvrosiadis. 2019. This is Why ML-driven Cluster Scheduling Remains Widely Impractical. Technical Report (2019).

[27]

Tania Lorido-Botran, Jose Miguel-Alonso, and Jose A Lozano. 2014. A Review of Auto-Scaling Techniques for Elastic Applications in Cloud Environments. Journal of Grid Computing (2014).

[28]

Marko Luksa. 2017. Kubernetes in Action. Simon and Schuster.

[29]

Michael Mattess, Christian Vecchiola, Saurabh Kumar Garg, and Rajkumar Buyya. 2011. Cloud Bursting: Managing Peak Loads by Leasing Public Cloud Services. In Cloud Computing: Methodology, Systems, and Applications. CRC Press.

[30]

Shuangcheng Niu, Jidong Zhai, Xiaosong Ma, Xiongchao Tang, and Wenguang Chen. 2013. Cost-effective Cloud HPC Resource Provisioning by Building Semi- Elastic Virtual Clusters. In The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC).

Digital Library

[31]

Siqi Shen, Kefeng Deng, Alexandru Iosup, and Dick Epema. 2013. Scheduling Jobs in the Cloud using On-demand and Reserved Instances. In International European Conference on Parallel and Distributed Computing (Euro-Par).

Digital Library

[32]

Abraham Silberschatz, Peter B Galvin, and Greg Gagne. 2018. Operating System Concepts, 10e Abridged Print Companion. John Wiley & Sons.

[33]

Jose Luis Lucas Simarro, Rafael Moreno-Vozmediano, Ruben S Montero, and Ignacio Martín Llorente. 2011. Dynamic Placement of Virtual Machines for Cost Optimization in Multi-Cloud Environments. In 2011 International Conference on High Performance Computing & Simulation. IEEE.

[34]

Ruben Van den Bossche, Kurt Vanmechelen, and Jan Broeckhove. 2015. IaaS Reserved Contract Procurement Optimisation with Load Prediction. Future Generation Computer Systems (2015).

[35]

Wei Wang, Baochun Li, and Ben Liang. 2013. To Reserve or Not to Reserve: Optimal Online Multi-Instance Aquisition in IaaS Clouds. In International Conference on Autonomic Computing (ICAC).

[36]

Andy B Yoo, Morris A Jette, and Mark Grondona. 2003. Slurm: Simple linux Utility for Resource Management. In Job Scheduling Strategies for Parallel Processing. Springer.

Index Terms

Is Sharing Caring? Analyzing the Incentives for Shared Cloud Clusters
1. Computer systems organization
  1. Architectures
    1. Distributed architectures
      1. Cloud computing

Recommendations

Self-managed cost-efficient virtual elastic clusters on hybrid Cloud infrastructures

In this study, we describe the further development of Elastic Cloud Computing Cluster (EC3), a tool for creating self-managed cost-efficient virtual hybrid elastic clusters on top of Infrastructure as a Service (IaaS) clouds. By using spot instances and ...
Secret Sharing Scheme Suitable for Cloud Computing
AINA '13: Proceedings of the 2013 IEEE 27th International Conference on Advanced Information Networking and Applications

Secret sharing schemes have recently been considered to apply for cloud computing in which many users distribute multiple data to servers. However, when Shamir's (k, n) secret sharing is applied to cloud systems, the amount of share increases more than ...
An Analysis of Provisioning and Allocation Policies for Infrastructure-as-a-Service Clouds
CCGRID '12: Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)

Today, many commercial and private cloud computing providers offer resources for leasing under the infrastructure as a service (IaaS) paradigm. Although an abundance of mechanisms already facilitate the lease and use of single infrastructure resources, ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

ICPE '23: Proceedings of the 2023 ACM/SPEC International Conference on Performance Engineering

April 2023

244 pages

ISBN:9798400700682

DOI:10.1145/3578244

General Chairs:
Marco Vieira
University of Coimbra, Portugal
,
Valeria Cardellini
University of Rome Tor Vergata, Italy
,
Program Chairs:
Antinisca Di Marco
University of L'Aquila, Italy
,
Petr Tuma
Charles University, Czechia

Copyright © 2023 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 April 2023

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

National Science Foundation
National Science Foundation
NSF

Conference

ICPE '23

Sponsor:

ICPE '23: ACM/SPEC International Conference on Performance Engineering

April 15 - 19, 2023

Coimbra, Portugal

Acceptance Rates

ICPE '23 Paper Acceptance Rate 15 of 46 submissions, 33%;

Overall Acceptance Rate 252 of 851 submissions, 30%

Upcoming Conference

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
138
Total Downloads

Downloads (Last 12 months)36
Downloads (Last 6 weeks)1

Reflects downloads up to 20 Jan 2025

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents