skip to main content
10.1145/3578244.3583730acmconferencesArticle/Chapter ViewAbstractPublication PagesicpeConference Proceedingsconference-collections
research-article

Is Sharing Caring? Analyzing the Incentives for Shared Cloud Clusters

Published: 15 April 2023 Publication History

Abstract

Many organizations maintain and operate large shared computing clusters, since they can substantially reduce computing costs by leveraging statistical multiplexing to amortize it across all users. Importantly, such shared clusters are generally not free to use, but have an internal pricing model that funds their operation. Since employees at many large organizations, especially Universities, have some budgetary autonomy over purchase decisions, internal shared clusters are increasingly competing for users with cloud platforms, which may offer lower costs and better performance. As a result, many organizations are shifting their shared clusters to operate on cloud resources. This paper empirically analyzes the user incentives for shared cloud clusters under two different pricing models using an 8-year job trace from a large shared cluster for a large University system.
Our analysis shows that, with either pricing model, a large fraction of users have little financial incentive to participate in a shared cloud cluster compared to directly acquiring resources from a cloud platform. While shared cloud clusters can provide some limited reductions in cost by leveraging reserved instances at a discount, due to bursty workloads, realizing these reductions generally requires imposing long job waiting times, which for many users are likely not worth the cost reduction. In particular, we show that, assuming users defect from the shared cluster if their wait time is greater than 15x their average job runtime, over 80% of the users would defect, which increases the price of the remaining users such that it eliminates any incentive to participate in a shared cluster. Thus, while shared cloud clusters may provide users other benefits, their financial incentives are weak.

References

[1]
2022. Amazon EC2 Spot Instances. https://aws.amazon.com/ec2/spot/.
[2]
2022. AWS - Discounts on Reserving Resources. https://aws.amazon.com/ec2/pricing/reserved-instances/pricing/.
[3]
2022. AWS ParallelCluster Auto Scaling. https://docs.aws.amazon.com/parallelcluster/latest/ug/autoscaling.html.
[4]
2022. Azure Spot Virtual Machines. https://azure.microsoft.com/en-us/products/virtual-machines/spot/.
[5]
2022. Cloud Cost Optimizer. https://research.redhat.com/blog/research_project/cloud-cost-optimizer/.
[6]
2022. Cloud Growth in Future. https://www.globenewswire.com/news-release/2022/05/06/2437934/0/en/Cloud-Computing-Market-to-Grow-at-a-CAGR-of-11-until-2028-BlueWeave-Consulting.html.
[7]
2022. Curator. https://github.com/operate-first/curator/.
[8]
2022. Google Preemptible Virtual Machines. https://cloud.google.com/compute/docs/instances/preemptible.
[9]
2022. Kubernetes on AWS. https://aws.amazon.com/kubernetes/.
[10]
2022. On-Prem Computing. https://www.techslang.com/definition/what-is-on-premises/.
[11]
2022. On-Prem Computing, Expensive than Cloud. https://www.executech.com/insights/the-cloud-vs-on-premise-cost-comparison/.
[12]
2022. Privacy and Regulatory on On-Prem Computing. https://www.cleo.com/blog/knowledge-base-on-premise-vs-cloud.
[13]
2022. Rapid Growth of Cloud. https://www.capacitymedia.com/article/2afswwuvis94wy12r320w/news/google-cloud-growing-45-a-year-with-azure-at-40-says-canalys.
[14]
2023. Job Simulator. https://github.com/sustainablecomputinglab/waitinggame/tree/master/simulator.
[15]
2023. University of Massachusetts Green High Performance Computing Cluster. http://wiki.umassrc.org/wiki/index.php/MainPage.
[16]
Abdullah Alzaqebah, Rizik Al-Sayyed, and Raja Masadeh. 2019. Task Scheduling Based on Modified Grey Wolf Optimizer in Cloud Computing Environment. In 2019 2nd International Conference on new Trends in Computing Sciences (ICTCS). IEEE.
[17]
Pradeep Ambati, Noman Bashir, David Irwin, and Prashant Shenoy. 2020. Waiting Game: Optimally Provisioning Fixed Resources for Cloud-Enabled Schedulers. In International Conference for High Performance Computing, Networking, Storage and Analysis (SC). IEEE.
[18]
Pradeep Ambati, Noman Bashir, David Irwin, and Prashant Shenoy. 2021. Good Things Come to Those Who Wait: Optimizing Job Waiting in the Cloud. In Proceedings of the ACM Symposium on Cloud Computing.
[19]
Tekin Bicer, David Chiu, and Gagan Agrawal. 2011. A Framework for Data-Intensive Computing with Cloud Bursting. In IEEE International Conference on Cluster Computing. IEEE.
[20]
Kavitha Chandra. 2003. Statistical Multiplexing. Wiley Encyclopedia of Telecommunications 5 (January 2003).
[21]
Tian Guo, Upendra Sharma, Prashant Shenoy, Timothy Wood, and Sambit Sahu. 2014. Cost-Aware Cloud Bursting for Enterprise Applications. ACM Transactions on Internet Technology (TOIT) (2014).
[22]
Tian Guo, Upendra Sharma, Timothy Wood, Sambit Sahu, and Prashant Shenoy. 2012. Seagull: Intelligent Cloud Bursting for Enterprise Applications. In USENIX Annual Technical Conference.
[23]
Yu-Ju Hong, Jiachen Xue, and Mithuna Thottethodi. 2011. Dynamic Server Provisioning to Minimize Cost in an IaaS Cloud. In Special Interest Group on Measurement and Evaluation (SIGMETRICS).
[24]
Menglan Hu, Jun Luo, and Bharadwaj Veeravalli. 2012. Optimal Provisioning for Scheduling Divisible Loads with Reserved Cloud Resources. In IEEE International Conference on Networks (ICON).
[25]
Sriram Kailasam, Nathan Gnanasambandam, Janakiram Dharanipragada, and Naveen Sharma. 2010. Optimizing Service Level Agreements for Autonomic Cloud Bursting Schedulers. In International Conference on Parallel Processing Workshops. IEEE.
[26]
Michael Kuchnik, Jun Woo Park, Chuck Cranor, Elisabeth Moore, Nathan DeBardeleben, and George Amvrosiadis. 2019. This is Why ML-driven Cluster Scheduling Remains Widely Impractical. Technical Report (2019).
[27]
Tania Lorido-Botran, Jose Miguel-Alonso, and Jose A Lozano. 2014. A Review of Auto-Scaling Techniques for Elastic Applications in Cloud Environments. Journal of Grid Computing (2014).
[28]
Marko Luksa. 2017. Kubernetes in Action. Simon and Schuster.
[29]
Michael Mattess, Christian Vecchiola, Saurabh Kumar Garg, and Rajkumar Buyya. 2011. Cloud Bursting: Managing Peak Loads by Leasing Public Cloud Services. In Cloud Computing: Methodology, Systems, and Applications. CRC Press.
[30]
Shuangcheng Niu, Jidong Zhai, Xiaosong Ma, Xiongchao Tang, and Wenguang Chen. 2013. Cost-effective Cloud HPC Resource Provisioning by Building Semi- Elastic Virtual Clusters. In The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC).
[31]
Siqi Shen, Kefeng Deng, Alexandru Iosup, and Dick Epema. 2013. Scheduling Jobs in the Cloud using On-demand and Reserved Instances. In International European Conference on Parallel and Distributed Computing (Euro-Par).
[32]
Abraham Silberschatz, Peter B Galvin, and Greg Gagne. 2018. Operating System Concepts, 10e Abridged Print Companion. John Wiley & Sons.
[33]
Jose Luis Lucas Simarro, Rafael Moreno-Vozmediano, Ruben S Montero, and Ignacio Martín Llorente. 2011. Dynamic Placement of Virtual Machines for Cost Optimization in Multi-Cloud Environments. In 2011 International Conference on High Performance Computing & Simulation. IEEE.
[34]
Ruben Van den Bossche, Kurt Vanmechelen, and Jan Broeckhove. 2015. IaaS Reserved Contract Procurement Optimisation with Load Prediction. Future Generation Computer Systems (2015).
[35]
Wei Wang, Baochun Li, and Ben Liang. 2013. To Reserve or Not to Reserve: Optimal Online Multi-Instance Aquisition in IaaS Clouds. In International Conference on Autonomic Computing (ICAC).
[36]
Andy B Yoo, Morris A Jette, and Mark Grondona. 2003. Slurm: Simple linux Utility for Resource Management. In Job Scheduling Strategies for Parallel Processing. Springer.

Index Terms

  1. Is Sharing Caring? Analyzing the Incentives for Shared Cloud Clusters

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    ICPE '23: Proceedings of the 2023 ACM/SPEC International Conference on Performance Engineering
    April 2023
    244 pages
    ISBN:9798400700682
    DOI:10.1145/3578244
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 15 April 2023

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. cloud computing
    2. provisioning policies
    3. trace analysis

    Qualifiers

    • Research-article

    Funding Sources

    • National Science Foundation
    • National Science Foundation
    • NSF

    Conference

    ICPE '23

    Acceptance Rates

    ICPE '23 Paper Acceptance Rate 15 of 46 submissions, 33%;
    Overall Acceptance Rate 252 of 851 submissions, 30%

    Upcoming Conference

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 138
      Total Downloads
    • Downloads (Last 12 months)36
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 20 Jan 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media