skip to main content
10.1145/3578244.3583730acmconferencesArticle/Chapter ViewAbstractPublication PagesicpeConference Proceedingsconference-collections
research-article

Is Sharing Caring? Analyzing the Incentives for Shared Cloud Clusters

Published:15 April 2023Publication History

ABSTRACT

Many organizations maintain and operate large shared computing clusters, since they can substantially reduce computing costs by leveraging statistical multiplexing to amortize it across all users. Importantly, such shared clusters are generally not free to use, but have an internal pricing model that funds their operation. Since employees at many large organizations, especially Universities, have some budgetary autonomy over purchase decisions, internal shared clusters are increasingly competing for users with cloud platforms, which may offer lower costs and better performance. As a result, many organizations are shifting their shared clusters to operate on cloud resources. This paper empirically analyzes the user incentives for shared cloud clusters under two different pricing models using an 8-year job trace from a large shared cluster for a large University system.

Our analysis shows that, with either pricing model, a large fraction of users have little financial incentive to participate in a shared cloud cluster compared to directly acquiring resources from a cloud platform. While shared cloud clusters can provide some limited reductions in cost by leveraging reserved instances at a discount, due to bursty workloads, realizing these reductions generally requires imposing long job waiting times, which for many users are likely not worth the cost reduction. In particular, we show that, assuming users defect from the shared cluster if their wait time is greater than 15x their average job runtime, over 80% of the users would defect, which increases the price of the remaining users such that it eliminates any incentive to participate in a shared cluster. Thus, while shared cloud clusters may provide users other benefits, their financial incentives are weak.

References

  1. 2022. Amazon EC2 Spot Instances. https://aws.amazon.com/ec2/spot/.Google ScholarGoogle Scholar
  2. 2022. AWS - Discounts on Reserving Resources. https://aws.amazon.com/ec2/pricing/reserved-instances/pricing/.Google ScholarGoogle Scholar
  3. 2022. AWS ParallelCluster Auto Scaling. https://docs.aws.amazon.com/parallelcluster/latest/ug/autoscaling.html.Google ScholarGoogle Scholar
  4. 2022. Azure Spot Virtual Machines. https://azure.microsoft.com/en-us/products/virtual-machines/spot/.Google ScholarGoogle Scholar
  5. 2022. Cloud Cost Optimizer. https://research.redhat.com/blog/research_project/cloud-cost-optimizer/.Google ScholarGoogle Scholar
  6. 2022. Cloud Growth in Future. https://www.globenewswire.com/news-release/2022/05/06/2437934/0/en/Cloud-Computing-Market-to-Grow-at-a-CAGR-of-11-until-2028-BlueWeave-Consulting.html.Google ScholarGoogle Scholar
  7. 2022. Curator. https://github.com/operate-first/curator/.Google ScholarGoogle Scholar
  8. 2022. Google Preemptible Virtual Machines. https://cloud.google.com/compute/docs/instances/preemptible.Google ScholarGoogle Scholar
  9. 2022. Kubernetes on AWS. https://aws.amazon.com/kubernetes/.Google ScholarGoogle Scholar
  10. 2022. On-Prem Computing. https://www.techslang.com/definition/what-is-on-premises/.Google ScholarGoogle Scholar
  11. 2022. On-Prem Computing, Expensive than Cloud. https://www.executech.com/insights/the-cloud-vs-on-premise-cost-comparison/.Google ScholarGoogle Scholar
  12. 2022. Privacy and Regulatory on On-Prem Computing. https://www.cleo.com/blog/knowledge-base-on-premise-vs-cloud.Google ScholarGoogle Scholar
  13. 2022. Rapid Growth of Cloud. https://www.capacitymedia.com/article/2afswwuvis94wy12r320w/news/google-cloud-growing-45-a-year-with-azure-at-40-says-canalys.Google ScholarGoogle Scholar
  14. 2023. Job Simulator. https://github.com/sustainablecomputinglab/waitinggame/tree/master/simulator.Google ScholarGoogle Scholar
  15. 2023. University of Massachusetts Green High Performance Computing Cluster. http://wiki.umassrc.org/wiki/index.php/MainPage.Google ScholarGoogle Scholar
  16. Abdullah Alzaqebah, Rizik Al-Sayyed, and Raja Masadeh. 2019. Task Scheduling Based on Modified Grey Wolf Optimizer in Cloud Computing Environment. In 2019 2nd International Conference on new Trends in Computing Sciences (ICTCS). IEEE.Google ScholarGoogle Scholar
  17. Pradeep Ambati, Noman Bashir, David Irwin, and Prashant Shenoy. 2020. Waiting Game: Optimally Provisioning Fixed Resources for Cloud-Enabled Schedulers. In International Conference for High Performance Computing, Networking, Storage and Analysis (SC). IEEE.Google ScholarGoogle Scholar
  18. Pradeep Ambati, Noman Bashir, David Irwin, and Prashant Shenoy. 2021. Good Things Come to Those Who Wait: Optimizing Job Waiting in the Cloud. In Proceedings of the ACM Symposium on Cloud Computing.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Tekin Bicer, David Chiu, and Gagan Agrawal. 2011. A Framework for Data-Intensive Computing with Cloud Bursting. In IEEE International Conference on Cluster Computing. IEEE.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Kavitha Chandra. 2003. Statistical Multiplexing. Wiley Encyclopedia of Telecommunications 5 (January 2003).Google ScholarGoogle ScholarCross RefCross Ref
  21. Tian Guo, Upendra Sharma, Prashant Shenoy, Timothy Wood, and Sambit Sahu. 2014. Cost-Aware Cloud Bursting for Enterprise Applications. ACM Transactions on Internet Technology (TOIT) (2014).Google ScholarGoogle Scholar
  22. Tian Guo, Upendra Sharma, Timothy Wood, Sambit Sahu, and Prashant Shenoy. 2012. Seagull: Intelligent Cloud Bursting for Enterprise Applications. In USENIX Annual Technical Conference.Google ScholarGoogle Scholar
  23. Yu-Ju Hong, Jiachen Xue, and Mithuna Thottethodi. 2011. Dynamic Server Provisioning to Minimize Cost in an IaaS Cloud. In Special Interest Group on Measurement and Evaluation (SIGMETRICS).Google ScholarGoogle Scholar
  24. Menglan Hu, Jun Luo, and Bharadwaj Veeravalli. 2012. Optimal Provisioning for Scheduling Divisible Loads with Reserved Cloud Resources. In IEEE International Conference on Networks (ICON).Google ScholarGoogle Scholar
  25. Sriram Kailasam, Nathan Gnanasambandam, Janakiram Dharanipragada, and Naveen Sharma. 2010. Optimizing Service Level Agreements for Autonomic Cloud Bursting Schedulers. In International Conference on Parallel Processing Workshops. IEEE.Google ScholarGoogle Scholar
  26. Michael Kuchnik, Jun Woo Park, Chuck Cranor, Elisabeth Moore, Nathan DeBardeleben, and George Amvrosiadis. 2019. This is Why ML-driven Cluster Scheduling Remains Widely Impractical. Technical Report (2019).Google ScholarGoogle Scholar
  27. Tania Lorido-Botran, Jose Miguel-Alonso, and Jose A Lozano. 2014. A Review of Auto-Scaling Techniques for Elastic Applications in Cloud Environments. Journal of Grid Computing (2014).Google ScholarGoogle Scholar
  28. Marko Luksa. 2017. Kubernetes in Action. Simon and Schuster.Google ScholarGoogle Scholar
  29. Michael Mattess, Christian Vecchiola, Saurabh Kumar Garg, and Rajkumar Buyya. 2011. Cloud Bursting: Managing Peak Loads by Leasing Public Cloud Services. In Cloud Computing: Methodology, Systems, and Applications. CRC Press.Google ScholarGoogle Scholar
  30. Shuangcheng Niu, Jidong Zhai, Xiaosong Ma, Xiongchao Tang, and Wenguang Chen. 2013. Cost-effective Cloud HPC Resource Provisioning by Building Semi- Elastic Virtual Clusters. In The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC).Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Siqi Shen, Kefeng Deng, Alexandru Iosup, and Dick Epema. 2013. Scheduling Jobs in the Cloud using On-demand and Reserved Instances. In International European Conference on Parallel and Distributed Computing (Euro-Par).Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Abraham Silberschatz, Peter B Galvin, and Greg Gagne. 2018. Operating System Concepts, 10e Abridged Print Companion. John Wiley & Sons.Google ScholarGoogle Scholar
  33. Jose Luis Lucas Simarro, Rafael Moreno-Vozmediano, Ruben S Montero, and Ignacio Martín Llorente. 2011. Dynamic Placement of Virtual Machines for Cost Optimization in Multi-Cloud Environments. In 2011 International Conference on High Performance Computing & Simulation. IEEE.Google ScholarGoogle Scholar
  34. Ruben Van den Bossche, Kurt Vanmechelen, and Jan Broeckhove. 2015. IaaS Reserved Contract Procurement Optimisation with Load Prediction. Future Generation Computer Systems (2015).Google ScholarGoogle Scholar
  35. Wei Wang, Baochun Li, and Ben Liang. 2013. To Reserve or Not to Reserve: Optimal Online Multi-Instance Aquisition in IaaS Clouds. In International Conference on Autonomic Computing (ICAC).Google ScholarGoogle Scholar
  36. Andy B Yoo, Morris A Jette, and Mark Grondona. 2003. Slurm: Simple linux Utility for Resource Management. In Job Scheduling Strategies for Parallel Processing. Springer.Google ScholarGoogle Scholar

Index Terms

  1. Is Sharing Caring? Analyzing the Incentives for Shared Cloud Clusters

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      ICPE '23: Proceedings of the 2023 ACM/SPEC International Conference on Performance Engineering
      April 2023
      244 pages
      ISBN:9798400700682
      DOI:10.1145/3578244

      Copyright © 2023 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 15 April 2023

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      ICPE '23 Paper Acceptance Rate15of46submissions,33%Overall Acceptance Rate252of851submissions,30%

      Upcoming Conference

    • Article Metrics

      • Downloads (Last 12 months)125
      • Downloads (Last 6 weeks)3

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader