DOI: 10.1145/2541940.2541966
research-article

Underprovisioning backup power infrastructure for datacenters

Published: 24 February 2014

Abstract

While prior work has explored underprovisioning the power distribution infrastructure of a datacenter to save costs, the ability to underprovision the backup power infrastructure, which contributes significantly to capital costs, is little explored. The backup infrastructure has two main components, Diesel Generators (DGs) and UPS units, both of which can be underprovisioned (or even removed) in terms of their power and/or energy capacities. However, such underprovisioning mandates studying several ramifications concurrently: the resulting cost savings, the lower availability, and the performance and state-loss consequences for individual applications. This paper presents the first such study, considering the cost, availability, performance and application consequences of underprovisioning the backup power infrastructure. We present a framework to quantify the cost of the backup capacity that is provisioned, and implement techniques that leverage existing software and hardware mechanisms to provide as seamless an operation as possible for an application within the provisioned backup capacity during a power outage. We evaluate the cost-performance-availability trade-offs for different levels of backup underprovisioning for applications with diverse reliance on the backup infrastructure. Our results show that one may be able to do away with DGs completely, compensating for them with additional UPS energy capacity, to cut costs significantly while still handling power outages lasting as long as 40 minutes (which constitute the bulk of outages). Further, we can push the limits of the outage duration that can be handled cost-effectively if applications are willing to tolerate degraded performance during the outage. Our evaluations also show that different applications react differently to the outage handling mechanisms, and that the efficacy of the mechanisms is sensitive to the outage duration. The insights from this paper can spur new opportunities for future work on backup power infrastructure optimization.
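
To make the energy-capacity trade-off in the abstract concrete, the following is a minimal back-of-the-envelope sketch, not the paper's framework. It assumes an idealized UPS (no conversion losses or battery discharge effects) and a constant load. The function names, the 1000 kW load, and the `power_fraction` knob for degraded operation are all hypothetical values chosen for illustration; only the 40-minute outage figure comes from the abstract.

```python
def ups_energy_kwh(load_kw: float, outage_minutes: float,
                   power_fraction: float = 1.0) -> float:
    """Energy (kWh) an idealized UPS must deliver to ride out an outage.

    power_fraction < 1.0 models running in a degraded, lower-power mode
    during the outage (an illustrative assumption, not a measured value).
    """
    if load_kw < 0 or outage_minutes < 0:
        raise ValueError("load and outage duration must be non-negative")
    if not 0.0 <= power_fraction <= 1.0:
        raise ValueError("power_fraction must be between 0 and 1")
    return load_kw * power_fraction * (outage_minutes / 60.0)


def max_outage_minutes(ups_kwh: float, load_kw: float,
                       power_fraction: float = 1.0) -> float:
    """Longest outage (minutes) a given UPS energy capacity can cover."""
    draw_kw = load_kw * power_fraction
    if draw_kw <= 0:
        return float("inf")  # nothing is drawing power
    return ups_kwh / draw_kw * 60.0


# Hypothetical 1000 kW load riding out a 40-minute outage at full power.
full_power_kwh = ups_energy_kwh(1000.0, 40.0)

# The same capacity, with the load throttled to half power.
degraded_minutes = max_outage_minutes(full_power_kwh, 1000.0, power_fraction=0.5)
```

Under these assumptions, halving the power draw doubles the outage duration that the same UPS energy capacity can cover, which is the intuition behind tolerating degraded performance to handle longer outages. A real sizing exercise would also need to account for UPS power capacity, battery efficiency, and per-application state-loss behavior.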

    Information

    Published In

    ASPLOS '14: Proceedings of the 19th international conference on Architectural support for programming languages and operating systems
    February 2014
    780 pages
    ISBN:9781450323055
    DOI:10.1145/2541940
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. backup power infrastructure
    2. datacenters
    3. diesel generator
    4. underprovision
    5. ups

    Qualifiers

    • Research-article

    Conference

    ASPLOS '14

    Acceptance Rates

    ASPLOS '14 Paper Acceptance Rate 49 of 217 submissions, 23%;
    Overall Acceptance Rate 535 of 2,713 submissions, 20%

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months): 26
    • Downloads (Last 6 weeks): 0
    Reflects downloads up to 08 Feb 2025

    Cited By

    • (2024) Power Demand Reshaping Using Energy Storage for Distributed Edge Clouds. IEEE Transactions on Parallel and Distributed Systems 35:2 (362-376). DOI: 10.1109/TPDS.2023.3347774. Online publication date: Feb-2024
    • (2024) Proposal of Battery Back-up Unit (BBU) Configuration for Data Centers. 2024 IEEE Energy Conversion Congress and Exposition (ECCE) (1791-1796). DOI: 10.1109/ECCE55643.2024.10861418. Online publication date: 20-Oct-2024
    • (2022) Integrated Power Anomaly Defense: Towards Oversubscription-Safe Data Centers. IEEE Transactions on Cloud Computing 10:3 (1875-1887). DOI: 10.1109/TCC.2020.3001454. Online publication date: 1-Jul-2022
    • (2022) Model predictive control of a dual fuel engine integrated with waste heat recovery used for electric power in buildings. Optimal Control Applications and Methods 44:2 (699-718). DOI: 10.1002/oca.2858. Online publication date: 27-Jan-2022
    • (2021) GreenHetero: Adaptive Power Allocation for Heterogeneous Green Datacenters. 2021 IEEE 41st International Conference on Distributed Computing Systems (ICDCS) (160-170). DOI: 10.1109/ICDCS51616.2021.00024. Online publication date: Jul-2021
    • (2021) Heat Behind the Meter: A Hidden Threat of Thermal Attacks in Edge Colocation Data Centers. 2021 IEEE International Symposium on High-Performance Computer Architecture (HPCA) (318-331). DOI: 10.1109/HPCA51647.2021.00035. Online publication date: Feb-2021
    • (2020) Toward fairness in AI for people with disabilities: a research roadmap. ACM SIGACCESS Accessibility and Computing (1-1). DOI: 10.1145/3386296.3386298. Online publication date: 2-Mar-2020
    • (2020) Coordinated Priority-aware Charging of Distributed Batteries in Oversubscribed Data Centers. 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO) (839-851). DOI: 10.1109/MICRO50266.2020.00073. Online publication date: Oct-2020
    • (2020) An energy-efficient power management for heterogeneous servers in data centers. Computing. DOI: 10.1007/s00607-020-00805-w. Online publication date: 6-Apr-2020
    • (2019) Unleashing the Scalability Potential of Power-Constrained Data Center in the Microservice Era. Proceedings of the 48th International Conference on Parallel Processing (1-10). DOI: 10.1145/3337821.3337857. Online publication date: 5-Aug-2019
