Elsevier

Performance Evaluation

Volume 79, September 2014, Pages 104-119
Performance Evaluation

Reducing the expenses of geo-distributed data centers with portable containerized modules

https://doi.org/10.1016/j.peva.2014.07.007Get rights and content

Abstract

In order to maximize their profits, big IT companies need to reduce the operating expenses (OpEx) and capital expenses (CapEx) of their geo-distributed data centers. To reduce OpEx, recent studies have proposed algorithms that dynamically dispatch workload among different data center sites, to exploit the variability in local electricity price and green energy availability. However, the amount of OpEx savings achieved by workload dispatching is often limited by the local computing capacity. To reduce CapEx, data centers try to minimize the needed increases in size based on estimated workload growth. However, workload overestimation would result in a high CapEx, while underestimation would hurt business increases.

In this paper, we propose ExContainer, a novel strategy that leverages portable containerized modules to minimize both OpEx and CapEx for geo-distributed data centers. With ExContainer, containers are allocated monthly to different sites and workload is dispatched hourly, to minimize total OpEx. To cut CapEx, ExContainer finds the best trade-off between the degree of capacity expansion and the number of needed new containers to handle the workload peak. We evaluate ExContainer with real-world traces. Our experimental results show that compared to state-of-the-art solutions, ExContainer allows significant reduction in both OpEx and CapEx.

Introduction

In the era of cloud computing, big IT companies are rapidly expanding their data centers in order to meet the fast-increasing demands for computing. In order to achieve the maximized profits, those companies need to minimize the operating expenses (OpEx) and capital expenses (CapEx) of their geo-distributed data centers.

Dynamic workload dispatching has been recently proposed as an effective way to reduce data center OpEx. For example, it is well-known that different geographical locations may have different and time-varying electricity prices, so Internet requests can be dynamically distributed to different data center sites to take advantage of the lower prices in certain locations  [1], [2]. Likewise, some data centers may have on-site renewable energy generators (e.g., solar or wind farms) that can provide low-cost green energy. As a result, requests can be dynamically dispatched based on green energy availability as well for reduced OpEx  [3], [4]. Similarly, requests can also be dispatched based on the ambient temperatures of different data centers, because the ambient temperature can significantly impact data center costs if advanced cooling techniques such as free cooling  [5] are used. Since cooling power accounts for 30%–40% of the total power consumption of a data center  [6], the OpEx savings on cooling can also be significant.

However, the amount of OpEx savings that can be achieved by workload dispatching is often limited by the size of each data center site. This is because a data center cannot handle more workload if it already reaches the peak computing capacity (i.e., all its servers are 100% utilized), regardless of its low electricity price, abundant green energy, or cold weather. Simply increasing the servers hosted in every data center may lead to a significantly increased CapEx as we discuss later. In addition, different data centers are commonly configured to provide different services (e.g., search, email, video streaming). Hence, workload can be dispatched only to some data centers that provide the required services and have available capacities, which further limits the benefits of workload dispatching. Since different data centers rarely have their workload peaks at the same time, portable containerized modules  [7], [8], [9] can be exploited to flexibly move servers among different sites monthly based on electricity price, green energy availability, and ambient temperatures, such that hourly workload dispatching can achieve a further reduced OpEx.

In order to reduce CapEx, data centers try to minimize the needed increases in computing, cooling, and power delivery capacities based on estimated workload growth over the lifetime of the data center infrastructure (about 15 years  [6], [10]). However, it is well-known that workload growth can be highly unpredictable with significant fluctuations. As a result, workload overestimation would result in over-provisioning of data center capacities and thus a high CapEx  [10], while underestimation would hurt business increases due to insufficient capacities. Therefore, if the size of a data center can be increased incrementally with container modules to handle short-term workload peaks, the data center can wait for the demand to grow to a certain level that best justifies an upgrade of the data center power and cooling capacities. Hence, expensive upgrades can be deferred, resulting in reduced CapEx.

Recently, big IT companies such as Microsoft  [11], eBay  [12], and Google  [13] have started to adopt portable containerized modules, which are designed mainly to provide additional computing capacity for disaster recovery, sport events, or military applications. In sharp contrast to traditional (room-based) data centers, containerized modules can be quickly built (typically within 1–6 months), shipped with trucks, and deployed with power and network in a few hours  [9]. IT equipment and cooling system can be placed within one module (all-in-one, e.g., HP POD 240a, Huawei IDS1000) or within two different modules (e.g., IBM PMDC). Moreover, the cooling system is usually built with an economizer mode (i.e., free cooling), which uses outdoor air (when the temperature is lower than a certain threshold) to cool down servers. As a result, containers can generally achieve a lower Power Usage Effectiveness (PUE, as low as 1.05  [8]).

In this paper, we propose ExContainer, a novel strategy that leverages portable containerized modules to minimize both OpEx1 and CapEx for geo-distributed data centers. Although containerized modules have already been used in industry, to our best knowledge, they have not yet been used to fully exploit the diversity and variability in electricity price, green energy availability, and ambient temperature across different geographical locations for minimized OpEx. Neither have they been used to optimize data center CapEx by handling short-term workload spikes and deferring the expensive upgrades of data center infrastructure. ExContainer is composed of an OpEx optimizer and a CapEx optimizer. The OpEx optimizer features a two-level approach, which allocates containers to different sites on a coarse time scale (e.g., one month) and conducts dynamic workload dispatching and server turning on/off on a fine time scale (e.g., every hour), for the minimization of electricity and network delay (from dispatcher to data center) cost. The CapEx optimizer uses containers to handle short-term workload peaks that would exceed the capacity of a data center. When the demand grows to a level that justifies an upgrade of the data center capacities, the CapEx optimizer finds the best trade-off between the degree of capacity expansion and the number of needed new containers, to minimize the total CapEx. Specifically, this paper makes the following major contributions:

  • While existing work focuses mainly on workload dispatching for reduced OpEx, we identify its two limitations: limited capacity of each site and service heterogeneity. Accordingly, we propose to leverage portable containerized modules for further reduced OpEx.

  • While current practice commonly uses containers in a stationery way as modules and only moves containers for special events, we propose to move them periodically for minimized OpEx, based on the electricity prices, green energy availability, and ambient temperatures.

  • We propose a new CapEx minimization approach to defer the expensive upgrades of data center infrastructure by finding the best trade-off between the degree of capacity expansion and the number of needed new containers.

  • We formulate three constrained optimization problems to minimize OpEx by (1) allocating containers monthly to different locations, (2) conducting hourly dynamic workload dispatching and server turning on/off, and to minimize CapEx. Our results show that compared to state-of-the-art solutions  [1], [2], [14], [6], [10], ExContainer allows significant reduction in both OpEx and CapEx.

The rest of this paper is organized as follows. Section  2 discusses the related work. Section  3 presents the overview of ExContainer. Sections  4 ExContainer OpEx optimizer, 5 ExContainer CapEx optimizer describe the detailed designs of the OpEx and CapEx optimizers, respectively. Section  6 evaluates ExContainer with real-world traces. Section  7 discusses the implications of ExContanainer and Section  8 concludes the paper.

Section snippets

Related work

Numerous approaches for workload dispatching and servers turning on/off have been explored in literature.  [1], [14], [15], [16], focus on developing algorithms to maximize the usage of green energy or to balance the load to achieve energy savings for distributed data centers.  [2], [4], [17], [18], [19] minimize the electricity bill by exploiting the hourly fluctuations of electricity price or capping a monthly cost budget. Server room temperature is considered in  [20] while the impact of

Overview of ExContainer

In this work, we consider a big IT company that has a network of data centers located in different places. For example, Google owns seven data centers in the US and 13 in total globally. Depending on the IT company, each data center site may exploit green energy by (1) having on-site renewable (e.g., wind or solar) power generators that can provide low-cost renewable energy, or (2) purchasing renewable energy from off-site. In this work, we assume the first case even though ExContainer can be

ExContainer OpEx optimizer

In this section we describe the power models and the optimization problems for the two components of the OpEx optimizer: the container allocator and the workload dispatcher.

ExContainer CapEx optimizer

In this section, we first formulate the CapEx optimization problem. We then describe the optimization algorithm, and third we define the cost models in detail.

The CapEx of building a data center includes the expenses of servers (CS), cooling (CCool) and power system (CP), server room (CB), and eventually the cost for buying containers (CCont) according to a well established CapEx depreciation cost model explained in  [6]: CapExi=CS+CCool+CP+CB+CCont. The model to derive the CapEx for containers

Simulation results

In this section, we evaluate ExContainer using real-world traces of the workload and the influential factors, such as electricity price, green energy and ambient temperature.

Discussion

In this section, we discuss implications of our study for the dynamic allocation of containerized data centers. First, we discuss transportation-related concerns and data management of containers. Second we discuss the possible extra costs that may be associated with the transportation of the containerized modules.

Typical transportation-related concerns include the risks associated with deploying modules, the environmental impact, and the effect of delays on costs. Containerized data centers

Conclusions

A major limitation of dynamic workload dispatching is the processing capacity of each data center site, which can be addressed by moving containers around for increased capacities. In addition, the expensive upgrades of the power and cooling infrastructures of a data center can also be delayed with container deployment. In this paper, we have presented ExContainer, a novel strategy that leverages portable containerized modules to minimize both OpEx and CapEx for geo-distributed data centers.

Marco Brocanelli is a Ph.D. student with the Department of Electrical and Computer Engineering at the Ohio State University, Columbus. He received his M.S. degree in Control Engineering at University of Rome Tor Vergata, Italy, in 2011. He received his B.S. degree in Control Engineering at University of Rome Tor Vergata, Italy, in 2008. He has been a J1 visiting scholar at Ohio State University between October 2010 and April 2011–October 2011 and May 2012, working on Hypersonic Vehicle

References (53)

  • Z. Liu, M. Lin, A. Wierman, S.H. Low, L.L.H. Andrew, Greening geographical load balancing, in: SIGMETRICS’,...
  • A. Qureshi, R. Weber, H. Balakrishnan, J. Guttag, B. Maggs, Cutting the electricity bill for internet-scale systems,...
  • Z. Liu, Y. Chen, C. Bash, A. Wierman, D. Gmach, Z. Wang, M. Marwah, C. Hyser, Renewable and cooling aware workload...
  • Y. Zhang, Y. Wang, X. Wang, Electricity bill capping for cloud-scale data centers that impact the power markets, in:...
  • J. Niemann, J. Bean, V. Avelar, Economizer modes of data center cooling systems,...
  • L.A. Barroso, U. Hlzle, The datacenter as a computer, An Introduction to the Design of Warehouse-Scale Machines, Mark...
  • IBM, Portable modular data centers—add capacity where and when you need, White Paper,...
  • T.P. Morgan, Hp welds data center containers into EcoPODs, 2011....
  • M. Industria, Mdc plug & play POP,...
  • N. Rasmussen, Avoiding costs from oversizing data center and network room infrastructure, White Paper,...
  • I. Fried, Inside one of the world’s largest data centers, 2009....
  • R. Miller, Ebay containers push frontiers of efficiency, 2013....
  • R. Miller, Googles infrastructure boom continues expansion ahead in oregon, 2013....
  • Y. Zhang, Y. Wang, X. Wang, Greenware: greening cloud-scale data centers to maximize the use of renewable energy,...
  • Y. Gao, Z. Zeng, X. Liu, P.R. Kumar, The answer is blowing in the wind: analysis of powering internet data centers with...
  • G. Chen, W. He, J. Liu, S. Nath, L. Rigas, L. Xiao, F. Zhao, Energy-aware server provisioning and load dispatching for...
  • W. Fang, Q. Zhou, Y. An, Y. Li, H. Zhang, Cost-aware workload dispatching and server provisioning for distributed cloud...
  • J. Luo, L. Rao, X. Liu, Temporal load balancing with service delay guarantees for data center energy cost optimization,...
  • T. Sakamoto, H. Yamada, H. Horie, K. Kono, Energy price driven request dispatching for cloud data centers, in: CLOUD,...
  • Z. Abbasi, G. Varsamopoulos, S.K.S. Gupta, Thermal aware server provisioning and workload distribution for internet...
  • I. Goiri, K. Le, J. Guitart, J. Torres, R. Bianchini, Intelligent placement of datacenters for internet services, in:...
  • R. Singh, P. Shenoy, K.K. Ramakrishnan, R. Kelkar, H. Vin, eTransform: transforming enterprise data centers by...
  • J. Koomey, K. Brill, P. Turner, J. Stanley, B. Taylor, A simple model for determining true total cost of ownership for...
  • N. Rasmussen, S. Niles, Data center projects: growth model, Schneider Electric White Paper,...
  • N. Rasmussen, S. Niles, Towards the design and operation of net-zero energy data centers, iTherm,...
  • S. Govindan, A. Sivasubramaniam, B. Urgaonkar, Benefits and limitations of tapping into stored energy for datacenters,...
  • Cited by (1)

    • Sustainable Cloud Data Centers: A survey of enabling techniques and technologies

      2016, Renewable and Sustainable Energy Reviews
      Citation Excerpt :

      Moreover, VM based workload migration and consolidation techniques are utilized to pack a set of VMs to fewer number of physical devices to balance renewable power generation and workload demand [35]. Researchers have leveraged both MDC designs and VM migration techniques to efficiently harness renewable energy resources and waste heat utilization alternatives in green CDCs [36–38]. However, the cost, in terms of network delay and energy consumption, between geo- dispersed nodes is the foremost challenge to VM based workload migrations in CDCs.

    Marco Brocanelli is a Ph.D. student with the Department of Electrical and Computer Engineering at the Ohio State University, Columbus. He received his M.S. degree in Control Engineering at University of Rome Tor Vergata, Italy, in 2011. He received his B.S. degree in Control Engineering at University of Rome Tor Vergata, Italy, in 2008. He has been a J1 visiting scholar at Ohio State University between October 2010 and April 2011–October 2011 and May 2012, working on Hypersonic Vehicle non-linear Control.

    Wenli Zheng is a Ph.D. student with the Department of Electrical and Computer Engineering at the Ohio State University, Columbus. He received his M.S. degree in Computer Engineering from University of Southern California, Los Angeles, California, in 2012, and his B.S. degree in Electrical and Computer Engineering from Shanghai Jiao Tong University, Shanghai, China, in 2010.

    Xiaorui Wang is an Associate Professor in the Department of Electrical and Computer Engineering at The Ohio State University. He is the recipient of the Office of Naval Research (ONR) Young Investigator (YIP) Award in 2011, the NSF CAREER Award in 2009, the Power-Aware Computing Award from Microsoft Research in 2008, and the IBM Real-Time Innovation Award in 2007. He also received the Best Paper Award from the 29th IEEE Real-Time Systems Symposium (RTSS) in 2008. Prior to joining Ohio State, he was an Assistant Professor at the University of Tennessee, Knoxville, where he received the EECS Early Career Development Award, the Chancellor’s Award for Professional Promise, and the College of Engineering Research Fellow Award in 2008, 2009, and 2010, respectively, as well as one-year early promotion with tenure. In 2005, he worked at the IBM Austin Research Laboratory, designing power control algorithms for high-density computer servers. From 1998 to 2001, he was a senior software engineer and then a project manager at Huawei Technologies Co. Ltd., China, developing distributed management systems for optical networks. He received his doctoral degree from Washington University in St. Louis. His research interests include computer systems, computer architecture, data center power management, embedded and real-time systems, and cyber–physical systems. He is an author or coauthor of more than 80 refereed journal and conference publications.

    This work was supported, in part, by the  US NSF under grants  CNS-1143607 (CAREER Award) and  CCF-1331712.

    View full text