Towards green data centers: A comparison of x86 and ARM architectures power efficiency

https://doi.org/10.1016/j.jpdc.2012.08.005Get rights and content

Abstract

Servers and clusters are fundamental building blocks of high performance computing systems and the IT infrastructure of many companies and institutions. This paper analyzes the feasibility of building servers based on low power computers through an experimental comparison of server applications running on x86 and ARM computer architectures. The comparison executed on web and database servers includes power usage, CPU load, temperature, request latencies and the number of requests handled by each tested system. Floating point performance and power usage are also evaluated. The use of ARM based systems has shown to be a good choice when power efficiency is needed without losing performance.

Highlights

► We evaluate performance and power efficiency of x86 and ARM architectures. ► We present an experimental setup that can be easily replicated with low cost. ► Key differences of ARM and x86 processors are presented and discussed. ► ARM systems are more power efficient for SQL and static HTTP servers. ► x86 architecture is still more power efficient for floating point computation.

Introduction

Much has been said about green computing and energy efficient data centers recently, but most solutions are focused in enhancing current server architectures or using virtualization [20] and other techniques to consume less power in data centers. This article discusses if processors typically used in mobile devices are suitable for server and cluster applications. Considering such possibilities, we designed a series of experiments in order to compare x86 and ARM systems in the context of two classical network services applications. Several hardware platforms are evaluated, measuring their power usage, temperature and application performance. We also analyze floating point performance and its power usage using Linpack [6].

Low power consumption is historically related to embedded systems running with batteries, but over time more data centers and high performance systems are concerned with power efficiency [19], [2]. One characteristic of data centers is their huge electric power consumption [21], which is a serious problem [16]. According to Mahadevan et al., the biggest consumers of energy are servers and cooling systems [21]. Reducing power is not only interesting for ecological reasons, but also to reduce electrical bill costs. Svanfeldt-Winter et al. relate that 57% of monthly data-center costs are spent on servers’ electricity bills [36].

As designers work towards faster systems there is an increasing concern about the expected power consumption of future systems. For exascale computing performance to be achieved in the years to come, energy efficiency will be the most important constraining factor [9], [11]. This is why power consumption dominates every discussion of exascale [41], and in fact, there is still no clear roadmap to power-efficient exascale computing [19].

The technical report “The landscape of parallel computing: A view from Berkeley” [2], analyzes both extremes of computing: servers and embedded systems, and concludes that these extremes are “colliding and merging worlds”. One example is power consumption: until some years ago no one was concerned with power consumption in data centers, but today, most data centers are looking for energy efficient systems. Embedded systems, on the other hand, have always been concerned with battery life, requiring minimal power consumption. In fact, according to Jensen and Rodrigues, high performance systems must borrow concepts from embedded systems to achieve exascale performance [19].

In this work, we could verify that the use of low power servers in parallel applications is suitable. In our experiments, while power consumption goes down, performance stays almost the same for x86 based architectures typically used to build clusters. We find out that processing clusters based on low power systems such as ARM processors are feasible and that this is a nice way to decrease power usage of several server applications.

Grounded in experimental analysis, our main contributions are:

  • A low cost and reliable testing setup to analyze servers’ power efficiency;

  • A systematic and quantitative evaluation of several server architectures running web server, database server and Linpack benchmarks.

This paper is structured as follows. Section 2 presents several differences of the analyzed systems and a review of the state of the art. Section 3 describes the equipment used in the measurements and our experimental setup, while Section 4 depicts the experimental results. Section 5 discusses the results addressing our main contributions and Section 6 features our final remarks.

Section snippets

Theory

This section discusses several differences of ARM and x86 architectures from theoretical and practical points of view, focusing on power efficiency aspects of these devices.

Hardware setup

The hardware used for comparison purpose is composed of 2 notebook computers with low power x86 processors and 2 ARM based development boards. Table 1 shows the specifications for each device. We used two open source boards that can be easily acquired: the BeagleBoard-XM and PandaBoard. As a reference for the measurements, we also tested a classical machine used as a server in many scenarios. It is a HP-Z200 workstation with a quad-core Intel Xeon processor.

Platforms Turion and Atom have a dual

Results

This section presents the results obtained from our measurements. As explained in Section 3 the systems were prepared to have minimal or no disk I/O during the tests. I/O wait was also collected in parallel to every measurement that is executed, but we decided to omit all I/O wait values in the graphs because the worst measured I/O wait is 2.8%. All the tests were executed according to the measurement technique presented in Section 3.

Another observation regards network bandwidth. A series of

Discussion

According to Cameron [4], one of the challenges of green IT is the lack of more efficient hardware and software. As an answer, our experiments show that it is viable to build clusters and servers of low power devices to act as energy efficient production systems.

One possible solution to decrease data centers power usage is to put x86 servers to sleep and wake them up with Wake On LAN as the computing demand increases. Although this approach would certainly reduce power usage, servers in sleep

Conclusions

In this paper we present a low cost testing system to analyze computers’ power efficiency that can be easily reproduced by other authors. Using this test setup we systematically analyze and compare several ARM and x86 devices in typical server and number crunching tasks: web server, database server and floating point computation. In the comparisons we show temperature, CPU and power usage for each device at different load scenarios. We also show performance per Watt metrics and the latency to

Acknowledgment

The authors would like to thank the support from the National Research Council (CNPq), the Brazilian sponsoring Agency for research.

Rafael Vidal Aroca holds an Informatics degree and a master of sciences degree in mechatronics engineering both from the University of São Paulo (USP). He is a Ph.D. student at the Electrical and Computing Engineering Graduate Program of Federal University of Rio Grande do Norte, Brazil. He has over 10 years of industry experience in embedded systems, IT systems and servers administration. His main research interests are in Embedded Systems, Operating Systems and Robotics. Rafael is an IEEE

References (42)

  • S. Anthony, Intel Medfield 32 nm Atom SoC power consumption, specs, and benchmarks leak, Extreme Tech. On-Line. Dec 17...
  • K. Asanovic, R. Bodik, B.C. Catanzaro, J.J. Gebis, P. Husbands, K. Keutzer, D.A. Patterson, W.L. Plishker, J. Shalf,...
  • D. Athow, Intel details roadmap, 14 nm Atom SoC To Come in 2014, IT Pro Portal. On-Line. Feb 28 2012. Access date:...
  • K.W. Cameron

    Green with envy

    Computer

    (2010)
  • A.P. Chandrakasan et al.

    Low-power CMOS digital design, solid-state circuits

    IEEE Journal of Solid State Circuits

    (1992)
  • J. Dongarra, Performance of various computers using standard linear equations software, University of Tennessee,...
  • J. Fitzpatrick

    An interview with Steve Furber

    Communications of ACM

    (2011)
  • I. Foster

    Designing and Building Parallel Programs

    (1995)
  • K. Fuerlinger, The AppleTV cluster: energy efficient parallel computing on consumer electronic devices. 12 August 2011....
  • K. Fuerlinger, C. Klausecker, D. Kranzlmller, The AppleTV-Cluster: towards energy efficient parallel computing on...
  • K. Fuerlinger, C. Klausecker, D. Kranzlmller, Towards energy efficient parallel computing on consumer electronic...
  • R. Gruber et al.

    HPC@Green IT: Green High Performance Computing Methods

    (2010)
  • S. Gupta, Path to exascale computing rests with heterogeneous design. Nvidia Corporation. On-Line. November 8 2011....
  • S. Gupta, Worlds first ARM-based supercomputer to launch in Barcelona. Nvidia Corporation. On-Line. November 14 2011....
  • K.R. Hoffman, P. Hedge, ARM Cortex-A8 vs. Intel Atom: architectural and benchmark comparisons. Technical Report....
  • IDC executive brief, heterogeneous computing: a new paradigm for the exascale era (Adapted from IDC HPC end-user study...
  • C. Isen et al.

    A tale of two processors: revisiting the RISC-CISC debate

  • T. Jamil

    RISC versus CISC: why less is more

    IEEE Potentials

    (1995)
  • D. Jensen et al.

    Embedded systems and exascale computing

    Computing in Science Engineering

    (2010)
  • V. Kumar et al.

    Towards better performance per watt in virtual environments on asymmetric single-isa multi-core systems

    ACM SIGOPS Operating Systems Review

    (2009)
  • P. Mahadevan et al.

    On energy efficiency for enterprise and data center networks

    IEEE Communications Magazine

    (2011)
  • Cited by (52)

    • Performance analysis and optimization for SpMV based on aligned storage formats on an ARM processor

      2021, Journal of Parallel and Distributed Computing
      Citation Excerpt :

      With the debut of the ARMv8-A architecture, ARM has become a key research object for solving energy consumption in high-performance computing. Moreover, compared to the previous ARM architecture, 64-bit ARMv8-A benefits from increased support of double-precision and SIMD operations [9], making it more compelling [2,6,11,17]. Many companies worldwide have purchased an ARMv8-A license and have developed their cores based on ARM.

    • SegaNet: An Advanced IoT Cloud Gateway for Performant and Priority-Oriented Message Delivery

      2023, Proceedings of the 7th Asia-Pacific Workshop on Networking, APNET 2023
    View all citing articles on Scopus

    Rafael Vidal Aroca holds an Informatics degree and a master of sciences degree in mechatronics engineering both from the University of São Paulo (USP). He is a Ph.D. student at the Electrical and Computing Engineering Graduate Program of Federal University of Rio Grande do Norte, Brazil. He has over 10 years of industry experience in embedded systems, IT systems and servers administration. His main research interests are in Embedded Systems, Operating Systems and Robotics. Rafael is an IEEE member.

    Luiz Marcos Garcia Gonçalves holds a Doctorate in systems and computing engineering from the Federal University of Rio de Janeiro and graduated in 1999. He is Associate Professor at the Computing Engineering and Automation Department of the Federal University of Rio Grande do Norte, Brazil. He is a member of the IEEE Latin American Robotics Council (since 2002) and he was the Chair for the Brazilian Committee on Robotics and on Computer Graphics and Image Processing both under the Brazilian Computer Society. His research interests are in Computer Vision, Robotics, and all aspects of Graphics Processing.

    View full text