skip to main content
10.1145/1542275.1542322acmconferencesArticle/Chapter ViewAbstractPublication PagesicsConference Proceedingsconference-collections
research-article

Evaluating high performance communication: a power perspective

Authors Info & Claims
Published:08 June 2009Publication History

ABSTRACT

Recently, high speed interconnects capable of remote direct memory access (RDMA) such as InfiniBand and iWARP have gained considerable popularity due to their superb latency and bandwidth. Most existing studies about RDMA have focused mainly on its performance aspect. However, as power management has become essential for high-end systems such as enterprise servers and high performance computing nodes which are often equipped with RDMA capable network adapters, it is very important for us to take a fresh look at the benefits of RDMA from the power perspective.

In the paper, we provide a detailed empirical study of the benefits of RDMA in terms of power savings compared with traditional communication protocols such as TCP/IP. We used two popular RDMA adapters in our evaluations: Mellanox ConnectX InfiniBand HCAs and Chelsio T3 10GE RNICs. In order to isolate the impact of communication on power consumption, our evaluation focused on using micro-benchmarks which perform different communication patterns. We have also studied several important factors that may have an impact on the performance and the power consumption of RDMA adapters such as the use of polling versus blocking, CPU speeds, and extra memory copies.

We show that using high speed RDMA adapters can result in significant amount of power consumption during communication. (In one test, the system power has increased by as much as 50 watts, or over 30% of the idle power.) We found that RDMA generally has better power efficiency compared to that of TCP/IP, especially for communication intensive phases, for example when large messages are transferred. The power savings of RDMA are achieved by minimizing the interactions between the network adapters and other system components such as the CPUs and the memory: Although nearly the same amount of data must be going through the network adapters for both RDMA and TCP/IP, RDMA requires much fewer CPU cycles for protocol processing and also generates less memory bus traffic, both of which contribute to its power savings.

Overall, our research demonstrated that RDMA not only provides high communication performance, but also offers excellent power efficiency, making it a desirable choice in environments that have strict power/energy constraints and demand high communication performance.

References

  1. Intelligent Platform Management Interface Specifications. http://www.intel.com/design/servers/ipmi/spec.htm.Google ScholarGoogle Scholar
  2. IP over InfiniBand Working Group. http://www.ietf.org/-html.charters/ipoib-charter.html.Google ScholarGoogle Scholar
  3. M. Anand, E. B. Nightingale, and J. Flinn. Self-tuning wireless network power management. In Proceedings of the Ninth Annual International Conference on Mobile Computing and Networking (MOBICOM'03), Sept. 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. P. Balaji, W. Feng, and D. Panda. Bridging the Ethernet-Ethernot Performance Gap. IEEE MICRO, 26(3):24, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. P. Balaji, H. Shah, and D. Panda. Sockets vs RDMA Interface over 10-Gigabit Networks: An In-depth analysis of the Memory Traffic Bottleneck. In RAIT Workshop 2004.Google ScholarGoogle Scholar
  6. C. Bell, D. Bonachea, Y. Cote, J. Duell, P. Hargrove, P. Husbands, C. Iancu, M. Welcome, and K. Yelick. An evaluation of current high-performance networks. In International Parallel and Distributed Processing Symposium (IPDPS'03), April 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. D. Bertozzi, A. Raghunathan, L. Benini, and S. Ravi. Transport protocol optimization for energy efficient wireless embedded systems. In Proceedings of the Conference on Design Automation and Test in Europe (DATE'03), Mar. 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. R. Bianchini and R. Rajamony. Power and Energy Management for Server Systems. COMPUTER, pages 68--76, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. W. Bircher and L. John. Complete System Power Estimation: A Trickle-Down Approach Based on Performance Events. In Performance Analysis of Systems & Software, 2007. ISPASS 2007. IEEE International Symposium on, pages 158--168, 2007.Google ScholarGoogle Scholar
  10. R. Brightwell, D. Doerfler, and K. Underwood. A comparison of 4X InfiniBand and Quadrics Elan-4 technologies. In Cluster Computing, 2004 IEEE International Conference on, pages 193--204, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. G. Buzzard, D. Jacobson, M. Mackey, S. Marovich, and J. Wilkes. An Implementation of the Hamlyn Sender-Managed Interface Architecture. ACM SIGOPS Operating Systems Review, 30(si):245--259, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. J. Chase, D. Anderson, P. Thakar, A. Vahdat, and R. Doyle. Managing energy and server resources in hosting centers. In Proceedings of the eighteenth ACM symposium on Operating systems principles, pages 103--116, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Chelsio Communications. Chelsio 10GbE NICs. http://www.chelsio.com.Google ScholarGoogle Scholar
  14. P. Culley, U. Elzur, R. Recio, S. Bailey, and J. Carrier. Marker PDU Aligned Framing for TCP Specification. http://www.rdmaconsortium.org/home/draft-culley-iwarp-mpa-v1.0.pdf.Google ScholarGoogle Scholar
  15. D. Dalessandro, P. Wyckoff, O. Center, and O. Springfield. A Performance Analysis of the Ammasso RDMA Enabled Ethernet Adapter and its iWARP API. Cluster Computing, 2005. IEEE International, pages 1--7, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  16. C. Dubnicki, A. Bilas, Y. Chen, S. Damianakis, and K. Li. VMMC-2: Efficient Support for Reliable, Connection-Oriented Communication. In Proc. Hot Interconnects Conf., Aug, 1997.Google ScholarGoogle Scholar
  17. L. Feeney and M. Nilsson. Investigating the energy consumption of a wireless network interface in an ad hoc networking environment. In Proceedings of the Twentieth Annual Joint Conference of the IEEE Computer and Communications Societies (INFOCOM'01), Apr. 2001.Google ScholarGoogle ScholarCross RefCross Ref
  18. W. Feng. Making a Case for Efficient Supercomputing. Queue, 1(7):54--64, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. W. Feng, P. Balaji, C. Baron, L. Bhuyan, D. Panda, C. Sci, and E. Riverside. Performance Characterization of a 10-Gigabit Ethernet TOE. In Proceedings of the IEEE International Symposium on High-Performance Interconnects (HotI), 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. V. W. Freeh and D. K. Lowenthal. Using multiple energy gears in MPI programs on a power-scalable cluster. In Proceedings of the tenth ACM SIGPLAN symposium on Principles and practice of parallel programming, pages 164--173, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. V. W. Freeh, F. Pan, N. Kappiah, D. K. Lowenthal, and R. Springer. Exploring the Energy-Time Tradeoff in MPI Programs on a Power-Scalable Cluster. In Parallel and Distributed Processing Symposium, 2005. Proceedings. 19th IEEE International, pages 4a--4a, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. J. Hilland, P. Culley, J. Pinkerton, and R. Recio. RDMA Protocol Verbs Specification. http://www.rdmaconsortium.org/home/draft-hilland-iwarp-verbs-v1.0-RDMAC.pdf.Google ScholarGoogle Scholar
  23. C. Hsu and W. Feng. A power-aware run-time system for high-performance computing. In Proceedings of the 2005 ACM/IEEE conference on Supercomputing, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. J. Hurwitz and W. Feng. End-to-end performance of 10-gigabit Ethernet on commodity systems. Micro, IEEE, 24(1):10--22, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. InfiniBand Trade Association. InfiniBand Architecture Specification, Release 1.2.Google ScholarGoogle Scholar
  26. C. Isci and M. Martonosi. Runtime Power Monitoring in High-End Processors: Methodology and Empirical Data. In Proceedings of the 36th annual IEEE/ACM International Symposium on Microarchitecture. IEEE Computer Society Washington, DC, USA, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. C. Isci and M. Martonosi. Phase characterization for power: Evaluating control-flow-based and event-counter-based techniques. In Proceedings of the Twelfth International Symposium on High-Performance Computer Architecture (HPCA06), February, 2006.Google ScholarGoogle ScholarCross RefCross Ref
  28. N. Kappiah, V. W. Freeh, and D. K. Lowenthal. Just In Time Dynamic Voltage Scaling: Exploiting Inter-Node Slack to Save Energy in MPI Programs. In Proc. of IEEE/ACM Supercomputing 2005: High Performance Computing, Networking Storage, and Analysis Conference, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. C. Lefurgy, K. Rajamani, F. Rawson, W. Felter, M. Kistler, and T. Keller. Energy Management for Commercial Servers. COMPUTER, pages 39--48, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. M. Lim, V. W. Freeh, and D. K. Lowenthal. Adaptive, transparent frequency and voltage scaling of communication phases in mpi programs. IEEE/ACM Supercomputing, November, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. J. Liu, B. Chandrasekaran, J. Wu, W. Jiang, S. Kini, W. Yu, D. Buntinas, P. Wyckoff, and D. K. Panda. Performance Comparison of MPI Implementations over InfiniBand Myrinet and Quadrics. In Supercomputing 2003: The International Conference for High Performance Computing and Communications, Nov. 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. J. Liu, B. Chandrasekaran, W. Yu, J. Wu, D. Buntinas, S. Kini, D. Panda, and P. Wyckoff. Microbenchmark Performance Comparison of High--Speed Cluster Interconnects. IEEE MICRO, pages 42--51, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Mellanox Technologies. http://www.mellanox.com.Google ScholarGoogle Scholar
  34. OpenFabrics Alliance. OpenFabrics Alliance Web Site. http://www.openfabrics.org/.Google ScholarGoogle Scholar
  35. V. Pandey, W. Jiang, Y. Zhou, and R. Bianchini. DMA-Aware Memory Energy Management. In Proceedings of HPCA, February, 2006.Google ScholarGoogle ScholarCross RefCross Ref
  36. M. Pettersson. perfctr patches. http://user.it.uu.se/ mikpe/linux/perfctr/.Google ScholarGoogle Scholar
  37. J. Pinkerton. The Case for RDMA. http://www.rdmaconsortium.org/.Google ScholarGoogle Scholar
  38. M. Rashti and A. Afsahi. 10-Gigabit iWARP Ethernet: Comparative Performance Analysis with InfiniBand and Myrinet-10G. In Proceedings of the International Workshop on Communication Architecture for Clusters (CAC), held in conjunction with IPDPS, volume 7, page 234.Google ScholarGoogle Scholar
  39. R. Recio, P. Culley, D. Garcia, and J. Hilland. An RDMA Protocol Specification (Version 1.0). http://www.rdmaconsortium.org/home/draft-recio-iwarp-rdmap-v1.0.pdf.Google ScholarGoogle Scholar
  40. H. Shah, J. Pinkerton, R. Recio, and P. Culley. Direct Data Placement over Reliable Transports. http://www.rdmaconsortium.org/home/draft-shah-iwarp-ddp-v1.0.pdf.Google ScholarGoogle Scholar
  41. E. Shih, P. Bahl, and M. J. Sinclair. Wake on wireless: An event driven energy saving strategy for battery operated devices. In Proceedings of the Eighth Annual International Conference on Mobile Computing and Networking (MOBICOM'02), Sept. 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Q. Snell, A. Mikler, and J. Gustafson. NetPIPE: A Network Protocol Independent Performance Evaluator. IASTED International Conference on Intelligent Information Management and Systems, 6, 1996.Google ScholarGoogle Scholar
  43. A. Tirumala, F. Qin, J. Dugan, J. Ferguson, and K. Gibbs. Iperf-The TCP/UDP bandwidth measurement tool. URL: http://dast.nlanr.net/Projects/Iperf, 2004.Google ScholarGoogle Scholar
  44. R. Zamani, A. Afsahi, Y. Qian, and C. Hamacher. A Feasibility Analysis of Power-Awareness and Energy Minimization in Modern Interconnects for High-Performance Computing. In Proc. of IEEE Cluster 2007, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Evaluating high performance communication: a power perspective

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      ICS '09: Proceedings of the 23rd international conference on Supercomputing
      June 2009
      544 pages
      ISBN:9781605584980
      DOI:10.1145/1542275

      Copyright © 2009 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 8 June 2009

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate584of2,055submissions,28%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader