skip to main content
10.1145/2063384.2063440acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
research-article

An early performance analysis of POWER7-IH HPC systems

Published: 12 November 2011 Publication History

Abstract

In this work we present a performance evaluation of the POWER7-IH processor and of integrated systems built from it. We describe the architecture of P7-IH with an emphasis on those characteristics that have a direct impact on the performance for large-scale HPC systems and applications. An important area of emphasis is the memory and communication subsystems and their impact on achievable application performance. The results from a set of micro-benchmarks are presented that include memory, communication and OS-noise characteristics. In addition the results from several production level applications are analyzed and their performance linked to the results of the micro-benchmarks through the use of accurate performance models. The models will also be employed in exploring the achievable performance of these applications on much larger systems.

References

[1]
Arimilli, B., Arimilli, R., Chung, V., Clark, S., Denzel, W., Drerup, B., Hoefler, T., Joyner, J., Lewis, J., Li, J. Ni, N., Rajamony, R. 2010. The PERCS High-Performance Interconnect. in Proc. 18 th IEEE Symp. on High Performance Interconnects, pp. 75--85.
[2]
Arimilli, B., Baumgartner, S. et. al. 2010. The IBM Power7 Hub Module: A Terabyte Interconnect Switch for High-Performance Computer Systems. Hot Chips 22.
[3]
Barker, K. J., Davis, K. Hoisie, A., Kerbyson, D. J., Lang, M., Pakin, S., Sancho, J. C. 2009. Using Performance Modeling to Design Large-Scale Systems. IEEE Computer, 42(11):42--49.
[4]
Barker, K. J., Davis, K., Hoisie, A., Kerbyson, D. J., Lang, M., Pakin, S., Sancho, J. C. 2008. Entering the Petaflop Era: The Architecture and Performance of Roadrunner, in Proc. IEEE/ACM Supercomputing (SC'08), Austin TX.
[5]
Barker, K. J., et. al., 2005. On the Feasibility of Optical Circuit Switching for High Performance Computing Systems, in Proc. IEEE/ACM Supercomputing (SC'05), Seattle, WA.
[6]
Barker, K. J., Kerbyson, D. J. 2007. Performance Analysis of an Optical Circuit Switched Network for Peta-Scale Systems. in Proc. Euro-Par 2007, Rennes, France.
[7]
Barker K. J., Kerbyson, D. J. 2011. A Performance Model of the MiLC MIMD Lattice Code, Technical Report, Pacific Northwest National Laboratory.
[8]
Bernard, C., et. al., 1991. Studying Quarks and Gluons on MIMD Parallel Computers, Int. J. of Supercomputer Applications, 5(61).
[9]
Bhatele, A., Jetley, P., Gahvari, H., Wesolowski, L., Gropp, W. D., Kale, L. V. 2011. Architectural Constraints Required to Attain 1 Exaflop/s for Scientific Applications, in Proc. Int. Parallel and Distributed Processing Symposium (IPDPS), Anchorage, AK.
[10]
Blue Waters Sustained Petascale Computing, Project Office. 2011. National Center for Supercomputing Applications, IL. http://www.ncsa.illinois.edu/BlueWaters
[11]
Donzis, D. A., Yeung, P. K., Pekurovsky, D. 2008. Turbulence simulations on O(104) processors. in Proc. TeraGrid.
[12]
Hoisie, A., Johnson, G., Kerbyson, D. J., Lang, M., Pakin, S. 2006. A Performance Comparison through Benchmarking and Modeling of Three Leading Supercomputers: Blue Gene/L, Red Storm, and Purple. in Proc. IEEE/ACM Supercomputing (SC'06), Tampa FL.
[13]
Kalla, R., Sinharoy, B. 2009. Power7: IBM's Next Generation Server Processor. Hot Chips 21.
[14]
Kerbyson, D. J., Barker, K. J., Davis, K. 2007. Analysis of the Weather Research and Forcasting (WRF) Model on Large-scale Systems. Parallel Computing: Architectures, Algorithms and Applications, IOS Press, NIC 38, Juelich, Germany, pp. 89--98.
[15]
Kerbyson, D. J. Barker, K. J. 2011. A Performance Model of Direct Numerical Simulation for Analyzing Large-Scale Systems, in Proc. Workshop on Large-Scale Parallel Processing (LSPP), Int. Parallel and Distributed Processing Symposium (IPDPS), Anchorage, AK.
[16]
Kerbyson, D. J., Barker, K. J. 2011. Analyzing the Performance Bottlenecks of the Power7-IH Network, in Proc. IEEE Cluster, Austin, TX.
[17]
Kurien, S., Taylor, M. 2005. Direct Numerical Simulation of Turbulence: Data Generation and Statistical Analysis. Los Alamos Science, Vol 29, pp. 142--151.
[18]
McCalpin, J. 1995. Memory bandwidth and machine balance in current high performance computers, in IEEE Comp. Soc. Tech. committee on Computer Architecture (TCCA) Newsletter, pp. 19--25.
[19]
National Science Foundation, "Leadership-Class System Acquisition -- Creating a Petascale Computing Environment for Science and Engineering", NSF06573, 2006.
[20]
Petrini, F., Kerbyson, D. J., Pakin, S. 2003. The Case of the Missing Supercomputer Performance: Achieving Optimal Performance on the 8,192 Processors of ASCI Q. in proc. IEEE/ACM Supercomputing (SC'03), Phoenix, AZ.
[21]
Rodrigues, R. F., et. al, 2011. The Structural Simulation Toolkit, ACM Sigmetrics Performance Evaluation Review, 38(4), pp. 37--42.
[22]
Saini, S., Naraikin, A., Biswas, R., Barkai, D., Sandstrom, T. 2009. Early Performance Evaluation of a Nehalem Cluster Using Scientific and Engineering Applications, in Proc. IEEE/ACM Supercomputing (SC'09), Portland, OR
[23]
Vaughan, C., Rajan, M., Barrett, R. F., Doerfler, D., Pedretti, K. T. 2011. Investigating the Impact of the Cielo Cray XE6 Architecture on Scientific Application Codes, in Proc. Workshop on Large-Scale Parallel Processing (LSPP), Int. Parallel and Distributed Processing Symposium (IPDPS), Anchorage, AK.
[24]
Zheng, G., Kakulapati, G., Kale, L. V. 2004. BigSim: A Parallel Simulator for Performance Prediction of Extremely Large Machines, in Proc. Int. Parallel and Distributed Processing Symposium (IPDPS), Santa Fe, NM.

Cited By

View all
  • (2016)Combining Static and Dynamic Data Coalescing in Unified Parallel CIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2015.240555127:2(381-393)Online publication date: 1-Feb-2016
  • (2016)Modeling the Impact of Silicon Photonics on Graph Analytics2016 IEEE International Conference on Networking, Architecture and Storage (NAS)10.1109/NAS.2016.7549410(1-11)Online publication date: Aug-2016
  • (2015)Performance Evaluation of Scientific Applications on POWER8High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation10.1007/978-3-319-17248-4_2(24-45)Online publication date: 18-Apr-2015
  • Show More Cited By

Index Terms

  1. An early performance analysis of POWER7-IH HPC systems

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SC '11: Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
    November 2011
    866 pages
    ISBN:9781450307710
    DOI:10.1145/2063384
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 12 November 2011

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. applications
    2. benchmarking
    3. high performance computing
    4. performance modeling

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    SC '11
    Sponsor:

    Acceptance Rates

    SC '11 Paper Acceptance Rate 74 of 352 submissions, 21%;
    Overall Acceptance Rate 1,516 of 6,373 submissions, 24%

    Upcoming Conference

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)2
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 17 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2016)Combining Static and Dynamic Data Coalescing in Unified Parallel CIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2015.240555127:2(381-393)Online publication date: 1-Feb-2016
    • (2016)Modeling the Impact of Silicon Photonics on Graph Analytics2016 IEEE International Conference on Networking, Architecture and Storage (NAS)10.1109/NAS.2016.7549410(1-11)Online publication date: Aug-2016
    • (2015)Performance Evaluation of Scientific Applications on POWER8High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation10.1007/978-3-319-17248-4_2(24-45)Online publication date: 18-Apr-2015
    • (2014)Performance Evaluation of the Intel Sandy Bridge Based NASA Pleiades Using Scientific and Engineering ApplicationsHigh Performance Computing Systems. Performance Modeling, Benchmarking and Simulation10.1007/978-3-319-10214-6_2(25-51)Online publication date: 1-Oct-2014
    • (2014)Performance Analysis of Graph Algorithms on P7IHProceedings of the 29th International Conference on Supercomputing - Volume 848810.1007/978-3-319-07518-1_7(109-123)Online publication date: 22-Jun-2014
    • (2013)The power 775 architecture at scaleProceedings of the 27th international ACM conference on International conference on supercomputing10.1145/2464996.2465435(183-192)Online publication date: 10-Jun-2013
    • (2013)Improving communication in PGAS environmentsProceedings of the 27th international ACM conference on International conference on supercomputing10.1145/2464996.2465006(129-138)Online publication date: 10-Jun-2013
    • (2013)A Theoretical Framework for Algorithm-Architecture Co-designProceedings of the 2013 IEEE 27th International Symposium on Parallel and Distributed Processing10.1109/IPDPS.2013.99(791-802)Online publication date: 20-May-2013
    • (2013)Toward a Theory of Algorithm-Architecture Co-designHigh Performance Computing for Computational Science - VECPAR 201210.1007/978-3-642-38718-0_2(4-8)Online publication date: 2013

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media