Elsevier

Parallel Computing

Volume 22, Issue 7, 1 October 1996, Pages 917-942
Parallel Computing

Practical aspect and experience
Early prediction of MPP performance: The SP2, T3D, and Paragon experiences

https://doi.org/10.1016/0167-8191(96)00034-8Get rights and content

Abstract

The performance of Massively Parallel Processors (MPPs) is attributed to a large number of machine and program factors. Software development for MPP applications is often very costly. The high cost is partially caused by a lack of early prediction of MPP performance. The program development cycle may iterate many times before achieving the desired performance level. In this paper, we present an early prediction scheme we have developed at the University of Southern California for reducing the cost of application software development. Using workload analysis and overhead estimation, our scheme optimizes the design of parallel algorithm before entering the tedious coding, debugging, and testing cycle of the applications. The scheme is generally applied at user/programmer level, not tied to any particular machine platform or any specific software environment. We have tested the effectiveness of this early performance prediction scheme by running the MIT/STAP benchmark programs on a 400-node IBM SP2 system at the Maui High-Performance Computing Center (MHPCC), on a 400-node Intel Paragon system at the San Diego Supercomputing Center (SDSC), and on a 128-node Cray T3D at the Cray Research Eagan Center in Wisconsin. Our prediction shows to be rather accurate compared with the actual performance measured on these machines. We use the SP2 data to illustrate the early prediction scheme. The main contribution of this work lies in providing a systematic procedure to estimate the computational work-load, to determine the application attributes, and to reveal the communication overhead in using these MPPs. These results can be applied to develop any MPP applications other than the STAP benchmarks by which this prediction scheme was developed.

References (29)

  • T. Fahringer

    Estimating and optimizing performance for parallel programs

    IEEE Computer

    (1995)
  • G.C. Fox et al.
  • J.L. Gustafson

    Reevaluating Amdahl's law

    Comm. ACM

    (1988)
  • R.W. Hockney

    Performance parameters and benchmarking of supercomputers

    Parallel Computing

    (1991)
  • Cited by (25)

    • Workload forecasting framework for applications in cloud

      2014, Proceedings of 2014 International Conference on Cloud Computing and Internet of Things, CCIOT 2014
    • Adaptive workload prediction of grid performance in confidence windows

      2010, IEEE Transactions on Parallel and Distributed Systems
    • Scalability of time- and space-efficient embedded Runge-Kutta solvers for distributed address space

      2009, Proceedings of the International Conference on Parallel Processing
    View all citing articles on Scopus
    View full text