Abstract
Experimental design of parallel computers calls for quantifiable methods to compare and evaluate the requirements of different workloads within an application domain. Such metrics can help establish the basis for scientific design of parallel computers driven by application needs, to optimize performance to cost. In this work, a parallelism-based framework is presented for representing and comparing workloads, based on the way they would exercise parallel machines. This method is architecture-invariant and can be used effectively for the comparison of workloads and assessing resource requirements. Our workload characterization is derived from parallel instruction centroid and parallel workload similarity. The centroid is a workload approximation which captures the type and amount of parallel work generated by the workload on the average. The centroid is an efficient measure which aggregates average parallelism, instruction mix, and critical path length. When captured with abstracted information about communication requirements, the result is a powerful tool in understanding the requirements of workloads and their potential performance on target parallel machines. The parallel workload similarity is based on measuring the normalized Euclidean distance (ned), which provides an efficient means of comparing workloads, between workload centroids. This provides the basis for quantifiable analysis of workloads to make informed decisions on the composition of parallel benchmark suites. It is shown that this workload characterization method outperforms comparable ones in accuracy, as well as in time and space requirements. Analysis of the NAS Parallel Benchmark workloads and their performance is presented to demonstrate some of the applications and insight provided by this framework. The parallel-instruction workload model is used to study the similarities among the NAS Parallel Benchmark workloads in a quantitative manner. The results confirm that workloads in NPB represent a wide range of non-redundant benchmarks with different characteristics.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
R. Hockney, The Science of Computer Benchmarking, SIAM Publisher, Philadelphia, PA., 1996.
G.S. Tjaden and M.J. Flynn, “Detection and Parallel Execution of Independent Instructions,” IEEE Transactions on Computers, vol. C-19, no. 10, pp. 889–895, October 1970.
A. Nicolau and J.A. Fisher, “Measuring the Parallelism Available for Very Long Instruction Word Architectures,” IEEE Transactions on Computers, vol. 33, no. 11, pp. 968–976, Nov. 1984.
M. Butler, T. Yeh, Y. Patt, M. Alsup, H. Scales, and M. Shebanow, “Single Instruction Stream Parallelism Is Greater Than Two,” Proc. of the 8th Annual Symp. on Comp. Arch., pp. 276–286, May 1991.
Kumar, “Measuring Parallelism in Computation-Intensive Scientific/Engineering Applications,” IEEE Transactions on Computers, vol. C-37, no. 9, pp. 1088–1098, Sep. 1988.
T. Sterling, T. El-Ghazawi, A. Meajil and A. Ozkaya, “NASA Science Workload Characterization for Scalable Parallel Computer Architectures,” Supercomputing 93, technical program, 1993, pp. 78.
K.B. Theobald, G.R. Gao, and L.J. Hendren, “On the Limits of Program Parallelism and its Smoothability” Proc. of the 25th Annual International Symp. on Micro-Architecture (MICRO-25), pp. 10–19, Portland, Oregon, December 1992.
T. Conte and W. Hwu, “Benchmark Characterization,” IEEE Computer, pp. 48–56, January 1991.
M. Calzarossa, and G. Serazzi, “Workload Characterization for Supercomputer,” Performance Evaluation of Supercomputers, J.L. Martin (editor), pp. 283–315, North-Holland, 1988.
J. Martin, “Performance Evaluation of Supercomputers and Their Applications,” Parallel Systems and Computation, G. Paul and G. Almasi (Editors), pp. 221–235, North-Holland, 1988.
D. Bradley and J. Larson, “A Parallelism-Based Analytic Approach to Performance Evaluation Using Application Programs,” Proceedings of the IEEE, vol., 81, no. 8, pp. 1126–1135, August 1993.
A.I. Meajil, “An Architecture-Independent Workload Characterization Model for Parallel Computer Architectures,” Technical Report No. GWU-IIST 96-12, Department of Electrical Engineering and Computer Science, George Washington University, July 1996.
K.B. Theobald, G.R. Gao, and L.J. Hendren, “Speculative Execution and Branch Prediction on Parallel Machines,” ACAPS Technical Memo 57, McGill University, December 21, 1992.
D. Bailey et al. “The NAS Parallel Benchmarks,” RNR Technical Report RNR-94-007, March 1994, NASA Ames Research Center, Moffett Field, CA.
G. Irlam, The Spa package, version 1.0, October 1991.
The SPARC Architecture Manual, Version 8, SPARC Int'l, Inc., Menlo Park, CA, 1991.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1996 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Meajil, A.I., El-Ghazawi, T., Sterling, T. (1996). A quantitative approach for architecture-invariant parallel workload characterization. In: Waśniewski, J., Dongarra, J., Madsen, K., Olesen, D. (eds) Applied Parallel Computing Industrial Computation and Optimization. PARA 1996. Lecture Notes in Computer Science, vol 1184. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-62095-8_55
Download citation
DOI: https://doi.org/10.1007/3-540-62095-8_55
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-62095-2
Online ISBN: 978-3-540-49643-4
eBook Packages: Springer Book Archive