Abstract
Irregular and dynamic memory reference patterns can cause significant performance variations for low-level algorithms in general, and for parallel algorithms in particular. We have previously shown that parallel reduction algorithms are quite input sensitive and can therefore benefit from an adaptive, reference-pattern-directed selection. In this paper we extend our previous work by detailing a systematic approach for dynamically selecting the best parallel algorithm. First, we model the characteristics of the input, i.e., its memory reference pattern, with a descriptor vector. Then, we measure the performance of several reduction algorithms for various values of the pattern descriptor. Finally, we establish a many-to-one mapping between a finite set of descriptor values and the set of algorithms, thus obtaining a performance ranking of the available algorithms with respect to a limited set of descriptor values. The actual dynamic-selection code is generated using statistical regression methods or a decision tree. We conclude with experimental results that validate our modeling and prediction techniques.
This research was supported in part by NSF CAREER Awards CCR-9624315 and CCR-9734471, NSF Grants ACI-9872126, EIA-9975018, EIA-0103742, and by the DOE ASCI ASAP program grant B347886.
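The selection scheme the abstract describes — summarize the input's memory reference pattern in a descriptor vector, then use a decision tree to map descriptor values to the reduction algorithm predicted to perform best — can be sketched as follows. This is a minimal illustration only: the descriptor fields (`connectivity`, `clustering`), the threshold values, and the algorithm names are hypothetical stand-ins, not the descriptors or decision rules from the paper.

```python
from dataclasses import dataclass


@dataclass
class PatternDescriptor:
    """Hypothetical descriptor vector characterizing a reduction's
    memory reference pattern (fields are illustrative)."""
    connectivity: float  # fraction of distinct reduction elements referenced
    clustering: float    # degree of spatial locality among references


def select_reduction_algorithm(d: PatternDescriptor) -> str:
    """Decision-tree-style many-to-one mapping from descriptor values
    to a parallel reduction algorithm (thresholds are illustrative)."""
    if d.connectivity < 0.1:
        # Few distinct elements touched: fully privatized copies are cheap.
        return "replicated_buffer"
    if d.clustering > 0.8:
        # Highly clustered references favor partitioning by output location.
        return "local_write"
    # Default: privatize only the contended elements.
    return "selective_privatization"
```

At run time such a selector would be invoked after an inspector phase computes the descriptor for the actual input, so the chosen algorithm can change from run to run (or phase to phase) as the reference pattern changes.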
References
Charmm: A program for macromolecular energy, minimization, and dynamics calculations. J. of Computational Chemistry 4(6) (1983)
Blume, W., et al.: Advanced Program Restructuring for High-Performance Computers with Polaris. IEEE Computer 29(12), 78–82 (1996)
Eigenmann, R., Hoeflinger, J., Li, Z., Padua, D.: Experience in the Automatic Parallelization of Four Perfect-Benchmark Programs. In: Banerjee, U., Nicolau, A., Gelernter, D., Padua, D.A. (eds.) LCPC 1991. LNCS, vol. 589, pp. 65–83. Springer, Heidelberg (1992)
Han, H., Tseng, C.-W.: Improving compiler and run-time support for adaptive irregular codes. In: Int. Conf. on Parallel Architectures and Compilation Techniques (October 1998)
Han, H., Tseng, C.-W.: A comparison of locality transformations for irregular codes. In: Dwarkadas, S. (ed.) LCR 2000. LNCS, vol. 1915, pp. 70–84. Springer, Heidelberg (2000)
Jain, R.: The Art of Computer Systems Performance Analysis. John Wiley & Sons, Inc., Chichester (1991)
Kruskal, C.: Efficient parallel algorithms for graph problems. In: Proc. of the 1986 Int. Conf. on Parallel Processing, August 1986, pp. 869–876 (1986)
Leighton, F.T.: Introduction to Parallel Algorithms and Architectures: Arrays, Trees, Hypercubes. Morgan Kaufmann, San Francisco (1992)
Lin, Y., Padua, D.: On the automatic parallelization of sprase and irregular fortran programs. In: Proc. of the Workshop on Languages, Compilers and Run-time Systems for Scalable Computers, Pittsburgh, PA, May 1998, pp. 41–56 (1998)
Frisch, M.J., et al.: Gaussian 94, Revision B.1. Gaussian, Inc., Pittsburgh (1995)
Mitchell, T.: Machine Learning. MIT Press/The McGraw-Hill Companies, Inc. (1997)
Nagel, L.: SPICE2: A Computer Program to Simulate Semiconductor Circuits. PhD thesis, Univ. of California (May 1975)
Pottenger, W.M.: Theory, Techniques, and Experiments in Solving Recurrences in Computer Programs. PhD thesis, CSRD, Univ. of Illinois at Urbana-Champaign (May 1997)
Quinlan, R.: C4.5 Release 8, http://www.cse.unsw.edu.au/quinlan/
Whirley, R.G., Engelmann, B.: DYNA3D: A Nonlinear, Explicit. In: Three-Dimensional Finite Element Code For Solid and Structural Mechanics, November 1993, Lawrence Livermore National Lab. (1993)
Wu, J., Saltz, J., Hiranandani, S., Berryman, H.: Runtime compilation methods for multicomputers. In: Schwetman, H.D. (ed.) Proc. of the 1991 Int. Conf. on Parallel Processing, vol. II - Software, pp. 26–30. CRC Press, Inc., Boca Raton (1991)
Yu, H., Rauchwerger, L.: Adaptive reduction parallelization. In: Proc. of the 14th ACM Int.Conf. on Supercomputing, Santa Fe, NM (May 2000)
Yu, H., Rauchwerger, L.: Run-time parallelization overhead reduction techniques. In: Proc. of the 9th Int. Conf. on Compiler Construction, CC 2000, Berlin, Germany. LNCS, vol. 1781. Springer, Heidelberg (2000)
Zima, H.: Supercompilers for Parallel and Vector Computers. ACM Press, New York (1991)
© 2005 Springer-Verlag Berlin Heidelberg
Cite this paper
Yu, H., Dang, F., Rauchwerger, L. (2005). Parallel Reductions: An Application of Adaptive Algorithm Selection. In: Pugh, B., Tseng, CW. (eds) Languages and Compilers for Parallel Computing. LCPC 2002. Lecture Notes in Computer Science, vol 2481. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11596110_13
Print ISBN: 978-3-540-30781-5
Online ISBN: 978-3-540-31612-1