Irregular Assignment Computations on cc-NUMA Multiprocessors

Arenaz, Manuel; Touriño, Juan; Doallo, Ramón

doi:10.1007/3-540-47847-7_33

Conference paper
First Online: 01 January 2002

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2327)

Abstract

This paper addresses the parallelization of loops with irregular assignment computations on cc-NUMA multiprocessors. This loop pattern is characterized by loop-carried output data dependences that can only be detected at run time. A parallelization technique based on the inspector-executor model is proposed. In the inspector stage, loop iterations are reordered so that they can be executed without conflicts during the executor stage. The design of the inspector ensures load balancing and exploits the locality of data writes within each processor. Experimental results show the scalability of this technique, which is presented as a clear alternative to existing methods.
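As a rough illustration of the loop pattern and the inspector-executor idea summarized in the abstract, the following Python sketch may help. It is not the authors' algorithm, whose details are not given on this page. It assumes a loop of the form `a[f[i]] = rhs[i]`, where the subscript array `f` is only known at run time, and it assumes a simple block partition of the array among threads. The names `sequential`, `inspector`, `executor`, and `n_threads` are hypothetical. The inspector here keeps, for each array element, only the last iteration that writes it, and groups those iterations by the owner of the written element, so the groups never write to the same location. This sketch makes no attempt at the load balancing mentioned in the abstract.

```python
def sequential(a, f, rhs):
    """Reference loop: later iterations overwrite earlier writes to the same element."""
    for i in range(len(f)):
        a[f[i]] = rhs[i]
    return a


def inspector(f, n_array, n_threads):
    """Build one conflict-free iteration list per (hypothetical) thread.

    Assumption: array elements are split into contiguous blocks, one per thread.
    For each element, only the last iteration that writes it is kept.
    """
    block = -(-n_array // n_threads)  # ceiling division: elements per block
    last_writer = [dict() for _ in range(n_threads)]
    for i, target in enumerate(f):
        owner = target // block
        last_writer[owner][target] = i  # a later iteration replaces an earlier one
    # Each list holds the iterations one thread would run; targets never overlap.
    return [sorted(writes.values()) for writes in last_writer]


def executor(a, f, rhs, schedule):
    """Run the reordered iterations. Each inner list could run on its own thread."""
    for iterations in schedule:
        for i in iterations:
            a[f[i]] = rhs[i]
    return a
```

For example, with `f = [2, 0, 2, 5, 0, 3]` and two threads, iterations 0 and 1 are dropped because iterations 2 and 4 later overwrite the same elements, and the remaining iterations are split into two groups that touch disjoint halves of the array. The executor then produces the same array as the sequential loop.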

Author information

Authors and Affiliations

Computer Architecture Group Department of Electronics and Systems, University of A Coruña, Campus de Elviña, s/n, 15071, A Coruña, Spain
Manuel Arenaz, Juan Touriño & Ramón Doallo

Editor information

Editors and Affiliations

Institute of Software Science, University of Vienna, Liechtensteinstr. 22, 1090, Vienna, Austria
Hans P. Zima
Department of Information and Computer Science, Nara Women’s University, Kitauoyanishimachi, Nara City, 630-8506, Japan
Kazuki Joe
Institute of Information Science and Electronics, University of Tsukuba, Tenno-dai 1-1-1, Tsukuba, Ibaraki, 305-8577, Japan
Mitsuhisa Sato
Internet Systems Research Laboratories, NEC Corporation, 4-1-1, Miyazaki, Miyamae, Kawasaki, Kanagawa, 216-8555, Japan
Yoshiki Seo
Kyoto University, Yoshidahonmachi, Sakyo-ku, Kyoto, 606-8501, Japan
Masaaki Shimasaki

About this paper

Cite this paper

Arenaz, M., Touriño, J., Doallo, R. (2002). Irregular Assignment Computations on cc-NUMA Multiprocessors. In: Zima, H.P., Joe, K., Sato, M., Seo, Y., Shimasaki, M. (eds) High Performance Computing. ISHPC 2002. Lecture Notes in Computer Science, vol 2327. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-47847-7_33

DOI: https://doi.org/10.1007/3-540-47847-7_33
Published: 29 April 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43674-4
Online ISBN: 978-3-540-47847-8
eBook Packages: Springer Book Archive