Abstract
In this paper, we document our experimental work on automatically parallelizing the SPEC2000FP benchmarks for SMP machines. This is not purely a research project: it was implemented within IBM’s software laboratory in a commercial compiler infrastructure that implements the OpenMP 2.0 specifications for both Fortran and C/C++. From the beginning, our emphasis has been on using simple parallelization techniques. We aim to maintain a good trade-off between the performance of an application program, especially its scalability, and its compilation time. Although the parallelization results show relatively low speedup, they are still promising considering the problems associated with explicit parallel programming and the fact that more and more multi-threaded and multi-core chips will soon be available, even for home computing.
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, G., Unnikrishnan, P., Ren, J. (2005). Experiments with Auto-Parallelizing SPEC2000FP Benchmarks. In: Eigenmann, R., Li, Z., Midkiff, S.P. (eds) Languages and Compilers for High Performance Computing. LCPC 2004. Lecture Notes in Computer Science, vol 3602. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11532378_25
DOI: https://doi.org/10.1007/11532378_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28009-5
Online ISBN: 978-3-540-31813-2
eBook Packages: Computer Science (R0)