Abstract
Calling functions from parallelized mathematical libraries is a common way to accelerate numerical applications. Shared-memory computer architectures support several approaches to implementing such libraries, most commonly OpenMP or MPI.
This paper compares the performance of DGEMM calls (double-precision floating-point matrix multiplication) across different OpenMP-parallelized numerical libraries, namely Intel MKL and SGI SCSL, and shows how these calls can be optimized. Additionally, we examine the memory placement policy and give hints for initializing data. The measurements focus on an SGI Altix 3700 Bx2 system and use BenchIT [1] as a convenient performance measurement suite.
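As a rough illustration of the terms used in the abstract, the C sketch below shows what a DGEMM call computes (C = alpha*A*B + beta*C) and one common way to initialize data with the same OpenMP loop schedule that the computation later uses. It is not the Intel MKL or SGI SCSL implementation and not the paper's benchmark code. The function names `dgemm_ref`, `init_rows`, and `dgemm_flops` are hypothetical, and the first-touch placement behavior mentioned in the comments is an assumption: the abstract does not say which placement policy the paper examines.

```c
#include <assert.h>
#include <stddef.h>

/* Reference DGEMM for square, row-major n x n matrices:
 * C = alpha * A * B + beta * C.
 * A naive stand-in for a tuned library routine (hypothetical helper). */
static void dgemm_ref(int n, double alpha, const double *a, const double *b,
                      double beta, double *c)
{
    assert(n > 0 && a && b && c);

    /* Static schedule: each thread handles a fixed, contiguous block of rows. */
    #pragma omp parallel for schedule(static)
    for (int i = 0; i < n; i++) {
        for (int j = 0; j < n; j++) {
            double sum = 0.0;
            for (int k = 0; k < n; k++)
                sum += a[(size_t)i * n + k] * b[(size_t)k * n + j];
            c[(size_t)i * n + j] = alpha * sum + beta * c[(size_t)i * n + j];
        }
    }
}

/* Parallel initialization using the same static row schedule as dgemm_ref.
 * ASSUMPTION: under a first-touch placement policy, each page then lands on
 * the NUMA node of the thread that will later work on those rows. */
static void init_rows(int n, double *m, double value)
{
    assert(n > 0 && m);

    #pragma omp parallel for schedule(static)
    for (int i = 0; i < n; i++)
        for (int j = 0; j < n; j++)
            m[(size_t)i * n + j] = value;
}

/* Floating-point operations of one square DGEMM: n^3 multiplies plus
 * n^3 additions. Dividing by the runtime in seconds gives FLOP/s. */
static double dgemm_flops(int n)
{
    return 2.0 * (double)n * (double)n * (double)n;
}
```

A caller would allocate three n x n arrays, fill each with `init_rows`, and then time `dgemm_ref`; compiled without OpenMP support, the pragmas are ignored and the code runs serially with the same results. A real measurement would replace `dgemm_ref` with the library's own DGEMM routine.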
References
BenchIT: Homepage, http://www.benchit.org
Juckeland, G., Börner, S., Kluge, M., Kölling, S., Nagel, W.E., Pflüger, S., Röding, H., Seidl, S., William, T., Wloch, R.: ParCo 2003: BenchIT - Performance Measurement and Comparison for Scientific Applications (2003), http://www.benchit.org/DOWNLOAD/DOCUMENTS/parco_paper.pdf
Juckeland, G., Kluge, M., Nagel, W.E., Pflüger, S.: Performance Analysis with BenchIT: Portable, Flexible, Easy to Use. In: QEST, pp. 320–321. IEEE Computer Society, Los Alamitos (2004), ISBN 0-7695-2185-1
Schöne, R., Juckeland, G., Nagel, W.E., Pflüger, S., Wloch, S.: ParCo 2005: Performance comparison and optimization: Case studies using BenchIT (2005), http://www.benchit.org/downloads/documents/parco_05_abstract.pdf
Silicon Graphics Inc.: Homepage, http://www.sgi.com
Oak Ridge National Laboratory: Evaluation of the Altix 3700 at Oak Ridge National Laboratory, http://www.gelato.unsw.edu.au/archives/linux-ia64/0409/10993.html
University of Tennessee: Basic Linear Algebra Subprograms Technical (BLAST) Forum, http://www.netlib.org/utk/papers/blas-report.ps
Intel: Intel Math Kernel Library 8.0, http://www.intel.com/cd/software/products/asmo-na/eng/perflib/mkl/index.htm
Silicon Graphics Inc.: Scientific Computing Software Library, http://www.sgi.com/products/software/scsl.html
Silicon Graphics Inc.: Linux Application Tuning Guide, http://techpubs.sgi.com/library/manuals/4000/007-4639-004/pdf/007-4639-004.pdf
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hackenberg, D., Schöne, R., Nagel, W.E., Pflüger, S. (2006). Optimizing OpenMP Parallelized DGEMM Calls on SGI Altix 3700. In: Nagel, W.E., Walter, W.V., Lehner, W. (eds) Euro-Par 2006 Parallel Processing. Euro-Par 2006. Lecture Notes in Computer Science, vol 4128. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11823285_15
DOI: https://doi.org/10.1007/11823285_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37783-2
Online ISBN: 978-3-540-37784-9
eBook Packages: Computer Science (R0)