Abstract
The aim is to present a new data distribution of triangular matrices that provides steady distribution of blocks among processes and reduces memory wasting compared to the standard block-cyclic data layout used in the ScaLAPACK library for dense matrix computations. A new algorithm for solving triangular systems of linear equations is also introduced. The results of experiments performed on a cluster of Itanium 2 processors and Cray X1 show that in some cases, the new method is faster than corresponding PBLAS routines PSTRSV and PSTRSM.
The work has been sponsored by the KBN grant 6T11 2003C/06098. The use of Cray X1 from the Interdisciplinary Center for Mathematical and Computational Modeling (ICM) of the Warsaw University is kindly acknowledged.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Baboulin, M., Giraud, L., Gratton, S., Langou, J.: A distributed packed storage for large dense parallel in-core calculations. Technical Report TR/PA/05/30, CERFACS (2005)
Blackford, L., et al.: ScaLAPACK User’s Guide. SIAM, Philadelphia (1997)
Chaudron, M.R., van Duin, A.C.: The formal derivation of parallel triangular system solvers using a coordination-based design method. Parallel Comput. 24, 1023–1046 (1998)
D’Azevedo, E.F., Dongarra, J.J.: LAPACK Working Note 135: Packed storage extension for ScaLAPACK (1998)
Dongarra, J., Duff, I., Sorensen, D., Van der Vorst, H.: Solving Linear Systems on Vector and Shared Memory Computers. SIAM, Philadelphia (1991)
Dongarra, J.J., Whaley, R.C.: LAPACK Working Note 94: A user’s guide to the BLACS, vol. 1.1 (1997), http://www.netlib.org/blacs
Gustavson, F.G., Karlsson, L., Kagstrom, B.: Three algorithms for Cholesky factorization on distributed memory using packed storage. In: Workshop on State-of-the-Art in Scientific and Parallel Computing, Umea, Sweden, June 2006. LNCS (2006)
Heath, M., Romine, C.: Parallel solution of triangular systems on distributed memory multiprocessors. SIAM J. Sci. Statist. Comput. 9, 558–588 (1988)
Li, G., Coleman, T.F.: A new method for solving triangular systems on distributed-memory message-passing multiprocessors. SIAM J. Sci. Stat. Comput. 10, 382–396 (1989)
Netwok Computer Services Inc.: The AHPCRC Cray X1 primer, http://www.ahpcrc.org/publications/Primer.pdf
Romine, C., Ortega, J.: Parallel solutions of triangular systems of equations. Parallel Comput. 6, 109–114 (1988)
Stpiczyński, P.: Parallel Cholesky factorization on orthogonal multiprocessors. Parallel Computing 18, 213–219 (1992)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Stpiczyński, P. (2007). New Data Distribution for Solving Triangular Systems on Distributed Memory Machines. In: Kågström, B., Elmroth, E., Dongarra, J., Waśniewski, J. (eds) Applied Parallel Computing. State of the Art in Scientific Computing. PARA 2006. Lecture Notes in Computer Science, vol 4699. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75755-9_71
Download citation
DOI: https://doi.org/10.1007/978-3-540-75755-9_71
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-75754-2
Online ISBN: 978-3-540-75755-9
eBook Packages: Computer ScienceComputer Science (R0)