Skip to main content

New Data Distribution for Solving Triangular Systems on Distributed Memory Machines

  • Conference paper
Applied Parallel Computing. State of the Art in Scientific Computing (PARA 2006)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4699))

Included in the following conference series:

  • 1230 Accesses

Abstract

The aim is to present a new data distribution of triangular matrices that provides steady distribution of blocks among processes and reduces memory wasting compared to the standard block-cyclic data layout used in the ScaLAPACK library for dense matrix computations. A new algorithm for solving triangular systems of linear equations is also introduced. The results of experiments performed on a cluster of Itanium 2 processors and Cray X1 show that in some cases, the new method is faster than corresponding PBLAS routines PSTRSV and PSTRSM.

The work has been sponsored by the KBN grant 6T11 2003C/06098. The use of Cray X1 from the Interdisciplinary Center for Mathematical and Computational Modeling (ICM) of the Warsaw University is kindly acknowledged.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Baboulin, M., Giraud, L., Gratton, S., Langou, J.: A distributed packed storage for large dense parallel in-core calculations. Technical Report TR/PA/05/30, CERFACS (2005)

    Google Scholar 

  2. Blackford, L., et al.: ScaLAPACK User’s Guide. SIAM, Philadelphia (1997)

    Google Scholar 

  3. Chaudron, M.R., van Duin, A.C.: The formal derivation of parallel triangular system solvers using a coordination-based design method. Parallel Comput. 24, 1023–1046 (1998)

    Article  MATH  MathSciNet  Google Scholar 

  4. D’Azevedo, E.F., Dongarra, J.J.: LAPACK Working Note 135: Packed storage extension for ScaLAPACK (1998)

    Google Scholar 

  5. Dongarra, J., Duff, I., Sorensen, D., Van der Vorst, H.: Solving Linear Systems on Vector and Shared Memory Computers. SIAM, Philadelphia (1991)

    Google Scholar 

  6. Dongarra, J.J., Whaley, R.C.: LAPACK Working Note 94: A user’s guide to the BLACS, vol. 1.1 (1997), http://www.netlib.org/blacs

  7. Gustavson, F.G., Karlsson, L., Kagstrom, B.: Three algorithms for Cholesky factorization on distributed memory using packed storage. In: Workshop on State-of-the-Art in Scientific and Parallel Computing, Umea, Sweden, June 2006. LNCS (2006)

    Google Scholar 

  8. Heath, M., Romine, C.: Parallel solution of triangular systems on distributed memory multiprocessors. SIAM J. Sci. Statist. Comput. 9, 558–588 (1988)

    Article  MATH  MathSciNet  Google Scholar 

  9. Li, G., Coleman, T.F.: A new method for solving triangular systems on distributed-memory message-passing multiprocessors. SIAM J. Sci. Stat. Comput. 10, 382–396 (1989)

    Article  MATH  MathSciNet  Google Scholar 

  10. Netwok Computer Services Inc.: The AHPCRC Cray X1 primer, http://www.ahpcrc.org/publications/Primer.pdf

  11. Romine, C., Ortega, J.: Parallel solutions of triangular systems of equations. Parallel Comput. 6, 109–114 (1988)

    Article  MathSciNet  Google Scholar 

  12. Stpiczyński, P.: Parallel Cholesky factorization on orthogonal multiprocessors. Parallel Computing 18, 213–219 (1992)

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Bo Kågström Erik Elmroth Jack Dongarra Jerzy Waśniewski

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Stpiczyński, P. (2007). New Data Distribution for Solving Triangular Systems on Distributed Memory Machines. In: Kågström, B., Elmroth, E., Dongarra, J., Waśniewski, J. (eds) Applied Parallel Computing. State of the Art in Scientific Computing. PARA 2006. Lecture Notes in Computer Science, vol 4699. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75755-9_71

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-75755-9_71

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-75754-2

  • Online ISBN: 978-3-540-75755-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics