Skip to main content

Partitioning sparse rectangular matrices for parallel computations of Ax and A T v

  • Conference paper
  • First Online:
Applied Parallel Computing Large Scale Scientific and Industrial Problems (PARA 1998)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1541))

Included in the following conference series:

Abstract

This paper addresses the problem of partitioning the nonzeros of sparse nonsymmetric and nonsquare matrices in order to efficiently compute parallel matrix-vector and matrix-transpose-vector multiplies. Our goal is to balance the work per processor while keeping communications costs low. Although the symmetric partitioning problem has been well-studied, the nonsymmetric and rectangular cases have received scant attention. We show that this problem can be described as a partitioning problem on a bipartite graph. We then describe how to use (modified) multilevel methods to partition these graphs and how to implement the matrix multiplies in parallel to take advantage of the partitioning. Finally, we compare various multilevel and other partitioning strategies on matrices from different applications. The multilevel methods are shown to be best.

This work was supported by the Applied Mathematical Sciences Research Program, Office of Energy Research, U.S. Department of Energy, under contracts DE-AC05-96OR22464 and DE-AL04-94AL85000 with Lockheed Martin Energy Research Corporation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Stephen T. Barnard and Horst D. Simon. A fast multilevel implementation of recursive spectral bisection for partitioning unstructured problems. Concurrency: Practice and Experience, 6:101–117, 1994.

    Article  Google Scholar 

  2. Michael W. Berry, Bruce Hendrickson, and Padma Raghavan. Sparse matrix reordering schemes for browsing hypertext. In James Renegar, Michael Shub, and Steve Smale, editors, The Mathematics of Numerical Analysis, volume 32 of Lectures in Applied Mathematics, pages 99–122. American Mathematical Society, 1996.

    Google Scholar 

  3. Roland W. Freund and Noël M. Nachtigal. QMR: A quasi-minimal residual method for non-Hermitian linear systems. Numer. Math., 60:315–339, 1991.

    Article  MathSciNet  MATH  Google Scholar 

  4. Michael R. Garey and David S. Johnson. Computers and Intractability: A Guide to the Theory of NP-Completeness. W. H. Freeman and Company, New York, 1979.

    MATH  Google Scholar 

  5. Markus Hegland. Description and use of animal breeding data for large least squares problems. Technical Report TR-PA-93-50, CERFACS, Toulouse, France, 1993.

    Google Scholar 

  6. Bruce Hendrickson and Tamara G. Kolda. Partitioning nonsquare and nonsymmetric matrices for parallel processing. Technical Memorandum TM-13657, Oak Ridge National Laboratory, Oak Ridge, TN 37831, 1998. Submitted to SIAM J. Scientific Computing.

    Google Scholar 

  7. Bruce Hendrickson and Robert Leland. The Chaco user’s guide, version 2.0. Technical Report SAND95-2344, Sandia Natl. Lab., Albuquerque, NM, 87185, 1995.

    Google Scholar 

  8. Bruce Hendrickson and Robert Leland. A multilevel algorithm for partitioning graphs. In Proc. Supercomputing ’95. ACM, 1995.

    Google Scholar 

  9. A. Hofer. Schätzung von Zuchtwerten feldgeprüfter Schweine mit einem Mehrmerkmals-Tiermodell. PhD thesis, ETH-Zurich, 1990. Cited in Markus Hegland. Description and use of animal breeding data for large least squares problems. Technical Report TR-PA-93-50, CERFACS, Toulouse, France, 1993.

    Google Scholar 

  10. George Karypis and Vipin Kumar. A fast and high quality multilevel scheme for partitioning irregular graphs. Technical Report 95-035, Dept. Computer Science, Univ. Minnesota, Minneapolis, MN 55455, 1995.

    Google Scholar 

  11. George Karypis and Vipin Kumar. Parallel multilevel graph partitioning. Technical Report 95-036, Dept. Computer Science, Univ. Minnesota, Minneapolis, MN 55455, 1995.

    Google Scholar 

  12. B. W. Kernighan and S. Lin. An efficient heuristic procedure for partitioning graphs. Bell System Technical J., 1970.

    Google Scholar 

  13. Tamara G. Kolda. Partitioning sparse rectangular matrices for parallel processing. In Proc. 5th Intl. Symposium on Solving Irregularly Structured Problems in Parallel (Irregular ’98), to appear.

    Google Scholar 

  14. Christopher C. Paige and Michael A. Saunders. LSQR: An algorithm for sparse linear equations and sparse least squares. ACM Trans. Mathematical Software, 8:43–71, 1982.

    Article  MathSciNet  MATH  Google Scholar 

  15. Weichung Wang and Dianne P. O’Leary. Adaptive use of iterative methods in interior point methods for linear programming. Technical Report CS-TR-3560, Dept. Computer Science, Univ. Maryland, College Park, MD 20742, 1995.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Bo Kågström Jack Dongarra Erik Elmroth Jerzy Waśniewski

Rights and permissions

Reprints and permissions

Copyright information

© 1998 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hendrickson, B., Kolda, T.G. (1998). Partitioning sparse rectangular matrices for parallel computations of Ax and A T v . In: Kågström, B., Dongarra, J., Elmroth, E., Waśniewski, J. (eds) Applied Parallel Computing Large Scale Scientific and Industrial Problems. PARA 1998. Lecture Notes in Computer Science, vol 1541. Springer, Berlin, Heidelberg . https://doi.org/10.1007/BFb0095342

Download citation

  • DOI: https://doi.org/10.1007/BFb0095342

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-65414-8

  • Online ISBN: 978-3-540-49261-0

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics