Abstract
In this paper we describe algorithms for the ordering and numerical factorization step in parallel sparse Cholesky factorization. Direct methods for solving sparse positive definite systems play an important role in many scientific applications such as linear programming and structual engineering. The importance of direct methods is mainly due to their generality and robustness. The paper describes minimum degree and nested dissection based ordering methods and presents a scalable parallel algorithm for the factorization of sparse matrices. The interested reader will find many references to the relevant literature.
Preview
Unable to display preview. Download preview PDF.
References
F.L. Alvarado, A. Pothen, R. Schreiber, Highly parallel sparse triangular solution, RIACS Techn. Rep. 92.11, NASA Ames Research Center, 1992.
P. Amestoy, T.A. Davis, I.S. Duff, An approximate minimum degree ordering algorithm, Techn. Rep. TR/PA/95/09, Parallel Algorithm Project, CERFACS, Toulouse, 1995.
C. Ashcraft, S.C. Eisenstat, J.W.H. Liu, A. Sherman, A comparison of three distributed sparse factorization schemes, SIAM Symposium on Sparse Matrices, 1989.
C. Ashcraft, S.C. Eisenstat, J.W.H. Liu, A fan-in algorithm for distributed sparse numerical factorization, SIAM J. Sci. Stat. Comput., Vol. 11, No. 3, 593–599, 1990.
C. Ashcraft, J.W.H. Liu, Using domain decomposition to find graph bisectors, Techn. Rep. ISSTECH-95-024, Boeing Computer Services, Seattle, 1995.
C. Ashcraft, J.W.H. Liu, Robust ordering of sparse matrices using multisection, Techn. Rep. ISSTECH-96-002, Boeing Computer Services, Seattle, 1996.
C. Ashcraft, J.W.H. Liu, applications of the Dulmage-Mendelsohn decomposition and network flow of graph bisection improvement, Techn. Rep., Boeing Computer Services, Seattle, 1996.
S.T. Barnard, H.D. Simon, A fast multilevel implementation of recursive spectral bisection, Proc. of 6th SIAM Conf. Parallel Processing for Scientific Computing, 711–718, 1993.
P. Berman, G. Schnitger, On the performance of the minimum degree ordering for Gaussian elimination, SIAM J. Matrix Anal. Appl., Vol. 11, No. 1, 83–88, 1990.
C.-K. Cheng, Y.-C. Weil, An improved two-way partitioning algorithm with stable performance, IEEE Transactions on Computer Aided Design, Vol. 10, No. 12, 1502–1511, 1991.
R. Diekmann, B. Monien, R. Preis, Using helpful sets to improve graph bisections, DIMACS Series in Discrete Mathematics and Theoretical Computer Science, American Mathematical Society, Volume 21, 1995.
I.S. Duff, J.K. Reid, The multifrontal solution of indefinite sparse symmetric linear equations, ACM Trans. Math. Software, Vol. 9, 302–325, 1983.
A. Dulmage, N. Mendelsohn, Coverings of bipartite graphs, Can. J. Math., Vol. 10, 517–534, 1958.
S.C. Eisenstat, M.H. Schultz, A.H. Sherman, Algorithms and data structures for sparse symmetric Gaussian elimination, SIAM J. Sci. Stat. Comput., Vol. 2, No. 2, 225–237, 1981.
C.M. Fiduccia, R.M. Mattheyses, A linear-time heuristic for improving network partitions, 19th IEEE Design Automation Conference, 175–181, 1982.
K.A. Gallivan, R.J. Plemmons, A.H. Sameh, Parallel algorithms for dense linear algebra computations, SIAM Review Vol. 32, No. 1, 54–135, 1990.
G.A. Geist, C.H. Romine, LU factorization algorithms on distributed-memory multiprocessor architectures, SIAM J. Sci. Stat. Comput., Vol. 9, No. 4, 639–649, 1988.
G.A. Geist, E. Ng, Task scheduling for parallel sparse Cholesky factorization, International Journal of Parallel Programming, Vol. 18, No. 4, 291–314, 1989.
A. George, Nested dissection of a regular finite element mesh, SIAM J. Numer. Anal., Vol. 10, No. 2, 345–363, 1973.
A. George, J.W.H. Liu, An automatic nested dissection algorithm for irregular finite element problems, SIAM J. Numer. Anal., Vol. 15, No. 5, 1053–1069, 1978.
A. George, M.T. Heath, E. Ng, J.W.H. Liu, Symbolic Cholesky factorization on a local-memory multiprocessor, Parallel Computing, 85–95, 1987.
J.A. George, J.W.H. Liu, Computer Solution of Large Sparse Positive Definite Systems, Prentice-Hall, Englewood Cliffs, NJ, 1981.
A. George, J.W.H. Liu, A minimal storage implementation of the minimum degree algorithm, SIAM J. Numer. Anal, Vol. 17, No. 2, 282–299, 1980.
A. George, J.W.H. Liu, A fast implementation of the minimum degree algorithm using quotient graphs, ACM Trans. Math. Software, Vol. 6, 337–358, 1980.
A. George, J.W.H. Liu, The evolution of the minimum degree ordering algorithm, SIAM Review, Vol. 31, No. 1, 1–19, 1989.
A. George, J.W.H. Liu, E. Ng, Communication results for parallel sparse Cholesky factorization on a hypercube, Parallel Computing 10, 287–298, 1989.
A. George, M.T. Heath, J.W.H. Liu, E. Ng, Sparse Cholesky factorization on a local-memory multiprocessor, SIAM J. Sci. Stat. Comput., Vol. 9, No. 2, 327–340, 1988.
M. Ghose, E. Rothberg, A parallel implementation of the multiple minimum degree ordering heuristic, Techn. Rep., Old Dominion University, Norfolk, 1994.
N.E. Gibbs, W.G. Poole, P.K. Stockmeyer, An algorithm for reducing the bandwidth and profile of a sparse matrix, SIAM J. Numer. Anal., Vol. 13, 236–250, 1976.
J.R. Gilbert, C. Moler, R. Schreiber, Sparse matrices in MATLAB: design and implementation, SIAM J. Matrix Anal. Appl., Vol. 13, 333–356, 1992.
J.R. Gilbert, R. Schreiber, Highly sparse Cholesky factorization, SIAM J. Sci. Stat. Comput., Vol. 13, No. 5, 1151–1172, 1992.
A. Gupta, Fast and effective algorithms for graph partitioning and sparse matrix ordering, IBM T.J. Watson Research Center, Research Report RC 20496, New York, 1996.
A. Gupta, WGPP: Watson graph partitioning (and sparse matrix ordering) package, users manual, IBM T.J. Watson Research Center, Research Report RC 20453, New York, 1996.
A. Gupta, V. Kumar, A Scalable parallel algorithm for sparse matrix factorization, Tech. Rep. 94-19, CS-Dep., Univ. Minnesota, 1994.
L. Hagen, A. Kahng, A new approach in effective circuit clustering, Proc. of IEEE International Conference on Computer Aided Design, 422–427, 1992.
M.T. Heath, E. Ng, B.W. Peyton, Parallel algorithms for sparse linear systems, SIAM Review, Vol. 33, No. 3, 420–460, 1991.
B. Hendrickson, R. Leland, The chaco user's guide, Tech. Rep. SAND94-2692, Sandia Nat. Lab., 1994.
B. Hendrickson, R. Leland, An improved spectral graph partitioning algorithm for mapping parallel computations, SIAM J. Sci. Comput., Vol. 16, 1995.
B. Hendrickson, R. Leland, A multilevel algorithm for partitioning graphs, Proc. of Supercomputing'95, 1995.
B. Hendrickson, E. Rothberg, Improving the runtime and quality of nested dissection ordering, Techn. Rep., SAND96-0868, Sandia Nat. Lab., 1996.
B. Hendrickson, E. Rothberg, Effective sparse matrix ordering: just around the BEND, Proc. of 8th SIAM Conf. Parallel Processing for Scientific Computing, 1997.
L. Hulbert, E. Zmijewski, Limiting communication in parallel sparse Cholesky factorization, SIAM J. Sci. Stat. Comput., Vol. 12, No. 5, 1184–1197, 1991.
G. Karypis, V. Kumar, A high performance sparse Cholesky factorization algorithm for scalable parallel computers, Tech. Rep. 94-41, CS-Dep., Univ. Minnesota, 1994.
G. Karypis, V. Kumar, A fast and high quality multilevel scheme for partitioning irregular graphs, Tech. Rep. 95-035, CS-Dep., Univ. Minnesota, 1995.
G. Karypis, V. Kumar, METIS: unstructured graph partitioning and sparse matrix ordering system, Techn. Rep., CS-Dep., Univ. Minnesota, 1995.
B.W. Kernighan, S. Lin, An effective heuristic procedure for partitioning graphs, The Bell Systems Technical Journal, 291–308, 1970.
V. Kumar, A. Grama, A. Gupta, G. Karypis, Introduction to Parallel Computing: Design and Analysis of Algorithms, Benjamin Cummings Publishing Company, Redwood City, CA, 1994.
D. König, Über Graphen und ihre Anwendung auf Determinantentheorie und Mengenlehre, Math. Ann., 77, 453–465, 1916.
M. Leuze, Independent set orderings for parallel matrix factorization by Gaussian elimination, Parallel Computing, Vol. 10, 177–191, 1989.
J.G. Lewis, B.W. Peyton, A. Pothen, A fast algorithm for reordering sparse matrices for parallel factorization, SIAM J. Sci. Stat. Comput., Vol. 10, No. 6, 1146–1173, 1989.
J.W.H. Liu, Modification of the minimum-degree algorithm by multiple elimination, ACM Trans. Math. Software, Vol. 11, No. 2, 141–153, 1985.
J.W.H. Liu, Equivalent sparse matrix reordering by elimination tree rotations, SIAM J. Sci. Stat. Comput., Vol. 9, No. 3, 424–444, 1988.
J.W.H. Liu, A graph partitioning algorithm by node separators, ACM Trans. Math. Software, Vol. 15, No. 3, 198–219, 1989.
J.W.H. Liu, Reordering sparse matrices for parallel elimination, Parallel Computing 11, 73–91, 1989.
J.W.H. Liu, The minimum degree ordering with constraints, SIAM J. Sci. Stat. Comput., Vol. 10, No. 6, 1136–1145, 1989.
J.W.H. Liu, The role of elimination trees in sparse factorization, SIAM J. Matrix Anal. Appl., Vol. 11, No. 1, 134–172, 1990.
J.W.H. Liu, The multifrontal method for sparse matrix solutions: theory and practice, SIAM Review, Vol. 34, No. 1, 82–109, 1992.
J.W.H. Liu, A. Mirzaian, A linear reordering algorithm for parallel pivoting of chordal graphs, SIAM J. Disc. Math., Vol. 2, No. 1, 100–107, 1989.
R.F. Lucas, T. Blank, J.J. Tiemann, A parallel solution method for large sparse systems of equations, IEEE Transactions on Computer Aided Design, Vol. 6, No. 6, 981–991, 1987.
M. Mu, J.R. Rice, A grid-based subtree-subcube assignment strategy for solving partial differential equations on hypercubes, SIAM J. Sci. Stat. Comput., Vol. 13, No. 3, 826–839, 1992.
S.V. Parter, The use of linear graphs in Gauss elimination, SIAM Review, Vol. 3, 119–130, 1961.
R. Pozo, S.L. Smith, Performance evaluation of the parallel multifrontal method in a distributed-memory environment, Proc. of 6th SIAM Conference on Parallel Processing for Scientific Computing, 1993.
A. Pothen, C.-J. Fan, Computing the block triangular form of a sparse matrix, ACM Trans. Math. Software, Vol. 16, No. 4, 303–324, 1990.
A. Pothen, H.D. Simon, K.-P. Liou, Partitioning sparse matrices with eigenvectors of graphs, SIAM J. Matrix Anal. Appl., Vol. 11, No. 3, 430–452, 1990.
A. Pothen, C. Sun, Distributed multifrontal factorization using clique trees, Proc. 5th SIAM Conference on Parallel Processing for Scientific Computing, 34–40, 1991.
A. Pothen, C. Sun, A mapping algorithm for parallel sparse Cholesky factorization, SIAM J. Sci. Comput., Vol. 14, No. 5, 1253–1257, 1993.
R. Preis, R. Diekmann, The PARTY partitioning library user guide — version 1.1, Techn. Rep., CS-Dept., Univ. of Paderborn, 1996.
E. Rothberg, A. Gupta, An efficient block-oriented approach to parallel sparse Cholesky factorization, Proc. of Supercomputmg'92. 1992.
D. J. Rose, A graph-theoretic study of the numerical solution of sparse positive definite systems of linear equations, in Graph-Teory and Computing, R. Read (Ed.), Academic Press, New York, 1972.
D. J. Rose, R.E. Tarjan, G.S. Luecker, Algorithmic aspects of vertex elimination on graphs, SIAM J. Comput., Vol. 5, No. 2, 266–283, 1976.
R. Schreiber, Scalability of sparse direct solvers, in Sparse Matrix Computations: Graph Theory Issues and Algorithms, J.R. Gilbert, J.W.H. Liu (Eds.), Springer Verlag, 1992.
J. Schulze, A new multilevel scheme for constructing vertex separators, Techn. Rep., CS-Dept., Univ. of Paderborn, 1997.
J. Schulze, Implementation of a parallel algorithm for sparse matrix factorization, Techn. Rep., CS-Dept., Univ. of Paderborn, 1996.
M. Yannakakis, Computing the minimum fill-in is NP-complete, SIAM J. Alg. Disc. Meth., Vol. 2, No. 1, 77–79, 1981.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Monien, B., Schulze, J. (1997). Parallel sparse Cholesky factorization. In: Bilardi, G., Ferreira, A., Lüling, R., Rolim, J. (eds) Solving Irregularly Structured Problems in Parallel. IRREGULAR 1997. Lecture Notes in Computer Science, vol 1253. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-63138-0_22
Download citation
DOI: https://doi.org/10.1007/3-540-63138-0_22
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63138-5
Online ISBN: 978-3-540-69157-0
eBook Packages: Springer Book Archive