Sparse QR factorization on a massively parallel computer

Kratzer, Steven G.

doi:10.1007/BF00155801

Sparse QR factorization on a massively parallel computer

Published: December 1992

Volume 6, pages 237–255, (1992)
Cite this article

The Journal of Supercomputing Aims and scope Submit manuscript

Steven G. Kratzer¹

54 Accesses
3 Citations
Explore all metrics

Abstract

This paper shows that QR factorization of large, sparse matrices can be performed efficiently on massively parallel SIMD (single instruction stream/multiple data stream) computers such as the Connection Machine CM-2. The problem is cast as a dataflow graph, whose nodes are mapped to a “virtual dataflow machine” in such a way that only nearest-neighbor communication is required. This virtual machine is implemented by programming the CM-2 processors to support a restricted dataflow protocol. Execution results for several test matrices show that good performance can be obtained without relying on nested dissection techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Ahmed, H., Delosme, J.-M., and Morf, M. 1982. Highly concurrent computing structures for matrix arithmetic and signal processing. IEEE Comp. (Jan.): 65–81.
Argyris, J., and Bronlund, O. 1975. The natural factor formulation of the stiffness matrix displacement method. Comput. Methods Appl. Mech. Engrg., 5: 97–119.
Google Scholar
Betancourt, R. 1986. Efficient parallel processing technique for inverting matrices with random sparsity. IEEE Proc., 133B: 236–240.
Google Scholar
Brent, R.P., Luk, F.T., and Van Loan, C. 1985. Computation of the singular value decomposition using mesh connected processors. J. VLSI. and Comp. Systems, 1: 242–270.
Google Scholar
Dennis, J.B. 1980. Dataflow supercomputers. IEEE Comp., 13, 11 (Nov.): 48–56.
Google Scholar
Dongarra, J., Sameh, A., and Sorensen, D. 1984. Implementation of some concurrent algorithms for matrix factorization. Tech. Rept. MCS/TM-25, Argonne Nat. Laboratory.
Duff, I. 1986. Parallel implementation of multifrontal schemes. Parallel Computing, 3: 193–204.
Google Scholar
Gannon, D. 1980. A note on pipelining mesh-connected multiprocessors for finite element problems by nested dissection. In Proc., 1990 Internat. Conf. on Parallel Processing (St. Charles, Ill., Aug. 13–17), Penn. State Univ. Press, pp. 197–204.
George, A., and Heath, M. 1980. Solution of sparse linear least squares problems using Givens rotations. Lin. Alg. Appl., 34: 69–83.
Google Scholar
George, A., and Ng, E. 1983. On row and column orderings for sparse least squares problems. SIAM J. Numer. Anal., 20: 326–344.
Google Scholar
George, A., and Ng, E. 1984. SPARSPAK: Waterloo Sparse Matrix Package User's Guide for Sparspak-B. Research rept. CS-84–37, Dept. of Comp. Sci., Univ. of Waterloo, Waterloo, Ontario.
Google Scholar
George, A., and Ng, E. 1986. Orthogonal reduction of sparse matrices to upper triangular form using Householder transformations. SIAM J. Sci. Stat. Comput., 7: 460–472.
Google Scholar
George, A., Heath, M., and Liu, J. 1986. Parallel Cholesky factorization on a shared-memory multiprocessor. Lin. Alg. Appl., 77: 165–187.
Google Scholar
Gilbert, J., and Schreiber, R. 1990. Highly parallel sparse Cholesky factorization. Tech. rept. CSL-90-7, Xerox Palo Alto Research Center. (To appear in SIAM J. Sci. Stat. Comp.)
Gill, P., et al. 1986. On projected Newton barrier methods for linear programming and an equivalence to Karmarkar's projective method. Math. Prog., 36: 183–209.
Google Scholar
Golub, G., and Van Loan, C. 1989. Matrix Computations, 2nd ed. Johns Hopkins Univ. Press, Baltimore, Md.
Google Scholar
Heath, M., and Sorenson, D. 1985. A pipelined Givens method for computing the QR factorization of a sparse matrix. Tech. Memo No. 47, Argonne Nat. Laboratory.
Heath, M., Ng, E., and Peyton, B. 1991. Parallel algorithms for sparse linear systems. SIAM Review, 33, 3 (Sept.): 420–460.
Google Scholar
Ida, N., and Udawatta, K. 1986. Solution of large linear systems of equations on the Massively Parallel Processor. In Frontiers of Massively Parallel Scientific Computing (Greenbelt, Md., Sept. 24–25), NASA Conf. Pub. 2478, pp. 257–263.
Karmarkar, N. 1984. A new polynomial time algorithm for linear programming. Combinatorica, 4: 373–395.
Google Scholar
Kaufman, L. 1979. Application of dense Householder transformations to a sparse matrix. ACM Trans. Math. Software, 5: 442–450.
Google Scholar
Levitt, C. 1989. Grid communications on the Connection Machine: Analysis, performance, and improvements. In Scientific Applications of the Connection Machine (H. Simon, (ed.)), World Scientific, Singapore, pp. 316–322.
Google Scholar
Liu, J. 1989. Reordering sparse matrices for parallel elimination. Parallel Computing, 11: 73–91.
Google Scholar
Liu, J. 1990. The role of elimination trees in sparse factorization. SIAM J. Matrix Analysis and Applications, 11: 134–172.
Google Scholar
Lucas, R. 1987. Solving planar systems of equations on distributed memory multiprocessors. Ph.D. thesis, Stanford Univ., Stanford, Calif.
Google Scholar
Middleton, D., and Tamboulian, S. 1989. Evaluating local indirect addressing in SIMD processors. Rept. 89-30, NASA ICASE.
Moore, D. 1987. Dense patch-oriented matrix factorization on a hypercube multiprocessor. TR-87–809, Dept. of Comp. Sci., Cornell Univ., Ithaca, N.Y.
Google Scholar
Opsahl, T., and Reif, J. 1986. Solving very large, sparse linear systems on mesh-connected parallel computers. In Frontiers of Massively Parallel Scientific Computing (Greenbelt, Md., Sept. 24–25), NASA Conf. Pub. 2478, pp. 249–256.
Paige, C.C., and Saunders, M.A. 1982. LSQR: An algorithm for sparse linear equations and sparse least squares. ACM Trans. Math. Software, 8, 1 (Mar.): 43–71.
Google Scholar
Pan, V., and Reif, J. 1985. Efficient parallel solution of linear systems. In Proc., Seventeenth Annual ACM Symp. on the Theory of Computing (Providence, R.I., May 6–8), Academic Press, pp. 143–152.

Download references

Author information

Authors and Affiliations

Supercomputing Research Center, 17100 Science Drive, 20715-4300, Bowie, MD, USA
Steven G. Kratzer

Authors

Steven G. Kratzer
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kratzer, S.G. Sparse QR factorization on a massively parallel computer. J Supercomput 6, 237–255 (1992). https://doi.org/10.1007/BF00155801

Download citation

Issue Date: December 1992
DOI: https://doi.org/10.1007/BF00155801

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Sparse QR factorization on a massively parallel computer

Abstract

Access this article

Similar content being viewed by others

Parallelization of Sparse Matrix Kernels for Big Data Applications

Analysis of Partitioning Models and Metrics in Parallel Sparse Matrix-Vector Multiplication

Sparse Direct Solution on Parallel Computers

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Sparse QR factorization on a massively parallel computer

Abstract

Access this article

Similar content being viewed by others

Parallelization of Sparse Matrix Kernels for Big Data Applications

Analysis of Partitioning Models and Metrics in Parallel Sparse Matrix-Vector Multiplication

Sparse Direct Solution on Parallel Computers

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation