Abstract
This paper describes block algorithms for Cholesky and LU factorizations of band matrices. The algorithms can be implemented using Level 3 BLAS. Although they involve some extra data movement and extra arithmetic operations, on some machines they can outperform unblocked algorithms which use Level 2 BLAS. We present performance measurements to demonstrate this, on an IBM 3090 VF, a Cray 2, and a Siemens VP 400.
Preview
Unable to display preview. Download preview PDF.
References
Bischof, C., Demmel, J., Dongarra, J.J., Du Croz, J.J., Greenbaum, A., Hammarling, S.J. and Sorensen, D., LAPACK Working Note #5 Provisional Contents. ANL-88-38, Argonne National Laboratory, 1988.
Dayde, M. and Duff, I.S., Use of Level 3 BLAS in LU Factorization on the Cray 2, the ETA-10P, and the IBM 3090-200/VF. CERFACS report TR 88/1, 1988.
Demmel, J., Dongarra, J.J., Du Croz, J.J., Greenbaum, A., Hammarling, S.J. and Sorensen, D., Prospectus for the Development of a Linear Algebra Library for High-Performance Computers. ANL/MCS-TM-97, Argonne National Laboratory, MCS Division, 1987.
Dongarra, J., Bunch, J., Moler, J., and Stewart, G., LINPACK Users' Guide. SIAM Philadelphia, 1979
Dongarra, J.J., Du Croz, J.J., Duff, I.S. and Hammarling, S.J., A Set of Level 3 Basic Linear Algebra Subprograms. ANL/MCS-TM-88 (revision 1), Argonne National Laboratory, MCS Division, 1988.
Dongarra, J.J., Du Croz, J.J., Hammarling, S.J. and Hanson, R., An Extended Set of Fortran Basic Linear Algebra Subprograms. ACM Trans. Math. Software, 14, pp. 1–17, 1988.
Engineering and Scientific Subroutine Library. Guide and Reference. SC23-0184-3
Geers, N., Optimization of Level 2 BLAS for Siemens VP Systems. Technical report no. 37.89, University of Karlsruhe, Computer Center, 1989.
Golub, G.H. and Van Loan, C.F., Matrix Computations, 2nd edition. The Johns Hopkins University Press, Baltimore, 1989.
Grasemann, H., Optimization of Level 3 BLAS for Siemens VP Systems. Technical report no. 38.89, University of Karlsruhe, Computer Center, 1989.
Lawson, C., Hanson, R.J., Kincaid, D. and Krogh F.T., Basic Linear Algebra Subprograms for Fortran Usage. ACM Trans. Math. Software, 5, pp. 308–323, 1979.
Mayes, P.J.D. and Radiati, G., LAPACK Working Note #12: Banded Cholesky Factorization Using Level 3 BLAS. ANL/MCS-TM-134, Argonne National Laboratory, MCS Division, 1989.
Mayes, P.J.D. and Radicati, G., Block Factorization Algorithms on the IBM 3090/VF. in Proceedings of the 1989 International Conference on Supercomputing, Crete, Greece, pp. 263–270, 1989.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1990 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Du Croz, J., Mayes, P., Radicati, G. (1990). Factorizations of band matrices using level 3 BLAS. In: Burkhart, H. (eds) CONPAR 90 — VAPP IV. VAPP CONPAR 1990 1990. Lecture Notes in Computer Science, vol 457. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-53065-7_102
Download citation
DOI: https://doi.org/10.1007/3-540-53065-7_102
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-53065-7
Online ISBN: 978-3-540-46597-3
eBook Packages: Springer Book Archive