Efficient implementation of Jacobi algorithms and Jacobi sets on distributed memory architectures

https://doi.org/10.1016/0743-7315(90)90134-BGet rights and content

Abstract

One-sided methods for implementing Jacobi diagonalization algorithms have been recently proposed for both distributed memory and vector machines. These methods are naturally well suited to distributed memory and vector architectures because of their inherent parallelism and their abundance of vector operations. Also, one-sided methods require substantially less message passing than the two-sided methods, and thus can achieve higher efficiency. We describe in detail the use of the one-sided Jacobi rotation as opposed to the rotation used in the “Hestenes” algorithm; we perceive this difference to have been widely misunderstood. Furthermore the one-sided algorithm generalizes to other problems such as the nonsymmetric eigenvalue problem while the Hestenes algorithm does not. We discuss two new implementations for Jacobi sets for a ring connected array of processors and show their isomorphism to the round-robin ordering. Moreover, we show that two implementations produce Jacobi sets in identical orders up to a relabeling. These orderings are optimal in the sense that they complete each sweep in a minimum number of stages with minimal communication. We present implementation results of one-sided Jacobi algorithms using these orderings on the NCUBE/seven hypercube as well as the Intel iPSC/2 hypercube. Finally, we mention how other orderings, and can be, implemented. The number of nonisomorphic Jacobi sets has recently been shown to become infinite with increasing n.

References (19)

  • M. Berry et al.

    Parallel algorithms for the singular value and dense symmetric eigen value problems

    (1988)
  • C. Bischof

    A parallel ordering for the block Jacobi method on a hypercube architecture

  • R.P. Brent et al.

    The solution of singular-value and symmetric eigenvalue problems on multiprocessor arrays

    SIAM J. Sci. Statist. Comput.

    (1985)
  • R.P. Brent et al.

    Computation of the singular value decomposition using mesh-connected processors

    J. VLSI Comput. Systems

    (1985)
  • P.J. Eberlein

    On the Schur decomposition of a matrix for parallel computation

    IEEE Trans. Comput.

    (1987)
  • P.J. Eberlein

    On one-sided Jacobi methods for parallel computation

    SIAM J. Algebraic Discrete Methods

    (1987)
  • P.J. Eberlein

    On using the Jacobi method on the hypercube

  • G.R. Gao et al.

    An optimal parallel Jacobi-like solution method for the singular value decomposition

  • E. Hansen

    On cyclic Jacobi methods

    J. SIAM

    (1963)
There are more references available in the full text version of this article.

Cited by (32)

  • An efficient method for computing eigenvalues of a real normal matrix

    2003, Journal of Parallel and Distributed Computing
    Citation Excerpt :

    This makes it very attractive for parallel processing. Many parallel Jacobi ordering algorithms have been introduced in the literature [1–3,6,7,9,13–16]. Most of these orderings can be classified into two equivalent groups by the definition of ordering equivalence, that is, two orderings are equivalent if they can generate the same set of index pairs at the same step by a relabelling of the initial indices.

  • Implementing the one-sided Jacobi method on a 2D/3D mesh multicomputer

    2001, Parallel Computing
    Citation Excerpt :

    The algorithms described in this section implement one of the ring Jacobi orderings proposed in [5].

View all citing articles on Scopus

The work of this author was supported by National Science Foundation Grants ASC-8705104 and CCR-8812521.

The work of this author was supported by National Science Foundation Grant CCR-8813493.

View full text