Elsevier

Parallel Computing

Volume 4, Issue 1, February 1987, Pages 17-31
Parallel Computing

Matrix algorithms on a hypercube I: Matrix multiplication

https://doi.org/10.1016/0167-8191(87)90060-3Get rights and content

Abstract

We discuss algorithms for matrix multiplication on a concurrent processor containing a two-dimensional mesh or richer topology. We present detailed performance measurements on hypercubes with 4, 16, and 64 nodes, and analyze them in terms of communication overhead and load balancing. We show that the decomposition into square subblocks is optimal C code implementing the algorithms is available.

References (17)

  • E Brooks

    Pure gauge SU(3) lattice theory on an array of computers

    Phys. Rev. Lett.

    (1984)
  • G Fox

    Matrix operations on the homogeneous machines

    (1982)
  • G Fox

    The performance of the Caltech hypercube in scientific calculations

  • G Fox et al.

    The Caltech concurrent computation program—Project description

  • G Fox et al.

    Algorithms for concurrent processors

    Phys. Today

    (may 1984)
  • G.A Geist et al.

    Parallel Cholesky factorization on a hypercube multiprocessor

    ORNL-6190 Technical Report

    (1985)
  • M.T Heath

    Parallel Cholesky factorization in message-passing multiprocessor enviroments

    ORNL-6150, Oak Ridge National Laboratory, Technical Report

    (1985)
There are more references available in the full text version of this article.

Cited by (167)

  • Parallel Theatre: An actor framework in Java for high performance computing

    2021, Simulation Modelling Practice and Theory
    Citation Excerpt :

    Parallel Theatre leverages on two concurrent executing modes: cooperative concurrency based on event-loops and message interleaving within each theatre, and truly parallelism of message processing in actors allocated to distinct theatres. The paper demonstrates the practical usefulness of Parallel Theatre by the development of two non-trivial scalable models respectively devoted to a parallel matrix multiplication algorithm [26–27] and to a large simulation model of the Iterated Prisoner's Dilemma (IPD) [36–38]. Both models exhibit a good execution performance.

  • Secure Outsourced Matrix Multiplication with Fully Homomorphic Encryption

    2024, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
View all citing articles on Scopus

The research reported here was supported in part by Department of Energy grants DE-AS03-ER13118, DE-FG-03-85ER25009, and by the Parsons Foundation and Systems Development Foundation. S. Otto holds a Bantrell Research Fellowship at Caltech.

View full text