Hyper-systolic implementation of BLAS-3 routines on the APE100/Quadrics machine

  • Conference paper
Applied Parallel Computing Large Scale Scientific and Industrial Problems (PARA 1998)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 1541)

Abstract

Basic Linear Algebra Subroutines (BLAS-3) [1] are building blocks for solving many numerical problems (Cholesky factorization, Gram-Schmidt orthonormalization, LU decomposition, …). Their efficient implementation on a given parallel machine is key to fully exploiting the system’s computational power. In this work we refer to a massively parallel processing SIMD machine (the APE100/Quadrics

References

  1. J. Choi, J.J. Dongarra, D.W. Walker: ‘The design of scalable software libraries for distributed memory concurrent computers’. In: J.J. Dongarra and B. Tourancheau (eds.), Environments and Tools for Parallel Scientific Computing. Elsevier 1982.

  2. A. Bartoloni et al.: ‘A hardware implementation of the APE100 architecture’. International Journal of Modern Physics C 4, 1993.

  3. T. Lippert, P. Palazzari, K. Schilling: ‘Automatic template generation for solving n² problems on parallel systems with arbitrary topology’. Proceedings of the IEEE Workshop on Parallel and Distributed Software Engineering, Boston (MA), May 1997.

  4. T. Lippert et al.: ‘Hyper-systolic matrix multiplication’. Proceedings of PDPTA ’97, CSREA 1997.

  5. T. Lippert, A. Seyfried, A. Bode, K. Schilling: ‘Hyper-Systolic Parallel Computing’. IEEE Trans. on Parallel and Distributed Systems, Vol. 9, No. 2, February 1998.

  6. P. Palazzari, T. Lippert, K. Schilling: ‘Simulated Annealing Techniques for communication-efficient Hyper-Systolic parallel computing on Quadrics’. NATO Advanced Research Workshop on High Performance Computing-Technology and Applications, June 24–25 1996, Cetraro (Italy).

Editor information

Bo Kågström Jack Dongarra Erik Elmroth Jerzy Waśniewski

Copyright information

© 1998 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Coletta, M., Lippert, T., Palazzari, P. (1998). Hyper-systolic implementation of BLAS-3 routines on the APE100/Quadrics machine. In: Kågström, B., Dongarra, J., Elmroth, E., Waśniewski, J. (eds) Applied Parallel Computing Large Scale Scientific and Industrial Problems. PARA 1998. Lecture Notes in Computer Science, vol 1541. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0095323

  • DOI: https://doi.org/10.1007/BFb0095323

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-65414-8

  • Online ISBN: 978-3-540-49261-0

  • eBook Packages: Springer Book Archive
