Performance Analysis of Overheads for Matrix – Vector Multiplication in Cluster Environment

  • Conference paper
Advances in Informatics (PCI 2005)

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 3746)

Abstract

This paper presents a basic parallel implementation of matrix – vector multiplication and a variation of it. We evaluated and compared the performance of the two implementations on a cluster of workstations using the Message Passing Interface (MPI) library. The experimental results demonstrate that the basic implementation achieves lower performance than the variation. Further, we analyzed the several classes of overhead that contribute to the lower performance of the basic implementation. These analyses identified the cost of reading data from disk and the communication cost as the primary factors affecting the performance of the basic parallel matrix – vector implementation. Finally, we present a performance model for estimating the performance of the two proposed matrix – vector implementations on a cluster of heterogeneous workstations.
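The abstract does not spell out either implementation, so the following is only a minimal sketch of one common way to parallelize y = A x: a row-block decomposition in which each worker multiplies its own block of rows by the full vector. It is written in plain Python and runs the "workers" one after another rather than over MPI. The function names (`split_rows`, `local_matvec`, `parallel_matvec`) and the speed-proportional row split are illustrative assumptions, not the authors' code or their performance model.

```python
def split_rows(n_rows, speeds):
    """Assign contiguous row ranges to workers in proportion to their speeds.

    `speeds` is a hypothetical list of relative worker speeds; a faster
    worker receives a proportionally larger block of rows.
    """
    total = sum(speeds)
    bounds = [0]
    acc = 0
    for s in speeds:
        acc += s
        bounds.append(round(n_rows * acc / total))
    return [(bounds[i], bounds[i + 1]) for i in range(len(speeds))]


def local_matvec(rows, x):
    """Work done by one worker: dot product of each local row with x."""
    return [sum(a * b for a, b in zip(row, x)) for row in rows]


def parallel_matvec(A, x, speeds):
    """Simulate the row-block scheme sequentially (no real MPI calls).

    In an MPI program the row blocks would be distributed to the workers,
    x would be broadcast, and the partial results gathered on one process.
    """
    y = []
    for lo, hi in split_rows(len(A), speeds):
        y.extend(local_matvec(A[lo:hi], x))
    return y


A = [[1, 2], [3, 4], [5, 6], [7, 8]]
x = [1, 1]
y = parallel_matvec(A, x, speeds=[1, 3])
```

With relative speeds `[1, 3]`, the first worker handles one row and the second handles three. In a real cluster run, how the row blocks reach each worker (read from disk or sent over the network) is where the overheads named in the abstract would arise.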

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Michailidis, P.D., Stefanidis, V., Margaritis, K.G. (2005). Performance Analysis of Overheads for Matrix – Vector Multiplication in Cluster Environment. In: Bozanis, P., Houstis, E.N. (eds) Advances in Informatics. PCI 2005. Lecture Notes in Computer Science, vol 3746. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11573036_23

  • DOI: https://doi.org/10.1007/11573036_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-29673-7

  • Online ISBN: 978-3-540-32091-3

  • eBook Packages: Computer Science, Computer Science (R0)
