Analyzing the execution of sparse matrix-vector product on the Finisterrae SMP-NUMA system

Pichel, Juan C.; Lorenzo, Juan A.; Heras, Dora B.; Cabaleiro, Jose C.; Pena, Tomás F.

doi:10.1007/s11227-010-0392-4

Published: 16 February 2010

The Journal of Supercomputing, volume 58, pages 195–205 (2011)

Juan C. Pichel¹,
Juan A. Lorenzo²,
Dora B. Heras²,
Jose C. Cabaleiro² &
Tomás F. Pena²

Abstract

In this paper, the sparse matrix-vector product (SpMV) is evaluated on the FinisTerrae SMP-NUMA supercomputer. The particularities of its architecture make tuning SpMV especially relevant because of their significant impact on performance. First, we estimate the influence of data and thread allocation. Because SpMV has indirect and irregular memory access patterns, we also study the influence of the memory hierarchy on performance. Based on the behavior observed in the study, a set of optimizations specially tuned for FinisTerrae was successfully applied to SpMV. Noticeable improvements are obtained in comparison with the naïve SpMV implementation.
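To make the "indirect and irregular memory access patterns" concrete, the following is a minimal sketch of a straightforward SpMV kernel. It assumes the compressed sparse row (CSR) storage format, which the abstract does not name. The function name `spmv_csr`, the array names, and the 3×3 example matrix are illustrative and are not taken from the paper.

```python
from typing import List, Sequence


def spmv_csr(row_ptr: Sequence[int], col_idx: Sequence[int],
             values: Sequence[float], x: Sequence[float]) -> List[float]:
    """Compute y = A * x for a matrix A stored in CSR form (assumed format)."""
    n_rows = len(row_ptr) - 1
    y = [0.0] * n_rows
    for i in range(n_rows):
        acc = 0.0
        # The nonzeros of row i occupy positions row_ptr[i] .. row_ptr[i + 1] - 1.
        for k in range(row_ptr[i], row_ptr[i + 1]):
            # Indirect access: the index into x is itself read from col_idx,
            # so reads of x follow the sparsity pattern rather than a fixed stride.
            acc += values[k] * x[col_idx[k]]
        y[i] = acc
    return y


# Illustrative matrix (not from the paper):
# [[4, 0, 1],
#  [0, 3, 0],
#  [2, 0, 5]]
row_ptr = [0, 2, 3, 5]
col_idx = [0, 2, 1, 0, 2]
values = [4.0, 1.0, 3.0, 2.0, 5.0]
x = [1.0, 2.0, 3.0]

y = spmv_csr(row_ptr, col_idx, values, x)
print(y)
```

In this layout `values` and `col_idx` are streamed sequentially, while the reads of `x` depend on the column indices stored in the matrix. That data-dependent access is the kind of pattern that makes the memory hierarchy relevant to SpMV performance.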

Author information

Authors and Affiliations

Galicia Supercomputing Center (CESGA), Santiago de Compostela, Spain
Juan C. Pichel
Electronics and Computer Science Dpt., Univ. of Santiago de Compostela, Santiago de Compostela, Spain
Juan A. Lorenzo, Dora B. Heras, Jose C. Cabaleiro & Tomás F. Pena

Corresponding author

Correspondence to Juan C. Pichel.

About this article

Cite this article

Pichel, J.C., Lorenzo, J.A., Heras, D.B. et al. Analyzing the execution of sparse matrix-vector product on the Finisterrae SMP-NUMA system. J Supercomput 58, 195–205 (2011). https://doi.org/10.1007/s11227-010-0392-4

Issue Date: November 2011
