Abstract
In this paper, a novel architecture for sparse matrix multiplication is proposed. The architecture is suitable for implementation in specific environments such as dataflow engines. In order to avoid multiple streaming of elements from the host, we propose the architecture which buffers the elements from the input stream in on-chip memory in the form of pages. In the case of sparse matrices, the architecture processes only pages with non-zero elements. The proposed architecture allows replication of its blocks in order to parallelize the computation. The architecture is implemented on Maxeler dataflow engine based on Virtex 5 FPGA. The implementation results are given.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Milovanovic, I., Bekakos, M.P., Tselepis, I.N., Milovanovic, E.I.: Forty-three ways of systolic matrix multiplication. Int. J. Comput. Math. 87(6), 1264–1276 (2010)
Smith, T.M., Van De Geijn, R., Smelyanskiy, M., Hammond, J.R., Van Zee, F.G.: Anatomy of high-performance many-threaded matrix multiplication. In: 28th IEEE International Parallel and Distributed Processing Symposium, pp. 1049–1059. IEEE (2014)
Matam, K., Indarapu, S., Kothapalli, K.: Sparse matrix-matrix multiplication on modern architectures. In: 19th International Conference on High Performance Computing (HiPC), pp. 1–10. IEEE (2012)
Saule, E., Kaya, K., Çatalyürek, Ü.V.: Performance evaluation of sparse matrix multiplication kernels on Intel Xeon Phi. In: Wyrzykowski, R., Dongarra, J., Karczewski, K., Waśniewski, J. (eds.) PPAM 2013, Part I. LNCS, vol. 8384, pp. 559–570. Springer, Heidelberg (2014)
Weifeng, L., Vinter, B.: An efficient GPU general sparse matrix-matrix multiplication for irregular data. In: IEEE 28th International Parallel and Distributed Processing Symposium, pp. 370–381. IEEE (2014)
Yavits, L., Morad, A., Ginosar, R.: Sparse matrix multiplication on an associative processor. IEEE Trans. Parallel Distrib. Syst. (2014)
Ciric, V., Cvetkovic, A., Simic, V., Milentijevic, I.: Tropical algebra based framework for error propagation analysis in systolic arrays. Elsevier Appl. Math. Comput. 225, 512–525 (2013)
Maxeler Technologies, MaxCompiler documentation, Version 2014.1, Maxeler Technologies
Acknowledgment
The research was supported in part by the Serbian Ministry of Education, Science and Technological Development (Project TR32012).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Simic, V., Ciric, V., Savic, N., Milentijevic, I. (2016). Sparse Matrix Multiplication on Dataflow Engines. In: Wyrzykowski, R., Deelman, E., Dongarra, J., Karczewski, K., Kitowski, J., Wiatr, K. (eds) Parallel Processing and Applied Mathematics. PPAM 2015. Lecture Notes in Computer Science(), vol 9573. Springer, Cham. https://doi.org/10.1007/978-3-319-32149-3_3
Download citation
DOI: https://doi.org/10.1007/978-3-319-32149-3_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-32148-6
Online ISBN: 978-3-319-32149-3
eBook Packages: Computer ScienceComputer Science (R0)