Abstract:
In this paper, we present an efficient design approach for the implementation of large size matrix multiplication with wide bit size elements targeting FPGAs. The propose...Show MoreMetadata
Abstract:
In this paper, we present an efficient design approach for the implementation of large size matrix multiplication with wide bit size elements targeting FPGAs. The proposed technique first partitions the input matrices into smaller dimensions, and then segment the word-width of the elements into smaller sections. The segmentation is driven by the architecture of the targeted FPGA platform. A highly optimized scalar signed multiplier has been developed, and used as a basic block to construct a 2 by 2 matrix multiplier on Xilinx' and Altera's FPGAs. The result of the implementations showed that our method has outperformed the techniques utilized by commercial tools, ISE and Quartus, and the balanced word-width decomposition approach to realize the matrix multiplications proposed in. Compared to these approaches we achieved delay reduction range from 7.1% to 28% and area saving range from 6.7% to 32%.
Published in: 2009 16th IEEE International Conference on Electronics, Circuits and Systems - (ICECS 2009)
Date of Conference: 13-16 December 2009
Date Added to IEEE Xplore: 17 February 2010
ISBN Information: