Abstract
Applications such as electrical power grid operation and planning rely on high-performance linear solvers involving large sparse matrices. Previous custom sparse solver hardware implemented on a Field Programmable Gate Array (FPGA) has shown an 8-fold performance gain over state-of-the-art sparse software packages. Generally, the drawback of hardware solvers lies in their design complexity. This paper presents an alternative architecture in which the host CPU software computes the main program and caches data that are streamed to a pipelined hardware, implemented on an FPGA, for part of the computation. With the lower-upper triangular decomposition solver, the hardware computes the sparse matrix row addition operation, called merging. The prototype merge core processes data at the optimum rate, i.e., the FPGA clock frequency. With the proposed triple-buffer bus architecture, the core is projected to attain a data rate of 250 MHz on the Virtex 6 FPGA in comparison to the average 200MHz for the merge software subroutine on a general-purpose processor.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Albur, A., Expsito, A.: Power System State Estimation: Theory and Implementation. Marcel Dekker, New York (2004)
Chagnon, T., Johnson, J., Vachranukunkiet, P., Nagvajara, P., Nwankpa, C.: Sparse LU Decomposition Using FPGA. In: PARA 2008: 9th International Workshop on State-of-the-Art in Scientific and Parallel Computing, Trondheim, Norway (May 2008)
Chagnon, T.: Architectural Support for Direct Sparse LU Algorithms. Masters Thesis, Drexel University (2010)
Vachranukunkiet, P.: Power Flow Computation using Field Programmable Gate Arrays. PhD Thesis, Drexel University (2007)
DRC Computer Corporation. DRC Coprocessor System Users Guide (July 2007)
HyperTransport Consortium. HyperTransport I/O Link Specification: Revision 3.10C (May 2010)
Nagvajara, P., Lin, Z., Nwankpa, C., Johnson, J.: State Estimation Using Sparse Givens Rotation Field Programmable Gate Array. In: Proc. IEEE North America Power System Symposium (September 2007)
Wang, Y., Cunningham, K., Johnson, J., Nagvajara, P.: Singular Value Decomposition Hardware for MIMO: State of the Art and Custom Design. In: Proc. IEEE Reconfigurable Computing and FPGA Conference, December 13-15 (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cunningham, K., Nagvajara, P. (2011). Reconfigurable Stream-Processing Architecture for Sparse Linear Solvers. In: Koch, A., Krishnamurthy, R., McAllister, J., Woods, R., El-Ghazawi, T. (eds) Reconfigurable Computing: Architectures, Tools and Applications. ARC 2011. Lecture Notes in Computer Science, vol 6578. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19475-7_30
Download citation
DOI: https://doi.org/10.1007/978-3-642-19475-7_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19474-0
Online ISBN: 978-3-642-19475-7
eBook Packages: Computer ScienceComputer Science (R0)