Abstract
Traditional microprocessors are becoming increasingly inefficient for a growing range of applications that mainly process data streams. These applications have two characteristics: they contain many computation-intensive tasks, and those tasks account for more than 90% of total running time. Coarse-grained reconfigurable computation is well suited to these tasks and can achieve very high performance. This paper presents an implementation of fast parallel complex FFT on CTaiJi, a 16-bit reconfigurable computation platform that targets streamed applications such as multimedia and DSP (digital signal processing). The proposed mapping comprises a fast store-address transformation and a configuration of the PEA (processing element array) to suit the FFT. Moreover, the performance scales with FFT size. Since no functionality is specifically tailored to FFT, the results demonstrate the capability of the CTaiJi architecture to extract parallelism from streamed applications. Further rationales are given based on the concepts of scalar operand networks.
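The abstract does not spell out the store-address transformation or the PEA configuration. As a rough illustration only, the sketch below assumes the address transformation is the bit-reversal permutation commonly used by radix-2 FFTs, and it runs sequentially in floating point rather than on a 16-bit processing element array. The names `bit_reverse` and `fft_radix2` are hypothetical and do not come from the paper.

```python
import cmath


def bit_reverse(index, bits):
    """Reverse the lowest `bits` bits of `index`."""
    result = 0
    for _ in range(bits):
        result = (result << 1) | (index & 1)
        index >>= 1
    return result


def fft_radix2(samples):
    """Iterative radix-2 decimation-in-time FFT (length must be a power of two)."""
    n = len(samples)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a power of two")
    bits = n.bit_length() - 1

    # Assumed address transformation: output slot i takes the input
    # found at the bit-reversed address of i.
    data = [complex(samples[bit_reverse(i, bits)]) for i in range(n)]

    size = 2
    while size <= n:
        half = size // 2
        step = cmath.exp(-2j * cmath.pi / size)  # twiddle factor increment
        # Butterflies within one stage are independent of each other,
        # which is the parallelism an array of processing elements could exploit.
        for start in range(0, n, size):
            w = 1 + 0j
            for k in range(half):
                a = data[start + k]
                b = data[start + k + half] * w
                data[start + k] = a + b
                data[start + k + half] = a - b
                w *= step
        size *= 2
    return data
```

Each stage doubles the butterfly span, so an N-point transform needs log2(N) stages of N/2 butterflies. This is one way to see why performance can scale with FFT size when the butterflies of a stage are spread across parallel elements.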
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Song, L., Jiang, Y. (2005). Fast Parallel FFT on CTaiJi: A Coarse-Grained Reconfigurable Computation Platform. In: Pan, Y., Chen, D., Guo, M., Cao, J., Dongarra, J. (eds) Parallel and Distributed Processing and Applications. ISPA 2005. Lecture Notes in Computer Science, vol 3758. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11576235_24
DOI: https://doi.org/10.1007/11576235_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29769-7
Online ISBN: 978-3-540-32100-2
eBook Packages: Computer Science, Computer Science (R0)