Skip to main content

Advertisement

Log in

Low-Power Application-Specific Processor for FFT Computations

  • Published:
Journal of Signal Processing Systems Aims and scope Submit manuscript

Abstract

In this paper, a processor architecture tailored for radix-4 and mixed-radix FFT computations is described. The processor has native support for power-of-two transform sizes. Several optimizations have been used to improve the energy-efficiency of the processor and experiments show that a programmable solution can possess energy-efficiency comparable to fixed-function ASICs.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12
Figure 13

Similar content being viewed by others

References

  1. Baas, B. M. (1999). A low-power, high-performance, 1024-point FFT processor. IEEE Journal of Solid State Circuits, 43(3), 380–387.

    Article  Google Scholar 

  2. Baek, J. H., Kim, S. D., & Sunwoo, M. H. (2008). SPOCS: Application specific signal processor for OFDM communication systems. Journal of Signal Processing Systems, 53(3), 383–397.

    Article  Google Scholar 

  3. Cohen, D. (1976). Simplified control of FFT hardware. IEEE Transactions on Acoustics, Speech, and Signal Processing, 24(6), 577–579.

    Article  Google Scholar 

  4. Corporaal, H. (1997). Microprocessor architectures: From VLIW to TTA. Chichester: Wiley.

    Google Scholar 

  5. Deleganes, M., Douglas, J., Kommandur, B., & Patyra, M. (2002). Designing a 3 GHz, 130 nm, Intel Pentium4 processor. In Digest technical papers symp. VLSI circuits (pp. 230–233). Honolulu, HI.

  6. Granata, J., Conner, M., & Tolimieri, R. (1992). Recursive fast algorithms and the role of the tensor product. IEEE Transactions on Signal Processing,40(12), 2921–2930.

    Article  MATH  Google Scholar 

  7. Han, W., Erdogan, A. T., Arslan, T., & Hasan, M. (2008). High-performance low-power FFT cores. ETRI Journal, 30(3), 451–460.

    Article  Google Scholar 

  8. Heikkinen, J., & Takala, J. (2007). Effects of program compression. Journal of Systems Architecture,53(10), 679–688.

    Article  Google Scholar 

  9. Hung, C. H., Chen, S. G., & Chen, K. L. (2004). Design of an efficient variable-length FFT processor. In Proc. IEEE ISCAS (Vol. 2, pp. 833–836). Vancouver, Canada.

  10. Intel: StrongARM SA-110 microprocessor for portable applications brief datasheet (1999).

  11. Lee, H. Y., & Park, I. C.: Balanced binary-tree decomposition for area-efficient pipelined FFT processing. IEEE Transactions on Circuits and Systems, 54(4), 889–900.

  12. Li, X., Lai, Z., & Cui, J. (2007). A low-power and small area FFT processor for OFDM demodulator. IEEE Transactions on Consumer Electronics, 53(2), 274–277.

    Article  MATH  Google Scholar 

  13. Lim, S. Y., & Crosland, A. (2004). Implementing FFT in an FPGA co-processor. In Proc. int. embedded solutions event(pp. 230–233). Santa Clara, CA.

  14. Lin, Y. T., Tsai, P. Y., & Chiueh, T. D. (2005). Low-power variable-length fast Fourier transform processor. IEE Proceedings on Computer and Digital Techniques, 152(4), 499–506.

    Article  Google Scholar 

  15. Lin, Y. W., Liu, H. Y., & Lee, C. Y. (2004). Dynamic scaling FFT processor for DVB-T applications. IEEE Journal of Solid-State Circuits, 39(11), 2005–2013.

    Article  Google Scholar 

  16. Liu, G., & Feng, Q. (2007). ASIC design of low-power reconfigurable FFT processor. In Int. conf. ASIC (pp. 44–47). Guilin, China.

  17. Patel, K., Macii, E., & Poncino, M. (2004). Energy-performance tradeoffs for the shared memory in multi-processor systems-on-chip. In Proc. IEEE ISCAS (Vol. 2, pp. 361–364). Vancouver, BC, Canada.

  18. Pitkänen, T., Mäkinen, R., Heikkinen, J., Partanen, T., & Takala, J. (2006). Low-power, high-performance TTA processor for 1,024-point fast Fourier transform. In S. Vassiliadis, S. Wong, & T. D. Hämäläinen (Eds.), Embedded computer systems: Architectures, modeling, and simulation: Proc. 6th int. workshop SAMOS 2006. LNCS (Vol. 4017, pp. 227–236). Berlin: Springer.

    Google Scholar 

  19. Pitkänen, T., Mäkinen, R., Heikkinen, J., Partanen, T., & Takala, J. (2006). Transport triggered architecture processor for mixed-radix FFT. In Conf. record asilomar conf. signals syst. comput. (pp. 84–88). Pacific Grove, CA.

  20. Pitkänen, T., Partanen, T., & Takala, J. (2007). Low-power twiddle factor unit for FFT computation. In S. Vassiliadis, M. Bereković, & T. D. Hämäläinen (Eds.), Embedded computer systems: Architectures, modeling, and simulation: Proc. int. workshop SAMOS 2007, LNCS. (Vol. 4599, pp. 273–282). Berlin: Springer.

    Google Scholar 

  21. Pitkänen, T., & Takala, J. (2009). Low-power application-specific processor for FFT computations. In Proc. IEEE ICASSP (pp. 593–596). Taipei, Taiwan.

  22. Pitkänen, T., Tanskanen, J. K., Mäkinen, R., & Takala, J. (2009). Parallel memory architecture for application-specific instruction-set processors. Journal of Signal Processing Systems, 57(1), 21–32.

    Article  Google Scholar 

  23. Rabiner, L. R., & Gold, B. (1975). Theory and application of digital signal processing. Englewood Cliffs: Prentice Hall.

    Google Scholar 

  24. Rixner, S., Dally, W. J., Kapasi, U. J., Khailany, B., Lopez-Lagunas, A., Mattson, P. R., et al. (1998). A bandwidth-efficient architecture for media processing. In Proc. ann. ACM/IEEE int. symp. microarchitecture (pp. 3–13). Dallas, TX.

  25. Saleh, H., Mohd, B. J., Aziz, A., & Swartzlander Jr., E. E. (2007). Contention-free switch-based implementation of 1024-point radix-2 Fourier transform engine. In Proc. IEEE int. conf. comput. design (pp. 7–12). Lake Tahoe, CA, USA.

  26. Suleiman, A., Saleh, H., Hussein, A., & Akopian, D. (2008). A family of scalable FFT architectures and an implementation of 1024-point radix-2 FFT for real-time communications. In Proc. IEEE int. conf. comput. design (pp. 321–327). Lake Tahoe, CA, USA.

  27. Tampere University of Technology (2008). TTA-based codesign environment. http://tce.cs.tut.fi/.

  28. Texas Instruments, Inc., Dallas, TX: TMS320C64x DSP Library programmer’s reference (2003).

  29. Wang, A., & Chandrakasan, A. (2005). A 180-mV subthreshold FFT processor using a minimum energy design methodology. IEEE Journal of Solid-State Circuits, 40(1), 310–319.

    Article  Google Scholar 

  30. Wang, S. S., & Li, C. S. (2008). An area-efficient design of variable-length fast Fourier transform processor. Journal of Signal Processing Systems, 51(3), 245–256.

    Article  Google Scholar 

  31. Wey, C. L., Lin, S. Y., Tang, W. C., & Shiue, M. T. (2007). High-speed, low cost parallel memory-based FFT processors for OFDM applications. In IEEE int. conf. electronics circ. syst. (pp. 783–787). Marrakech, Marocco.

  32. Yang, Y. X., Li, J. F., Liu, H. N., & Wey, C. L. (2007). Design of cost-efficient memory-based FFT processors using single-port memories. In IEEE int. SOC conf. (pp. 321–327). Hsin Chu, Taiwan.

  33. Zhao, Y., Erdogan, A. T., & Arslan, T. (2005). A low-power and domain-specific reconfigurable FFT fabric for system-on-chip applications. In Proc. IEEE par. distributed process. symp. reconf. logic. Denver, CO.

Download references

Acknowledgement

This work has been supported in part by the Academy of Finland under funding decision 205743.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Teemu Oskari Pitkänen.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Pitkänen, T.O., Takala, J. Low-Power Application-Specific Processor for FFT Computations. J Sign Process Syst 63, 165–176 (2011). https://doi.org/10.1007/s11265-010-0528-z

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11265-010-0528-z

Keywords

Navigation