Design of the Processors for Fast Cosine and Sine Fourier Transforms

Tsmots, Ivan; Rabyk, Vasyl; Kryvinska, Natalia; Yatsymirskyy, Mykhaylo; Teslyuk, Vasyl

doi:10.1007/s00034-022-02012-8

Published: 11 April 2022

Volume 41, pages 4928–4951 (2022)

Circuits, Systems, and Signal Processing

Abstract

To solve a large number of digital signal processing problems, such as those arising in on-board radar or hydro-acoustic systems, it is necessary to perform discrete trigonometric transforms over intensive data flows in real time under constraints on size and power consumption. To meet these requirements, a hardware implementation in the form of a VLSI circuit has been proposed. In particular, we improve an algorithm for the fast cosine and sine Fourier transforms with a focus on parallel-streaming hardware implementation. A flow graph of the improved algorithm has been developed on the basis of addition, subtraction and multiplication of real numbers, using the relation scheme of algorithms. A linear projection of the improved algorithm for fast cosine and sine Fourier transforms onto the axis parallel to the data transmission has been obtained. This makes it possible to change the type and dimensions of the transforms. Further, we develop the structure of a 2-4-8-16-point processor for fast cosine and sine Fourier transforms. Such an implementation reduces the size and energy consumption of the processor while performing the transforms in real time.
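
As a rough illustration of what a fast cosine and sine Fourier transform built only from real additions, subtractions and multiplications can look like, the sketch below uses a textbook radix-2 decimation-in-time recursion. It is not the improved algorithm of the paper, whose details are not given in the abstract. The definitions C[k] = sum of x[n]·cos(2πnk/N) and S[k] = sum of x[n]·sin(2πnk/N) are an assumption, and the function names `direct_cos_sin` and `fast_cos_sin` are hypothetical.

```python
import math


def direct_cos_sin(x):
    """Reference O(N^2) evaluation of the assumed definitions:
    C[k] = sum_n x[n] * cos(2*pi*n*k/N), S[k] = sum_n x[n] * sin(2*pi*n*k/N)."""
    n = len(x)
    c = [sum(x[j] * math.cos(2 * math.pi * j * k / n) for j in range(n))
         for k in range(n)]
    s = [sum(x[j] * math.sin(2 * math.pi * j * k / n) for j in range(n))
         for k in range(n)]
    return c, s


def fast_cos_sin(x):
    """Radix-2 decimation-in-time recursion using real arithmetic only.
    Illustrative sketch; the length must be a power of two (e.g. 2, 4, 8, 16)."""
    n = len(x)
    if n < 1 or n & (n - 1):
        raise ValueError("length must be a power of two")
    if n == 1:
        return [x[0]], [0.0]

    # Transforms of the even-indexed and odd-indexed samples (length N/2 each).
    ce, se = fast_cos_sin(x[0::2])
    co, so = fast_cos_sin(x[1::2])

    half = n // 2
    c = [0.0] * n
    s = [0.0] * n
    for k in range(half):
        theta = 2 * math.pi * k / n
        cw, sw = math.cos(theta), math.sin(theta)
        # Rotate the odd-sample pair (co, so) by the angle theta.
        rc = cw * co[k] - sw * so[k]
        rs = sw * co[k] + cw * so[k]
        # Butterfly: the rotation angle for k + N/2 differs by pi, so the
        # rotated terms simply change sign.
        c[k] = ce[k] + rc
        c[k + half] = ce[k] - rc
        s[k] = se[k] + rs
        s[k + half] = se[k] - rs
    return c, s
```

Each butterfly here costs four real multiplications and six real additions or subtractions. The same stage structure is reused for every power-of-two length, which is the kind of regularity that lets one processor serve several transform sizes.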

Data availability

Our manuscript has no associated data.

Author information

Authors and Affiliations

Department of Automated Control Systems, Lviv Polytechnic National University, Lviv, 79013, Ukraine
Ivan Tsmots & Vasyl Teslyuk
Department of RadioPhysics and Computer Technologies, Ivan Franko National University of Lviv, 1, Universytetska Str., Lviv, 79000, Ukraine
Vasyl Rabyk
Department of Information Systems, Faculty of Management, Comenius University, Bratislava, Bratislava, 25 82005, Slovakia
Natalia Kryvinska
Institute of Information Technology, Lodz University of Technology, Wolczanska 215 Street, Lodz, Poland
Natalia Kryvinska & Mykhaylo Yatsymirskyy

Corresponding authors

Correspondence to Natalia Kryvinska or Vasyl Teslyuk.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest/competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Tsmots, I., Rabyk, V., Kryvinska, N. et al. Design of the Processors for Fast Cosine and Sine Fourier Transforms. Circuits Syst Signal Process 41, 4928–4951 (2022). https://doi.org/10.1007/s00034-022-02012-8

Received: 02 November 2020
Revised: 11 March 2022
Accepted: 11 March 2022
Published: 11 April 2022
Issue Date: September 2022
DOI: https://doi.org/10.1007/s00034-022-02012-8
