Abstract
The additional discrete transforms from cosine (DCT) and sine (DST) families that come with the Adaptive Multiple Transform (AMT) approach is one of the major enhancements involved in the new Versatile Video Coding (VVC) standard. They have increasingly brought additional complexity compared to HEVC standard. The transform module is one of the most consuming stages in terms of time and hardware resources. This paper focuses on the optimization of the DST-VII transform. It deals with a multicriteria optimization algorithm for approximate computing that aims at the identification of the optimal approximation of the DST-VII according to several approximation measures. The resulting transform matrix has extremely low arithmetic complexity as well as close proximity to the exact DST-VII. Moreover, hardware synthesis results denote that the simplified design of DST-VII consumes only a third of the hardware resources used by the original algorithm. Experimental results obtained from joint exploration model simulations show a slight bit-rate increase while maintaining almost the same video quality. Such results confirm the effectiveness of the proposed approximate transform. The latter yields good performance in terms of computational complexity reduction and proximity to the exact transform while exhibiting a video coding performance comparable to the original algorithm.



Similar content being viewed by others
References
Park, W., Lee, B., Kim, M.: Fast computation of integer DCT-V, DCT-VIII, and DST-VII for video coding. IEEE Trans. Image Process. 28(12), 5839–5851 (2019)
Garrido, M.J., Pescador, F., ChavarrÃas, M., Lobo, P.J., Sanz, C.: A high performance FPGA-based architecture for the future video coding adaptive multiple core transform. IEEE Trans. Consum. Electron. 64(1), 53–60 (2018). https://doi.org/10.1109/TCE.2018.2812459
Chen, J., Chen, Y., Karczewicz, M., Li, X., Liu, H., Zhang, L., Zhao, X.: Coding tools investigation for next generation video coding based on HEVC. In: Applications of Digital Image Processing XXXVIII, International Society for Optics and Photonics, vol. 9599, p. 95991B (2015)
Bross, B.: Versatile Video Coding. https://www.digitalmedia.fraunhofer.de/content/dam/dcinema/en/documents/ibc2020/2020_Flyer_Versatile%20Video%20Coding_final.pdf, online; Accessed January 2021 (2020)
Chen, J., Alshina, E., Sullivan, GJ., Ohm, J., Boyce, J.: Algorithm description of joint exploration test model 7 (jem 7). Joint Video Exploration Team (JVET) of ITU-T SG 16 (2017)
Mohammadi, AA., Alizadeh, MS., Sharifkhani, M.: Jvet encoder complexity analysis. In: 2018 IEEE International Symposium on Circuits and Systems (ISCAS), pp 1–5, https://doi.org/10.1109/ISCAS.2018.8351442 (2018)
Joint-Video-Exploration-Team (2019) JEVT Software Repository. https://jvet.hhi.fraunhofer.de/svn/svn_HMJEMSoftware/branches/HM-16.6-JEM-7.1-dev/, online; Accessed March 2019
Haweel, T.I.: A new square wave transform based on the DCT. Signal Process. 81(11), 2309–2319 (2001)
Cintra, R.J., Bayer, F.M.: A DCT approximation for image compression. IEEE Signal Process. Lett. 18(10), 579–582 (2011)
Bayer, F.M., Cintra, R.J., Edirisuriya, A., Madanayake, A.: A digital hardware fast algorithm and FPGA-based prototype for a novel 16-point approximate DCT for image compression applications. Meas. Sci. Technol. 23(11), 114010 (2012)
Bouguezel, S., Ahmad, M.O., Swamy, M.: Binary discrete cosine and Hartley transforms. IEEE Trans. Circuits Syst. I Regul. Pap. 60(4), 989–1002 (2013)
Oliveira, R.S., Cintra, R.J., Bayer, F.M., da Silveira, T.L., Madanayake, A., Leite, A.: Low-complexity 8-point DCT approximation based on angle similarity for image and video coding. Multidimens. Syst. Signal Process. 30(3), 1363–1394 (2019)
Jridi, M., Alfalou, A., Meher, P.K.: Efficient approximate core transform and its reconfigurable architectures for HEVC. J. Real-Time Image Process. 17(2), 329–339 (2020)
Jridi, M., Meher, P.K.: Scalable approximate DCT architectures for efficient HEVC-compliant video coding. IEEE Trans. Circuits Syst. Video Technol. 27(8), 1815–1825 (2017)
Pauchard, Y., Cintra, R.J., Madanayake, A., Bayer, F.M.: Fast computation of residual complexity image similarity metric using low-complexity transforms. IET Image Process. 9(8), 699–708 (2015)
Tablada, C., Bayer, F.M., Cintra, R.J.: A class of DCT approximations based on the Feig–Winograd algorithm. Signal Process. 113, 38–51 (2015)
Oliveira, P.A., Cintra, R.J., Bayer, F.M., Kulasekera, S., Madanayake, A.: Low-complexity image and video coding based on an approximate discrete tchebichef transform. IEEE Trans. Circuits Syst. Video Technol. 27(5), 1066–1076 (2016)
Cintra, R.J., Bayer, F.M., Madanayake, A., Potluri, U.S., Edirisuriya, A.: Fast algorithms and architectures for 8-point dst-ii/dst-vii approximations. J. Circuits Syst. Comput. 26(03), 1750045 (2017)
Bayer, F.M., Cintra, R.J.: Dct-like transform for image compression requires 14 additions only. Electron. Lett. 48(15), 919–921 (2012)
Potluri, U., Madanayake, A., Cintra, R., Bayer, F., Rajapaksha, N.: Multiplier-free DCT approximations for RF multi-beam digital aperture-array space imaging and directional sensing. Meas. Sci. Technol. 23(11), 114003 (2012)
Kammoun, A., Hamidouche, W., Philippe, P., Déforges, O., Belghith, F., Masmoudi, N., Nezan, J.F.: Forward-inverse 2d hardware implementation of approximate transform core for the vvc standard. IEEE Trans. Circuits Syst. Video Technol. (2019)
Britanak, V., Yip, P.C., Rao, K.R.: Discrete cosine and sine transforms: general properties, fast algorithms and integer approximations. Elsevier, Amsterdam (2010)
Kober, V., Ovseyevich, I.: Image restoration with sliding sinusoidal transforms. Pattern Recognit. Image Anal. 18(4), 649–653 (2008)
Zhang, Z., Zhao, X., Li, X., Li, Z., Liu, S.: Fast adaptive multiple transform for versatile video coding. In: 2019 Data Compression Conference (DCC), IEEE, pp. 63–72 (2019)
Chivukula, RK., Reznik, YA.: Fast computing of discrete cosine and sine transforms of types vi and vii. In: Applications of Digital Image Processing XXXIV, International Society for Optics and Photonics, vol. 8135, p. 813505 (2011)
Mert, A.C., Kalali, E., Hamzaoglu, I.: High performance 2d transform hardware for future video coding. IEEE Trans. Consum. Electron. 63(2), 117–125 (2017)
Bossen, F., Boyce, J., Suehring, K., Li, X., Seregin, V.: Jvet common test conditions and software reference configurations. JVET-K1010-v1 (2018)
Bjontegaard, G.: Calculation of average psnr differences between rd-curves (2001)
Mentor-modelsim (2019) functional-verification-tool-web. https://www.mentor.com/products/fv/modelsim
Intel-FPGA (2019) download-center. https://www.altera.com/downloads/downloadcenter. html
Intel/Altera (2018) Intel-arria-10-device-overview. https://www.altera.com/documentation/sam1403480274650.html
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Ben Jdidia, S., Belghith, F., Jridi, M. et al. A multicriteria optimization of the discrete sine transform for versatile video coding standard. SIViP 16, 329–337 (2022). https://doi.org/10.1007/s11760-021-01925-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-021-01925-2