Low-complexity QTMT partition based on deep neural network for Versatile Video Coding

Abdallah, Bouthaina; Belghith, Fatma; Ben Ayed, Mohamed Ali; Masmoudi, Nouri

doi:10.1007/s11760-020-01843-9

Original paper
Published: 19 January 2021

Volume 15, pages 1153–1160, (2021)

Signal, Image and Video Processing

Bouthaina Abdallah¹ (ORCID: orcid.org/0000-0002-1400-9431), Fatma Belghith¹, Mohamed Ali Ben Ayed¹ & Nouri Masmoudi¹

Abstract

Versatile Video Coding (VVC), the newest standard for future video coding, is currently under development. It aims to improve coding performance over its predecessor, High Efficiency Video Coding (HEVC), at the cost of a large increase in coding complexity. The VVC partition structure is mainly based on the quadtree with nested multi-type tree (QTMT) block scheme. This scheme allows more flexible block partitioning and higher encoding efficiency, but it greatly increases coding complexity. To address this issue, a fast QTMT intra partition algorithm, based on a deep neural network named Early Terminated Hierarchical Convolution Neural Network, is applied to predict the QT partition structure of each \(64\times 64\) block. The proposed algorithm determines the QTMT partition structure by deciding whether to split or skip the corresponding coding unit (CU), in order to obtain the partition of the \(128\times 128\) Coding Tree Unit (CTU). Compared with the reference VVC software VTM-3.0, the proposed intra partition method saves up to 32.96% of encoding time for Ultra High Definition video sequences. Across all video sequences, the average time saving is 24.49%. This speedup comes with an increase of 4.18% in BDBR and a decrease of 0.18 dB in BDPSNR.
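The split-or-skip decision described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the predictor `toy_split_prob`, the 0.5 threshold, the 8-sample minimum block size, and the assumption that the \(128\times 128\) CTU is first divided into four \(64\times 64\) blocks are all hypothetical choices made for illustration, and multi-type tree (binary and ternary) splits are not modeled. The sketch only shows the early-termination idea: once a block is predicted as "skip", none of its sub-blocks are examined.

```python
from typing import Callable, List, Tuple

Block = Tuple[int, int, int]  # (x, y, size) of a square block, in samples
Predictor = Callable[[int, int, int], float]  # returns a split probability


def qt_partition(x: int, y: int, size: int, split_prob: Predictor,
                 min_size: int = 8, threshold: float = 0.5) -> List[Block]:
    """Return the leaf blocks of the quadtree rooted at (x, y, size).

    Early termination: if the predictor says "skip" (probability below
    the threshold), the block becomes a leaf and its children are never
    evaluated.
    """
    if size <= min_size or split_prob(x, y, size) < threshold:
        return [(x, y, size)]
    half = size // 2
    leaves: List[Block] = []
    for dy in (0, half):
        for dx in (0, half):
            leaves.extend(qt_partition(x + dx, y + dy, half,
                                       split_prob, min_size, threshold))
    return leaves


def ctu_partition(split_prob: Predictor, ctu_size: int = 128,
                  block_size: int = 64) -> List[Block]:
    """Assemble a CTU partition from per-block quadtree predictions.

    Assumption: the CTU is always divided into block_size x block_size
    blocks before the predictor is consulted.
    """
    leaves: List[Block] = []
    for y in range(0, ctu_size, block_size):
        for x in range(0, ctu_size, block_size):
            leaves.extend(qt_partition(x, y, block_size, split_prob))
    return leaves


def toy_split_prob(x: int, y: int, size: int) -> float:
    """Hypothetical stand-in for a trained network: split only the
    blocks anchored at the CTU's top-left corner."""
    return 1.0 if (x == 0 and y == 0) else 0.0


leaves = ctu_partition(toy_split_prob)
for block in leaves:
    print(block)
```

In a real encoder the stand-in predictor would be replaced by the trained network's output for the block's luma samples, and the resulting partition would restrict the rate-distortion search instead of replacing it outright.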

Author information

Authors and Affiliations

Electronics and Information Technology Laboratory, National Engineering School of Sfax, University of Sfax, Sfax, Tunisia
Bouthaina Abdallah, Fatma Belghith, Mohamed Ali Ben Ayed & Nouri Masmoudi

Corresponding author

Correspondence to Bouthaina Abdallah.

About this article

Cite this article

Abdallah, B., Belghith, F., Ben Ayed, M.A. et al. Low-complexity QTMT partition based on deep neural network for Versatile Video Coding. SIViP 15, 1153–1160 (2021). https://doi.org/10.1007/s11760-020-01843-9

Received: 06 September 2020
Revised: 01 December 2020
Accepted: 10 December 2020
Published: 19 January 2021
Issue Date: September 2021
DOI: https://doi.org/10.1007/s11760-020-01843-9
