Content-adaptive mode decision for low complexity 3D-HEVC

Song, Wenjun; Dai, Pu; Zhang, Qiuwen

doi:10.1007/s11042-023-14874-4

Content-adaptive mode decision for low complexity 3D-HEVC

Published: 02 March 2023

Volume 82, pages 26435–26450, (2023)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

160 Accesses
3 Citations
1 Altmetric
Explore all metrics

Abstract

3D-high efficiency video coding (3D-HEVC) provides great improvements in coding efficiency of multiview texture image and associated depth map. It inherits the prediction mode of HEVC, and several new coding tools for a better representation of the dependent texture and depth video are also employed by the 3D-HEVC encoder. These give a high coding efficiency, but require significantly high runtime due to huge complexity of mode decision. In this paper, we introduce a content-adaptive mode decision to reduce 3D-HEVC coding complexity. The basic idea of this method is to use the temporal-spatial, inter-view and texture-depth correlations to analyze content properties of treeblock, and adaptive skip some unnecessary prediction modes. Experimental results demonstrate that the proposed scheme can drastically save encoding time with no noticeable loss of rate distortion (RD) performance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Video steganography: recent advances and challenges

Article Open access 04 April 2023

A comprehensive survey on video frame interpolation techniques

Article 04 January 2021

CNN-based intra partitioning process for spatial and SNR scalability for SHVC

Article 18 April 2024

Data availability

Data sharing not applicable to this article as no datasets were generated or analyzed during the current study.

References

Arora M, Kumar M (2021) AutoFER: PCA and PSO based automatic facial emotion recognition. Multimed Tools Appl 80:3039–3049
Article Google Scholar
Arora M, Kumar M, Garg NK (2018) Facial Emotion Recognition System Based on PCA and Gradient Features. National Acad Sci Lett 41:365–368
Article Google Scholar
Bakkouri S, Elyousfi A, Hamout H (2020) Fast CU size and mode decision algorithm for 3D-HEVC intercoding. Multimed Tools Appl 79:6987–7004
Article Google Scholar
Bjontegaard G (2001) Calculation of Average PSNR differences between RD curves. Document ITU-T SG16 Q.6, VCEG Meeting, VCEG-M33, Austin, TX, USA
Chen J, Wang B, Zeng H, Cai C, Ma K (2017) Sum-of-gradient based fast intra coding in 3D-HEVC for depth map sequence (SOG-FDIC). J Vis Commun Image R 48:329–339
Article Google Scholar
Gupta S, Thakur K, Kumar M (2021) 2D-human face recognition using SIFT and SURF descriptors of face's feature regions. Vis Comput 37:447–456
Article Google Scholar
Lei J, Sun J, Pan Z, Kwong S, Duan J, Hou C (2015) Fast mode decision using inter-view and inter-component correlations for multiview depth video coding. IEEE Trans Ind Inf 11(4):978–986
Article Google Scholar
Lei J, Duan J, Wu F, Ling N, Hou C (2018) Fast mode decision based on grayscale similarity and inter-view correlation for depth map coding in 3D-HEVC. IEEE Trans Circuits Syst Vid Technol 28(3):706–718
Article Google Scholar
Merkle P, Muller K, Marpe D, Wiegand T (2016) Depth intra coding for 3D video based on geometric primitives. IEEE Trans Circuits Syst Vid Technol 26(3):570–582
Article Google Scholar
Mora E, Jung J, Cagnazzo M, Pesquet-Popescu B (2014) Initialization, limitation and predictive coding of the depth and texture quadtree in 3D-HEVC video coding. IEEE Trans Circ Syst Vid Technol 24(9):1554–1565
Article Google Scholar
Moura C, Saldanha M, Sanchez G, Marcon C, Porto M, Agostini L (2020) Fast intra mode decision for 3D-HEVC depth map coding using decision trees. In: 2020 27th IEEE International Conference on Electronics, Circuits and Systems (ICECS), pp 1–4
Mueller K, Vetro A (2014) Common test conditions of 3DV core experiments. Joint Collaborative Team on 3D Video Coding Extensions (JCT-3V) document JCT3V-G1100, 7th Meeting: San Jose, CA, USA
Müller K, Schwarz H, Marpe D, Bartnik C, Bosse S, Brust H, Hinz T, Lakshman H, Merkle P, Rhee H, Tech G, Winken M, Wiegand T (2013) 3D high efficiency video coding for multi-view video and depth data. IEEE Trans Circuits Syst Vid Technol 22(9):3366–3378
MathSciNet MATH Google Scholar
Ning X, Duan P, Li W, Zhang S (2020) Real-time 3D face alignment using an encoder-decoder network with an efficient deconvolution layer. IEEE Signal Process Lett 27:1944–1948
Article Google Scholar
Pan Z, Lei J, Zhang Y, Sun X, Kwong S (2016) Fast motion estimation based on content property for low-complexity H.265/HEVC encoder. IEEE Trans Broadcast 62(3):675–684
Article Google Scholar
Saldanha M, Sanchez G, Marcon C, Agostini L (2018) “Fast 3D-HEVC depth maps intra-frame prediction using data mining,” in Proc. 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1738–1742
Shen L, Liu Z, An P, Ma R, Zhang Z (2011) Low-complexity mode decision for MVC. IEEE Trans Circuits Syst Vid Technol 6(21):837–843
Article Google Scholar
Shen L, Zhang Z, Liu Z (2014) Effective CU size decision for HEVC intracoding. IEEE Trans Image Process 23(10):4232–4241
Article MathSciNet MATH Google Scholar
Shen L, Zhang Z, Liu Z (2014) Adaptive inter-mode decision for HEVC jointly utilizing inter-level and spatiotemporal correlations. IEEE Trans Circuits Syst Vid Technol 24(10):1709–1722
Article Google Scholar
Shen L, An P, Zhang Z, Hu Q, Chen Z (2015) A 3D-HEVC fast mode decision algorithm for real-time applications. ACM Trans Multimed Comput Commun Appl 11(3):34
Article Google Scholar
Shen L, Liu K, Feng G, Liu Z, An P (2018) Efficient intra mode selection for depth-map coding utilizing spatiotemporal, inter-component and inter-view correlations in 3D-HEVC. IEEE Trans Image Process 27(9):4195–4206
Article MathSciNet MATH Google Scholar
Singh S, Ahuja U, Kumar M, Kumar K, Sachdeva M (2021) Face mask detection using YOLOv3 and faster R-CNN models: COVID-19 environment. Multimed Tools Appl 80:19753–19768
Article Google Scholar
Stankiewicz O, Domanski M, Dziembowski A, Grzelka A, Mieloch D, Samelak J (2018) A free-viewpoint television system for horizontal virtual navigation. IEEE Trans Multimed 20(8):2182–2195
Article Google Scholar
Suhairi M, Wirawan E, Irfansyah AN (2019) “Complexity Reduction for Multiview HEVC Codec Using FPGA. In: 2019 International Conference of Artificial Intelligence and Information Technology (ICAIIT), pp 163–168
Sullivan GJ, Boyce JM, Ying C, Ohm J-R, Segall CA, Vetro A (2013) Standardized extensions of high efficiency video coding (HEVC). IEEE J Sel Top Sign Process 7(6):1001–1016
Article Google Scholar
Tanimoto M, Fujii T, Suzuki K (2008) View synthesis algorithm in view synthesis reference software 2.0 (VSRS 2.0). ISO/IEC JTC1/SC29/WG11 document M16090, Lausanne, Switzerland
Tech G, Chen Y, Müller K, Ohm J, Vetro A (2016) Overview of the multiview and 3D extensions of high efficiency video coding. IEEE Trans Circuits Syst Vid Technol 26(1):35–49
Article Google Scholar
Tohidypour HR, Pourazad MT, Nasiopoulos P (2014) A low complexity mode decision approach for HEVC-based 3D video coding using a Bayesian method. In: Proc. 2014 IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP), pp 895–899
Tohidypour HR, Pourazad MT, Nasiopoulos P (2016) Online-learning-based complexity reduction scheme for 3D-HEVC. IEEE Trans Circuits Syst Vid Technol 26(10):1870–1883
Article Google Scholar
Walia S, Kumar K, Kumar M, Gao X-Z (2021) Fusion of handcrafted and deep features for forgery detection in digital images. IEEE Access 9:99742–99755
Article Google Scholar
Xu Y, Xing K, Liu H, Zhao T, Kwong S (2021) Flexible complexity optimization in Multiview video coding. IEEE Trans Circ Syst Vid Technol 31:4096–4106
Article Google Scholar
Yeh CH, Li MF, Chen MJ, Chi MC, Huang XX, Chi HW (2014) Fast mode decision algorithm through inter-view rate-distortion prediction for multiview video coding system. IEEE Trans Ind Inf 10(1):594–603
Article Google Scholar
Zhang Q, An P, Zhang Y, Shen L, Zhang Z (2011) Low complexity multiview video plus depth coding. IEEE Trans Consumer Electron 57(4):1857–1865
Article Google Scholar
Zhang Q, Li N, Gan Y (2014) Low complexity mode decision for 3D-HEVC. Sci World J
Zhang N, Zhao D, Chen Y, Lin J, Gao W (2014) Fast encoder decision for texture coding in 3D-HEVC. Signal Process Image Commun 29(9):951–961
Article Google Scholar
Zhang Q, Wang X, Huang X, Su R, Gan Y (2015) Fast mode decision algorithm for 3D-HEVC encoding optimization based on depth information. Digit Signal Process 44(9):37–46
Article Google Scholar
Zhang Q, Zhang N, Wei T, Huang K, Qian X, Gan Y (2017) Fast depth map mode decision based on depth-texture correlation and edge classification for 3D-HEVC. J Vis Commun Image Represent 45(1):170–180
Article Google Scholar
Zhang H, Fu C, Chan Y, Tsang S, Siu W (2018) Probability based depth intra mode skipping strategy and novel VSO metric for DMM decision in 3D-HEVC. IEEE Trans Circuits Syst Vid Techn 28(2):513–527
Article Google Scholar

Download references

Funding

This work was supported in part by the National Natural Science Foundation of China No.61771432, 61302118, and 61702464, the Basic Research Projects of Education Department of Henan No. 21zx003, and No.20A880004, and the Key projects Natural Science Foundation of Henan (2023045), and the Postgraduate Education Reform and Quality Improvement Project of Henan Province YJS2021KC12 and YJS2022AL034.

Author information

Authors and Affiliations

College of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou, 450002, China
Wenjun Song, Pu Dai & Qiuwen Zhang

Authors

Wenjun Song
View author publications
You can also search for this author in PubMed Google Scholar
Pu Dai
View author publications
You can also search for this author in PubMed Google Scholar
Qiuwen Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Qiuwen Zhang.

Ethics declarations

Conflict of interests

The authors have no competing interests to declare that are relevant to the content of this article.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Song, W., Dai, P. & Zhang, Q. Content-adaptive mode decision for low complexity 3D-HEVC. Multimed Tools Appl 82, 26435–26450 (2023). https://doi.org/10.1007/s11042-023-14874-4

Download citation

Received: 16 December 2021
Revised: 19 September 2022
Accepted: 06 February 2023
Published: 02 March 2023
Issue Date: July 2023
DOI: https://doi.org/10.1007/s11042-023-14874-4

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Content-adaptive mode decision for low complexity 3D-HEVC

Abstract

Access this article

Similar content being viewed by others

Video steganography: recent advances and challenges

A comprehensive survey on video frame interpolation techniques

CNN-based intra partitioning process for spatial and SNR scalability for SHVC

Data availability

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Content-adaptive mode decision for low complexity 3D-HEVC

Abstract

Access this article

Similar content being viewed by others

Video steganography: recent advances and challenges

A comprehensive survey on video frame interpolation techniques

CNN-based intra partitioning process for spatial and SNR scalability for SHVC

Data availability

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation