Abstract
The use of machine learning techniques to reduce recent video coding standards complexity such as High Efficiency Video Coding (HEVC) has received prominent attention. In fact, the fascinating HEVC standard coding efficiency gap is performed at the expense of dramatically increasing coding complexity. HEVC adopts a similar block-based hybrid video coding framework its predecessor H.264 Advanced Video Coding (H.264/AVC), but it provides a highly flexible hierarchy of unit representation, which includes three units: coding unit (CU), prediction unit (PU) and transform unit (TU). The recursive splitting of CU is content adaptive, which is one of the biggest differences compared to H.264/AVC. Adopting a large variety of coding unit (CU) sizes, the quadtree partition takes the lion’s share of the HEVC encoding complexity, making it ever more challenging to use rigid traditional inference models to predict the efficient coding decisions. In this context, this paper investigates the resulting implications on both coding efficiency and encoding complexity, when using Fuzzy Support Vector Machine (FSVM) and convolutional Neural Network (CNN) models for partitioning in the HEVC intra-prediction. The first approach is an online FSVM-based algorithm designed to predict efficiently the CU partition module. The second one is a deep CNN method founded on a large-scale database of substantial CU partition training. Experimental results reveal that the proposed deep CNN approach, with 66.04% complexity reduction, outperforms the proposed online FSVM approach that achieves 45.33%. However, the FSVM with only 0.067% loss in coding efficiency compared to 1.69% engendered with the CNN, is considered as the approach that performs the best tradeoff between the compression efficiency and the complexity reduction when optimizing the HEVC complexity at all Intra configuration.
Similar content being viewed by others
References
Abadi M, Agarwal A, Barham P, Brevdo E, Chen Z, Citro C, Corrado GS, Davis A, Dean J, Devin M, et al (2016) Tensorflow: Large-scale machine learning on heterogeneous distributed systems. arXiv preprint arXiv:1603.04467
Amna M, Imen W, Ezahra SF (2021) Lenet5-based approach for fast intra coding. In 2020 10th International Symposium on Signal, Image, Video and Communications (ISIVC), pp. 1–4
Bjontegaard G (2001) Calculation of average psnr differences between rd-curves. VCEG-M33
Bouaafia R, Khemiri S, Sayadi FE, Atri M (2020) Fast cu partition-based machine learning approach for reducing hevc complexity. J Real Time Image Process 17(1):185–196
Chang CC, Lin CJ (2011) Libsvm: a library for support vector machines. ACM transactions on intelligent systems and technology (TIST) 2(3):1–27
Correa G, Assuncao P, Agostini L, L.A. eda SilvaCruz, (2012) Performance and computational complexity assessment of highefficiency video encoders. IEEE Transactions on Circuits and Systems for Video Technology 22(12):1899–1909
Fernandez DG, Alberto A, Del B, Botella G, Meyer-Baese U, Meyer-Baese A, Grecos C (2017) Information fusion based techniques for hevc. In Real-Time Image and Video Processing 10223:102230M
Fini MR, Zargari F (2019) Two stage fast mode decision algorithm for intra prediction. HEVC. Multimedia Tools and Applications 75(13):7541–7558
Grellert M, Zatt B, Bampi S, da Silva Cruz LA (2018) Fast coding unit partition decision for HEVC using support vector machines. Trans Circuits Syst Video Technol. 29(6):1741–1753
Hassan M, Shanableh T (2019) Predicting split decisions of coding units in hevc video compression using machine learning techniques. Multimedia Tools and Applications 78(23):32735–32754
Hosseini E, Pakdaman F, Hashemi MR, Ghanbari M (2019) A computationally scalable fast intra coding scheme for hevc video encoder. Multimedia Tools and Applications 78(9):11607–11630
JCT-VC (2019) Hm software. https://hevc.hhi.fraunhofer.de/. [Online; Accessed 15 Apr 2019]
Jiang W, Ma H, Chen Y (2012) Gradient based fast mode decision algorithm for intra prediction in hevc. In 2012 2nd international conference on consumer electronics, communications and networks (CECNet), pp. 1836–1840
Kuanar S, Rao KR, Conly C (2018) Fast mode decision in hevc intra prediction, using region wise cnn feature classification. In 2018 IEEE International Conference on Multimedia & Expo Workshops (ICMEW), pp. 1–4
Lan C, Xu J, Sullivan GJ, Wu F (2012) Intra transform skipping, document jctvc-i0408. Joint Collaborative Team on Video Coding (JCT-VC)
Li T, Xu M, Deng X (2017) A deep convolutional neural network approach for complexity reduction on intra-mode hevc. In 2017 IEEE International Conference on Multimedia and Expo (ICME), pp. 1255–1260
LI Z, Zhang Y, Li B (2012) Gradient-based fast decision for intra prediction in HEVC. In Vis Commun Image Process. pages 1–6
Lin CF, Wang SD (2002) Fuzzy support vector machines. IEEE transactions on neural networks 13(2):464–471
Liu X, Li Y, Liu D, Wang LT, Yang P (2017) An adaptive cu size decision algorithm for hevc intra prediction based on complexity classification using machine learning. IEEE Transactions on Circuits and Systems for Video Technology 29(1):144–155
Liu Z, Yu X, Gao Y, Chen S, Ji X, Wang D (2016) Cu partition mode decision for hevc hardwired intra encoder using convolution neural network. IEEE Transactions on Image Processing 25(11):5088–5103
Mallikarachchi T, Fernando A, Arachchi HK (2014) Efficient coding unit size selection based on texture analysis for hevc intra prediction. In 2014 IEEE International Conference on Multimedia and Expo (ICME), pp. 1–6
Moraes D, Wainer J, Rocha A (2016) Low false positive learning with support vector machines. Journal of Visual Communication and Image Representation 38:340–350
Shen X, YU L (2013) A fast hevc inter cu selection method based on pyramid motion divergence. EURASIP journal on image and video processing 2013(1):1–11
Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R (2014) Dropout: a simple way to prevent neural networks from overfitting. The journal of machine learning research 15(1):1929–1958
Sullivan GJ, Ohm JR, Han WJ, Wiegand T (2012) Overview of the high efficiency video coding (HEVC) standard. IEEE Trans Circuits Syst Video Technol 22(12):1649–1668
Tsai AC, Wang JF, Lin WG, Yang JF (2007) A simple and robust direction detection algorithm for fast h. 264 intra prediction. In 2007 IEEE International Conference on Multimedia and Expo, pp 1587–1590
Wang C, Yu L, Wang S (2018) Accelerate cu partition in hevc using large-scale convolutional neural network. arXiv preprint arXiv:1809.08617
Wiegand T, Sullivan GJ, Bjontegaard G, Luthra A (2003) Overview of the h. 264/avc video coding standard. IEEE Transactions on circuits and systems for video technology 13(7):560–576
Xiph.org (2019) Xiph.org video test media. https://media.xiph.org/video/derf. [Online; Accessed 15 Apr 2019]
Xiong J, Li H, Wu Q, Meng F (2013) A fast hevc inter cu selection method based on pyramid motion divergence. IEEE transactions on multimedia 16(2):559–564
Xu M, Li T, Wang Z, Deng X, Yang Z, Guan R (2018) Reducing complexity of hevc: A deep learning approach. IEEE Transactions on Image Processing 27(10):5044–5059
Yin Y, Yang X, Lin J, Chen Y, Fang R (2018) A fast block partitioning algorithm based on svm for hevc intra coding. In Proceedings of the 2018 the 2nd International Conference on Video and Image Processing, pp. 176–181
Zhang T, Sun MT, Zhao D, Gao W (2016) Fast intra-mode and cu size decision for hevc. IEEE Transactions on Circuits and Systems for Video Technology 27(8):1714–1726
Zhang Y, Kwong A, Wang X, Yuan H, Pan Z, Xu L (2015) Machine learning-based coding unit depth decisions for flexible complexity allocation in high efficiency video coding. IEEE Transactions on Image Processing 24(7):2225–2238
Zhu L, Zhang Y, Pan Z, Wang R, Kwong S, Peng Z (2017) Binary and multi-class learning based low complexity optimization for hevc encoding. IEEE Transactions on Broadcasting 63(3):547–561
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Amna, M., Imen, W., Soulef, B. et al. Machine Learning-Based approaches to reduce HEVC intra coding unit partition decision complexity. Multimed Tools Appl 81, 2777–2802 (2022). https://doi.org/10.1007/s11042-021-11678-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-021-11678-2