Abstract
High Efficiency Video Coding (HEVC) has introduced a quad-tree (QT) based coding unit (CU) partition structure, which has significantly improved the compression performance compared with Advanced Video Coding (AVC). However, the use of rate-distortion optimization (RDO) techniques in the search for the optimal CU partition has increased the encoding complexity of the video. In this paper, we propose a fast CU partitioning algorithm based on image similarity, which makes decisions on the partitioning of the parent CU by comparing the similarity of the image content of four sub-CUs. We propose four different neural networks based on this algorithm, and experimental results demonstrate that our proposed network structure reduces encoding time by 59.8%, 58.7%, 58.5%, and 59.3% respectively, while increasing the Bjøntegaard delta bit-rate (BDBR) by 2.32%, 1.99%, 1.82%, and 1.91%, respectively.










Similar content being viewed by others
Data availability
The datasets generated during or analyzed during the current study are available from the corresponding author upon reasonable request.
References
Wiegand T, Sullivan GJ, Bjontegaard G, Luthra A (2003) Overview of the h.264/avc video coding standard. IEEE Trans Circ Syst Vid Technol 13(7):560–576
Sullivan GJ, Ohm JR, Han WJ, Wiegand T (2013) Overview of the high efficiency video coding (hevc) standard. IEEE Trans Circ Syst Vid Technol 22(12):1649–1668
Menon VV, Amirpour H, Timmerer C, Ghanbari M (2021) Incept: Intra cu depth prediction for hevc. In: 2021 IEEE 23rd International workshop on multimedia signal processing (MMSP), pp 1–6
Hou J, Li D, Li Z, Jiang X (2013) Fast cu size decision based on texture complexity for hevc intra coding. In: Proceedings 2013 International conference on mechatronic sciences, electric engineering and computer (MEC), pp 1096–1099
Shen Y, Zhang S, Yang C (2014) Image texture based fast cu size selection algorithm for hevc intra coding. In: 2014 IEEE International conference on signal processing, communications and computing (ICSPCC), pp 363–367
Shen L, Zhang Z, An P (2013) Fast cu size decision and mode decision algorithm for hevc intra coding. IEEE Trans Consum Electron 59(1):207–213
Cho S, Kim M (2013) Fast cu splitting and pruning for suboptimal cu partitioning in hevc intra coding. IEEE Trans Circ Syst Vid Technol 23(9):1555–1564
Shi H, Fan L, Chen H (2015) A fast cu size decision algorithm based on adaptive depth selection for hevc encoder. In: 2014 International conference on audio, language and image processing
Kim N, Jeon S, Shim HJ, Jeon B, Lim S-C, Ko H (2016) Adaptive keypointbased cu depth decision for hevc intra coding. In: 2016 IEEE International symposium on broadband multimedia systems and broadcasting (BMSB), pp 1–3
Zao L, Fan X, Ma S, Zhao D (2014) Fast intra-encoding algorithm for high efficiency video coding. Signal Process Image Commun 29(9):935–944
Lee D, Jeong J (2017) Fast intra coding unit decision for high efficiency video coding based on statistical information. Signal Process Image Commun 55:121–129
Li Z, Zhao Y, Dai Z, Rogeany K, Cen Y, Xiao Z, Yang W (2019) A fast cu partition method based on cu depth spatial correlation and rd cost characteristics for hevc intra coding. Signal Process Image Commun 75:141–146
Zhang M, Lai D, Liu Z, An C (2019) A novel adaptive fast partition algorithm based on cu complexity analysis in hevc. Multimed Tools Appl 78:1035–1051
Zhang Y, Li N, Kwong S, Jiang G, Zeng H (2019) Statistical early termination and early skip models for fast mode decision in hevc intra coding. ACM Trans Multimed Comput, Commun, Appl (TOMM) 15(3):1–23
Ruiz D, VeliborAdzic Fernández-Escribano G, Kalva H, Cuenca P (2014) Fast partitioning algorithm for hevc intra frame coding using machine learning. In: Image processing (ICIP):2014 IEEE international conference on
Hu N, Yang EH (2015) Fast mode selection for hevc intra-frame coding with entropy coding refinement based on a transparent composite model. IEEE Trans Circ Syst Vid Technol 25(9):1521–1532
Hu Q, Shi Z, Zhang X, Gao Z (2016) Fast hevc intra mode decision based on logistic regression classification. In: 2016 IEEE International symposium on broadband multimedia systems and broadcasting (BMSB), pp 1–4
Duanmu F, Ma Z, Wang Y (2016) Fast mode and partition decision using machine learning for intra-frame coding in hevc screen content coding extension. IEEE J Emerg Sel Top Circ Syst 6(4):517–531
Zhang T, Sun MT, Zhao D, Gao W (2017) Fast intra-mode and cu size decision for hevc. IEEE Trans Circ Syst Vid Technol 27(8):1714–1726
Liu X, Li Y, Liu D, Wang P, Yang LT (2019) An adaptive cu size decision algorithm for hevc intra prediction based on complexity classification using machine learning. IEEE Trans Circ Syst Vid Technol 29(1):144–155
Kuang W, Chan Y-L, Tsang S-H, Siu W-C (2020) Online-learning-based bayesian decision rule for fast intra mode and cu partitioning algorithm in hevc screen content coding. IEEE Trans Image Process 29:170–185. https://doi.org/10.1109/TIP.2019.2924810
Zhang Y, Pan Z, Li N, Wang X, Jiang G, Kwong S (2018) Effective data driven coding unit size decision approaches for hevc intra coding. IEEE Trans Circ Syst Vid Technol 28(11):3208–3222
Erabadda B, Mallikarachchi T, Kulupana G, Fernando A (2019) Content adaptive fast cu size selection for hevc intra-prediction. In: 2019 IEEE International conference on consumer electronics (ICCE)
Werda I, Maraoui A, Sayadi FE, Masmoudi N (2022) Fast cu partition and intra mode prediction method for hevc. In: 2022 IEEE 9th International conference on sciences of electronics, technologies of information and telecommunications (SETIT)
Amna M, Imen W, Nacir O, Ezahra SF (2022) Svm-based method to reduce hevc cu partition complexity. In: 2022 19th International multi-conference on systems, signals and devices (SSD), pp 480–484
Amna M, Imen W, Bouaafia S, Sayadi FE (2022) Machine learning-based approaches to reduce hevc intra coding unit partition decision complexity. Multimed Tools Appl 81(2):2777–2802
Feng Z, Liu P, Jia K, Duan K (2018) Fast intra ctu depth decision for hevc. IEEE Access 6:45262–45269
Kim K, Ro WW (2018) Fast cu depth decision for hevc using neural networks. IEEE Trans Circ Syst Vid Technol 29(5):1462–1473
Xu M, Li T, Wang Z, Deng X, Yang R, Guan Z (2018) Reducing complexity of hevc: A deep learning approach. IEEE Trans Image Process 27(10):5044–5059
Shi J, Gao C, Chen Z (2019) Asymmetric-kernel cnn based fast ctu partition for hevc intra coding. In: 2019 IEEE International symposium on circuits and systems (ISCAS)
Wang Z, Li F (2021) Convolutional neural network based low complexity HEVC intra encoder. Multimed Tools Appl 80(2):2441–2460
Amna M, Imen W, Sayadi FE (2021) Lenet5-based approach for fast intra coding. In: 2020 10th International symposium on signal, image, video and communications (ISIVC)
Chen Z, Shi J, Li W (2020) Learned fast hevc intra coding. IEEE Trans Image Process 29:5431–5446
Zhao R, Huang H, Zhang R, Jing X (2023) Fast hevc intra ctu partition algorithm based on lightweight cnn. In: Signal and information processing, networking and computers, pp 996–1003
Zaki F, Mohamed AE, Sayed SG (2021) Ctunet: A deep learning-based framework for fast ctu partitioning of h265/hevc intra- coding. Ain Shams Eng J 12(2):1859–1866
Hari P, Jadhav V, Rao BKNS (2022) Ctu partition for intra-mode hevc using convolutional neural network. In: 2022 IEEE International symposium on smart electronic systems (iSES), pp 548–551
Imen W, Amna M, Fatma B, Ezahra SF, Masmoudi N (2022) Fast hevc intra-cu decision partition algorithm with modified lenet-5 and alexnet 16(7):1811–1819
Zagoruyko S, Komodakis N (2015) Learning to compare image patches via convolutional neural networks. In: 2015 IEEE conference on computer vision and pattern recognition (CVPR), pp 4353–4361
Acknowledgements
The work is supported by the National Key Research and Development Program of China(2022YFF0607000), National Natural Science Foundation of China (61871188), Guangdong Basic and Applied Basic Research Foundation (2023A1515010993), Guangdong Provincial Key Laboratory of Human Digital Twin (2022B1212010004), Guangzhou City Science and Technology Research Projects (2023B01J0011).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interests
We declared there is no conflict of interest in terms of this submission.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix A
Appendix B different architecture with different sizes of CUs
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Cao, Y., Wu, W., Zhou, Z. et al. Fast CU patition based on image similarity using neural network. Multimed Tools Appl 83, 33185–33205 (2024). https://doi.org/10.1007/s11042-023-16962-x
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-16962-x