Abstract
This article describes a framework to speed up the HEVC encoding decisions for on-demand transrating of bitstreams. The methods proposed collect information from a high-quality reference bitstream which after processing is used to limit the number of modes evaluated in subsequent re-encodings at different bitrates. In this way, the time required to process re-encode-time computing-intensive decisions, such as partitioning and motion estimation is significantly reduced. The methods proposed are a combination of heuristics with a statistical basis and fast decision techniques trained using automatic learning methodologies. Experimental results using the HEVC reference encoder show that jointly the methods proposed reduce the transcoding computational complexity by up to 78.8%, with Bjontegaard bitrate deltas penalties smaller than 1.06%. A comparison with related works showed that the proposed method is able to outperform state-of-the-art solutions in terms of combined rate-distortion–complexity performance indicators.
Similar content being viewed by others
References
Bjontegaard, G.: Calculation of average PSNR differences between RD-curves. In: ITU-T Q. 6/SG16 VCEG, 15th Meeting, Austin, Texas (2001)
Bubolz, T.L.A., Conceição, R.A., Grellert, M., Agostini, L., Zatt, B., Correa, G.: Quality and energy-aware hevc transrating based on machine learning. IEEE Trans. Circuits Syst. I: Regul. Papers 66(6), 2124–2136 (2019). https://doi.org/10.1109/TCSI.2019.2903978
Correa, G., Assuncao, P.A., Agostini, L.V., da Silva Cruz, L.A.: Fast hevc encoding decisions using data mining. IEEE Trans. Circuits Syst. Video Technol. 25(4), 660–673 (2015). https://doi.org/10.1109/TCSVT.2014.2363753
De Praeter, J., Díaz-Honrubia, A.J., Van Kets, N., Van Wallendael, G., De Cock, J., Lambert, P., Van de Walle, R.: Fast simultaneous video encoder for adaptive streaming. In: Multimedia Signal Processing, 2015 IEEE 17th International Workshop on, IEEE, pp 1–6 (2015)
Díaz-Honrubia, A.J., Cebrián-Márquez, G., Martínez, J.L., Cuenca, P., Puerta, J.M., Gámez, J.A.: Low-complexity heterogeneous architecture for h. 264/hevc video transcoding. J. Real-Time Image Process. 12(2), 311–327 (2016)
Feurer, M., Hutter, F.: Hyperparameter optimization. Automated Machine Learning, pp. 3–33. Springer, Cham (2019)
Goswami, K., Kim, B.: A design of fast high-efficiency video coding scheme based on markov chain monte carlo model and bayesian classifier. IEEE Trans. Ind. Electron. 65(11), 8861–8871 (2018). https://doi.org/10.1109/TIE.2018.2815941
Goswami, K., Kim, B.G.: A design of fast high-efficiency video coding scheme based on markov chain monte carlo model and bayesian classifier. IEEE Trans. Ind. Electron. 65(11), 8861–8871 (2018)
Goswami, K., Lee, J.H., Kim, B.G.: Fast algorithm for the high efficiency video coding (hevc) encoder using texture analysis. Inf. Sci. 364, 72–90 (2016)
Grellert, M., Oliveira, T., Duarte, CR., da Silva Cruz, LA.: Fast HEVC transrating using random forests. In: Proc. of the IEEE International Conference Visual Communications and Image Processing, pp 1–4 (2018a)
Grellert, M., Zatt, B., Bampi, S., da Silva Cruz, L.A.: Fast coding unit partition decision for hevc using support vector machines. IEEE Trans. Circuits Syst. Video Technol. 29(6), 1741–1753 (2018b)
Kim, C.K., Hr, Lee, Tj, Jung, Kim, B.G., Seo, Kd: An efficient delay-constrained arq scheme for mmt packet-based real-time video streaming over ip networks. J. Real-Time Image Process. 12(2), 257–271 (2016)
Mallikarachchi, T., Talagala, D.S., Arachchi, H.K., Fernando, A.: Content-adaptive feature-based cu size prediction for fast low-delay video encoding in hevc. IEEE Trans. Circuits Syst. Video Technol. 28(3), 693–705 (2018). https://doi.org/10.1109/TCSVT.2016.2619499
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., et al.: Scikit-learn: Machine learning in python. J. Mach. Learn. Res. 12(Oct), 2825–2830 (2011)
Schroeder, D., Ilangovan, A., Reisslein, M., Steinbach, E.: Efficient multi-rate video encoding for HEVC-based adaptive http streaming. IEEE Trans. Circuits Syst. Video Technol. 28(1), 143–157 (2018)
Sharman, K., Sühring, K.: JCTVC-X1100 - Common Test Conditions. In: ITU-T SG16WP3 and ISO/IEC JTC1/SC29/WG11 24th JCT-VC Meeting Docs (2016)
Shen, L., Feng, G.: Content-based adaptive shvc mode decision algorithm. IEEE Trans. Multimed. 21(11), 2714–2725 (2019). https://doi.org/10.1109/TMM.2019.2909859
Stockhammer, T.: Dynamic adaptive streaming over HTTP: standards and design principles. In: Proceedings of the second annual ACM conference on Multimedia systems, ACM, pp. 133–144 (2011)
Van, L.P., De Praeter, J., Van Wallendael, G., Van Leuven, S., De Cock, J., Van de Walle, R.: Efficient bit rate transcoding for high efficiency video coding. IEEE Trans. Multimed. 18(3), 364–378 (2016)
Vetro, A., Christopoulos, C., Sun, H.: Video transcoding architectures and techniques: an overview. IEEE Signal Process. Mag. 20(2), 18–29 (2003)
Wallendael, GV., Cock, JD., de Walle, RV.: Fast transcoding for video delivery by means of a control stream. In: 2012 19th IEEE International Conference on Image Processing, pp. 733–736 (2012)
Xu, Z., Min, B., Cheung, R.C.: A fast inter cu decision algorithm for hevc. Signal Process. Image Commun. 60, 211–223 (2018)
Yang, SH., Zhong, CC.: Fast coding-unit mode decision for HEVC transrating. In: Computer and Information Technology (CIT), 2017 IEEE International Conference on, IEEE, pp. 93–100 (2017)
Acknowledgements
The authors wish to acknowledge the financial support of the Portuguese research funding agency FCT under project UIDB/EEA/50008/2020.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Grellert, M., da Silva Cruz, L.A., Zatt, B. et al. Coding mode decision algorithm for fast HEVC transrating using heuristics and machine learning. J Real-Time Image Proc 18, 1881–1896 (2021). https://doi.org/10.1007/s11554-020-01063-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11554-020-01063-x