Abstract
Aiming at the low-power video communication with low latency at resource-constrained terminals in the Internet of Things, a machine learning-based fast transcoding from distributed video coding (DVC) to high-efficiency video coding (HEVC) is put forward. In order to accelerate the transcoding, the DVC decoding information is efficiently exploited to reduce HEVC encoding complexity at both levels of coding unit (CU) and prediction unit (PU). First of all, the predictions of CU partition and PU modes are regarded as two binary classification tasks. Subsequently, the initial features are extracted from the DVC decoding information and the support vector machine (SVM)-recursive feature elimination algorithm is adopted to select feature vectors to construct the training data which is used to train the SVM classifiers for CU and PU, respectively. By means of the top-down division prediction method, the CU partition is first determined by the trained SVM classifier. For the CUs which are not further split, the PU modes will be predicted to terminate the quad-tree coding process of HEVC in advance, so that HEVC encoding complexity is reduced. Experimental results show that the proposed algorithm can reduce 57.64% computational complexity on average with Bjøntegaard delta bit-rate 2.43%. In terms of transcoding time efficiency, the proposed algorithm outperforms the state-of-the-art fast DVC to HEVC transcoding algorithms based on machine learning.
Similar content being viewed by others
References
Chen, W.: Intelligent manufacturing production line data monitoring system for industrial internet of things. Comput Commun. 151, 31–41 (2020)
Shafi, U., Mumtaz, R., García-Nieto, J., et al.: Precision agriculture techniques and practices: from considerations to applications. Sensors (Basel) 19(17), 3796 (2019)
Yin, Y., Zeng, Y., Chen, X., et al.: The internet of things in healthcare: an overview. J. Ind. Inf. Integration 1, 3–13 (2016)
Baucas, M.J., Spachos, P., Gregori, S.: Internet-of-Things devices and assistive technologies for health care: applications, challenges, and opportunities. IEEE Signal Process. Mag. 38(4), 65–77 (2021)
Akpakwu, G.A., Silva, B.J., Hancke, G.P., et al.: A survey on 5G networks for the internet of things: communication technologies and challenges. IEEE Access. 6, 3619–3647 (2018)
Girod, B., Aaron, A.M., Rane, S., et al.: Distributed video coding. Proc. IEEE. 93(1), 71–83 (2005)
Kodavalla VK (2018) Transcoding of next generation distributed video codec for mobile video. In: Proceedings of ICAECC, pp 1–7
Peixoto E, Queiroz RL. de, Mukherjee D (2008) Mobile video communications using a Wyner–Ziv transcoder. In: Proceedings of SPIE, pp 1–11
Peixoto, E., de Queiroz, R.L., Mukherjee, D.: A Wyner–Ziv video transcoder. IEEE Trans. Circ. Syst. Video Technol. 20(2), 189–200 (2010)
Martínez, J.L., Fernández-Escribano, G., Kalva, H., et al.: Motion vector refinement in a Wyner–Ziv to H.264 transcoder for mobile telephony. IET Image Process. 3(6), 335–339 (2009)
Corrales-García, A., Martínez, J.L., Fernández-Escribano, G., et al.: Variable and constant bitrate in a DVC to H.264/AVC transcoder. Signal Process Image Commun. 26(6), 310–323 (2011)
Corrales-García, A., Rodríguez-Sánchez, R., Martínez, J. L., et al.: A GPU-Based DVC to H.264/AVC Transcoder. In: Proceedings of HAIS, pp. 233–240 (2010)
Corrales-García, A., Rodríguez-Sánchez, R., Martínez, J.L., et al.: Multimedia communications using a fast and flexible DVC to H.264/AVC/SVC transcoder. J. Signal Process. Syst. 79(3), 211–232 (2015)
Kao, C., Huang, T., Chen, H. H., et al.: Perceptully lossless video re-encoding for cloud transcoding. In: Proceedings of China SIP, pp. 301–305 (2014)
Rong, S., Qing, L., Xu, Y., et al.: Mobile video communications based on cloud transcoding. Proceedings. 1(3), 1431–1433 (2017)
Lei, T.C.W., Tseng, F.S.: Light-weight video codec at terminal for video coding in the cloud. J. Signal Process Syst. 91, 627–639 (2019)
Zhu, L., Zhang, Y., Li, N., et al.: Machine learning based fast H.264/AVC to HEVC transcoding exploiting block partition similarity. J. Vis. Commun. Image Rep. 38, 824–837 (2016)
Zhang, D., Tong, J., Zang, D.: Fast CU partition for H.264/AVC to HEVC Transcoding Based on Fisher Discriminant Analysis. In: Proceedings of VCIP, pp. 1–4 (2016)
Lin, H., He, X., Qing, L., et al.: Machine learning-based H.264/AVC to HEVC transcoding via motion information reuse and coding mode similarity analysis. IET Image Process 13(1), 34–43 (2019)
Martínez, J.L., Corrales-García, A., Cuenca, P., et al.: Applying data mining techniques in a Wyner–Ziv to H.264 video transcoder. In: Proceedings of IWANN, pp. 497–504 (2011)
Qing, L., Hua, L., He, X., et al.: A video transcoding method from DVC to HEVC based on Naive Bayes. China patent, CN201811479353.9 (2019)
Hua, L., Tang, T., Qing, L., et al.: Fast transcoding of DVC-HEVC based on naive Bayes classification. J. Terahertz Sci. Electron. Inf. Technol. 18(2), 235–240 (2020)
Qing, L., Hua, L., He, X., et al.: A video transcoding method from DVC to HEVC based on fisher discriminant. China patent, CN201810573870.6 (2018)
Yang, J., Qing, L., He, X., et al.: A fast DVC to HEVC transcoding for mobile video communication. In: Proceedings of ITNEC, pp. 505–509 (2019)
Van, L.P., Praeter, J.D., Wallendael, G.V., et al.: Efficient bit rate transcoding for high efficiency video coding. IEEE Trans. Multimedia 18(3), 364–378 (2016)
Chang, C.-C., Lin, C.-J.: LIBSVM: a library for support vector machine. ACM Trans. Intell. Syst. Technol. 2(3), 1–27 (2011)
Guyon, I., Weston, J., Barnhill, S., et al.: Gene selection for cancer classification using support vector machines. Mach. Learn. 46, 389–422 (2002)
Yan, K., Zhang, D.: Feature selection and analysis on correlated gas sensor data with recursive feature elimination. Sens. Actuators, B Chem. 212, 353–363 (2015)
Wu, J.: Beauty of Mathematics. Posts and Telecommunications Press, Beijing (2020)
Yang, J., Qing, L., Zeng, W., et al.: High-order statistical modeling based on a decision tree for distributed video coding. IEEE Trans. Circuits Syst. Video Technol. 29(5), 1488–1502 (2019)
Bjøntegaard, G.: Calculation of average PSNR differences between RD-curves. document M33, ITU-T Video Coding Experts Group, Austin, TX, USA (2001)
Acknowledgements
This work was supported in part by Central Government Funds of Guiding Local Scientific and Technological Development for Sichuan Province under Grant No. 2021ZYD0030, the State Scholarship Fund of China Scholarship Council under Grant No. 202108515045, and the Scientific Research Project of Sichuan Science and Technology Department under Grant No.2021YJ0099.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Yang, J., He, PY., Yang, H. et al. Machine learning-based fast transcoding for low-power video communication in Internet of Things. SIViP 16, 2183–2191 (2022). https://doi.org/10.1007/s11760-022-02182-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-022-02182-7