Machine learning-based fast transcoding for low-power video communication in Internet of Things

Yang, Jia; He, Pi-Yan; Yang, Hong; Xue, Ya-Juan; Jin, Yan-Hua

doi:10.1007/s11760-022-02182-7

Machine learning-based fast transcoding for low-power video communication in Internet of Things

Original Paper
Published: 27 March 2022

Volume 16, pages 2183–2191, (2022)
Cite this article

Signal, Image and Video Processing Aims and scope Submit manuscript

Jia Yang¹,
Pi-Yan He ORCID: orcid.org/0000-0003-1562-9592²,
Hong Yang³,
Ya-Juan Xue¹ &
…
Yan-Hua Jin²

172 Accesses
2 Citations
Explore all metrics

Abstract

Aiming at the low-power video communication with low latency at resource-constrained terminals in the Internet of Things, a machine learning-based fast transcoding from distributed video coding (DVC) to high-efficiency video coding (HEVC) is put forward. In order to accelerate the transcoding, the DVC decoding information is efficiently exploited to reduce HEVC encoding complexity at both levels of coding unit (CU) and prediction unit (PU). First of all, the predictions of CU partition and PU modes are regarded as two binary classification tasks. Subsequently, the initial features are extracted from the DVC decoding information and the support vector machine (SVM)-recursive feature elimination algorithm is adopted to select feature vectors to construct the training data which is used to train the SVM classifiers for CU and PU, respectively. By means of the top-down division prediction method, the CU partition is first determined by the trained SVM classifier. For the CUs which are not further split, the PU modes will be predicted to terminate the quad-tree coding process of HEVC in advance, so that HEVC encoding complexity is reduced. Experimental results show that the proposed algorithm can reduce 57.64% computational complexity on average with Bjøntegaard delta bit-rate 2.43%. In terms of transcoding time efficiency, the proposed algorithm outperforms the state-of-the-art fast DVC to HEVC transcoding algorithms based on machine learning.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 3

Fig. 4

Low complexity based ultra-high quality video compression method for multimedia-centric internet of things (IoT) services

Article 05 June 2017

HEVC coding unit decision based on machine learning

Article 21 January 2022

Improved Prediction Estimation Based H.264 to HEVC Intra Transcoding

References

Chen, W.: Intelligent manufacturing production line data monitoring system for industrial internet of things. Comput Commun. 151, 31–41 (2020)
Article Google Scholar
Shafi, U., Mumtaz, R., García-Nieto, J., et al.: Precision agriculture techniques and practices: from considerations to applications. Sensors (Basel) 19(17), 3796 (2019)
Article Google Scholar
Yin, Y., Zeng, Y., Chen, X., et al.: The internet of things in healthcare: an overview. J. Ind. Inf. Integration 1, 3–13 (2016)
Google Scholar
Baucas, M.J., Spachos, P., Gregori, S.: Internet-of-Things devices and assistive technologies for health care: applications, challenges, and opportunities. IEEE Signal Process. Mag. 38(4), 65–77 (2021)
Article Google Scholar
Akpakwu, G.A., Silva, B.J., Hancke, G.P., et al.: A survey on 5G networks for the internet of things: communication technologies and challenges. IEEE Access. 6, 3619–3647 (2018)
Article Google Scholar
Girod, B., Aaron, A.M., Rane, S., et al.: Distributed video coding. Proc. IEEE. 93(1), 71–83 (2005)
Article Google Scholar
Kodavalla VK (2018) Transcoding of next generation distributed video codec for mobile video. In: Proceedings of ICAECC, pp 1–7
Peixoto E, Queiroz RL. de, Mukherjee D (2008) Mobile video communications using a Wyner–Ziv transcoder. In: Proceedings of SPIE, pp 1–11
Peixoto, E., de Queiroz, R.L., Mukherjee, D.: A Wyner–Ziv video transcoder. IEEE Trans. Circ. Syst. Video Technol. 20(2), 189–200 (2010)
Article Google Scholar
Martínez, J.L., Fernández-Escribano, G., Kalva, H., et al.: Motion vector refinement in a Wyner–Ziv to H.264 transcoder for mobile telephony. IET Image Process. 3(6), 335–339 (2009)
Article Google Scholar
Corrales-García, A., Martínez, J.L., Fernández-Escribano, G., et al.: Variable and constant bitrate in a DVC to H.264/AVC transcoder. Signal Process Image Commun. 26(6), 310–323 (2011)
Article Google Scholar
Corrales-García, A., Rodríguez-Sánchez, R., Martínez, J. L., et al.: A GPU-Based DVC to H.264/AVC Transcoder. In: Proceedings of HAIS, pp. 233–240 (2010)
Corrales-García, A., Rodríguez-Sánchez, R., Martínez, J.L., et al.: Multimedia communications using a fast and flexible DVC to H.264/AVC/SVC transcoder. J. Signal Process. Syst. 79(3), 211–232 (2015)
Article Google Scholar
Kao, C., Huang, T., Chen, H. H., et al.: Perceptully lossless video re-encoding for cloud transcoding. In: Proceedings of China SIP, pp. 301–305 (2014)
Rong, S., Qing, L., Xu, Y., et al.: Mobile video communications based on cloud transcoding. Proceedings. 1(3), 1431–1433 (2017)
Google Scholar
Lei, T.C.W., Tseng, F.S.: Light-weight video codec at terminal for video coding in the cloud. J. Signal Process Syst. 91, 627–639 (2019)
Article Google Scholar
Zhu, L., Zhang, Y., Li, N., et al.: Machine learning based fast H.264/AVC to HEVC transcoding exploiting block partition similarity. J. Vis. Commun. Image Rep. 38, 824–837 (2016)
Article Google Scholar
Zhang, D., Tong, J., Zang, D.: Fast CU partition for H.264/AVC to HEVC Transcoding Based on Fisher Discriminant Analysis. In: Proceedings of VCIP, pp. 1–4 (2016)
Lin, H., He, X., Qing, L., et al.: Machine learning-based H.264/AVC to HEVC transcoding via motion information reuse and coding mode similarity analysis. IET Image Process 13(1), 34–43 (2019)
Article Google Scholar
Martínez, J.L., Corrales-García, A., Cuenca, P., et al.: Applying data mining techniques in a Wyner–Ziv to H.264 video transcoder. In: Proceedings of IWANN, pp. 497–504 (2011)
Qing, L., Hua, L., He, X., et al.: A video transcoding method from DVC to HEVC based on Naive Bayes. China patent, CN201811479353.9 (2019)
Hua, L., Tang, T., Qing, L., et al.: Fast transcoding of DVC-HEVC based on naive Bayes classification. J. Terahertz Sci. Electron. Inf. Technol. 18(2), 235–240 (2020)
Google Scholar
Qing, L., Hua, L., He, X., et al.: A video transcoding method from DVC to HEVC based on fisher discriminant. China patent, CN201810573870.6 (2018)
Yang, J., Qing, L., He, X., et al.: A fast DVC to HEVC transcoding for mobile video communication. In: Proceedings of ITNEC, pp. 505–509 (2019)
Van, L.P., Praeter, J.D., Wallendael, G.V., et al.: Efficient bit rate transcoding for high efficiency video coding. IEEE Trans. Multimedia 18(3), 364–378 (2016)
Article Google Scholar
Chang, C.-C., Lin, C.-J.: LIBSVM: a library for support vector machine. ACM Trans. Intell. Syst. Technol. 2(3), 1–27 (2011)
Article Google Scholar
Guyon, I., Weston, J., Barnhill, S., et al.: Gene selection for cancer classification using support vector machines. Mach. Learn. 46, 389–422 (2002)
Article Google Scholar
Yan, K., Zhang, D.: Feature selection and analysis on correlated gas sensor data with recursive feature elimination. Sens. Actuators, B Chem. 212, 353–363 (2015)
Article Google Scholar
Wu, J.: Beauty of Mathematics. Posts and Telecommunications Press, Beijing (2020)
Google Scholar
Yang, J., Qing, L., Zeng, W., et al.: High-order statistical modeling based on a decision tree for distributed video coding. IEEE Trans. Circuits Syst. Video Technol. 29(5), 1488–1502 (2019)
Article Google Scholar
Bjøntegaard, G.: Calculation of average PSNR differences between RD-curves. document M33, ITU-T Video Coding Experts Group, Austin, TX, USA (2001)

Download references

Acknowledgements

This work was supported in part by Central Government Funds of Guiding Local Scientific and Technological Development for Sichuan Province under Grant No. 2021ZYD0030, the State Scholarship Fund of China Scholarship Council under Grant No. 202108515045, and the Scientific Research Project of Sichuan Science and Technology Department under Grant No.2021YJ0099.

Author information

Authors and Affiliations

College of Communication Engineering, Chengdu University of Information Technology, Chengdu, 610225, China
Jia Yang & Ya-Juan Xue
School of Aeronautics and Astronautics, University of Electronic Science and Technology of China, Chengdu, 611731, China
Pi-Yan He & Yan-Hua Jin
College of Electronics and Information Engineering, Sichuan University, Chengdu, 610065, China
Hong Yang

Authors

Jia Yang
View author publications
You can also search for this author in PubMed Google Scholar
Pi-Yan He
View author publications
You can also search for this author in PubMed Google Scholar
Hong Yang
View author publications
You can also search for this author in PubMed Google Scholar
Ya-Juan Xue
View author publications
You can also search for this author in PubMed Google Scholar
Yan-Hua Jin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Pi-Yan He.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yang, J., He, PY., Yang, H. et al. Machine learning-based fast transcoding for low-power video communication in Internet of Things. SIViP 16, 2183–2191 (2022). https://doi.org/10.1007/s11760-022-02182-7

Download citation

Received: 12 October 2021
Revised: 09 February 2022
Accepted: 14 February 2022
Published: 27 March 2022
Issue Date: November 2022
DOI: https://doi.org/10.1007/s11760-022-02182-7

Keyword

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Machine learning-based fast transcoding for low-power video communication in Internet of Things

Abstract

Access this article

Similar content being viewed by others

Low complexity based ultra-high quality video compression method for multimedia-centric internet of things (IoT) services

HEVC coding unit decision based on machine learning

Improved Prediction Estimation Based H.264 to HEVC Intra Transcoding

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keyword

Navigation

Machine learning-based fast transcoding for low-power video communication in Internet of Things

Abstract

Access this article

Similar content being viewed by others

Low complexity based ultra-high quality video compression method for multimedia-centric internet of things (IoT) services

HEVC coding unit decision based on machine learning

Improved Prediction Estimation Based H.264 to HEVC Intra Transcoding

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keyword

Search

Navigation