Skip to main content
Log in

Machine learning-based fast transcoding for low-power video communication in Internet of Things

  • Original Paper
  • Published:
Signal, Image and Video Processing Aims and scope Submit manuscript

Abstract

Aiming at the low-power video communication with low latency at resource-constrained terminals in the Internet of Things, a machine learning-based fast transcoding from distributed video coding (DVC) to high-efficiency video coding (HEVC) is put forward. In order to accelerate the transcoding, the DVC decoding information is efficiently exploited to reduce HEVC encoding complexity at both levels of coding unit (CU) and prediction unit (PU). First of all, the predictions of CU partition and PU modes are regarded as two binary classification tasks. Subsequently, the initial features are extracted from the DVC decoding information and the support vector machine (SVM)-recursive feature elimination algorithm is adopted to select feature vectors to construct the training data which is used to train the SVM classifiers for CU and PU, respectively. By means of the top-down division prediction method, the CU partition is first determined by the trained SVM classifier. For the CUs which are not further split, the PU modes will be predicted to terminate the quad-tree coding process of HEVC in advance, so that HEVC encoding complexity is reduced. Experimental results show that the proposed algorithm can reduce 57.64% computational complexity on average with Bjøntegaard delta bit-rate 2.43%. In terms of transcoding time efficiency, the proposed algorithm outperforms the state-of-the-art fast DVC to HEVC transcoding algorithms based on machine learning.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

References

  1. Chen, W.: Intelligent manufacturing production line data monitoring system for industrial internet of things. Comput Commun. 151, 31–41 (2020)

    Article  Google Scholar 

  2. Shafi, U., Mumtaz, R., García-Nieto, J., et al.: Precision agriculture techniques and practices: from considerations to applications. Sensors (Basel) 19(17), 3796 (2019)

    Article  Google Scholar 

  3. Yin, Y., Zeng, Y., Chen, X., et al.: The internet of things in healthcare: an overview. J. Ind. Inf. Integration 1, 3–13 (2016)

    Google Scholar 

  4. Baucas, M.J., Spachos, P., Gregori, S.: Internet-of-Things devices and assistive technologies for health care: applications, challenges, and opportunities. IEEE Signal Process. Mag. 38(4), 65–77 (2021)

    Article  Google Scholar 

  5. Akpakwu, G.A., Silva, B.J., Hancke, G.P., et al.: A survey on 5G networks for the internet of things: communication technologies and challenges. IEEE Access. 6, 3619–3647 (2018)

    Article  Google Scholar 

  6. Girod, B., Aaron, A.M., Rane, S., et al.: Distributed video coding. Proc. IEEE. 93(1), 71–83 (2005)

    Article  Google Scholar 

  7. Kodavalla VK (2018) Transcoding of next generation distributed video codec for mobile video. In: Proceedings of ICAECC, pp 1–7

  8. Peixoto E, Queiroz RL. de, Mukherjee D (2008) Mobile video communications using a Wyner–Ziv transcoder. In: Proceedings of SPIE, pp 1–11

  9. Peixoto, E., de Queiroz, R.L., Mukherjee, D.: A Wyner–Ziv video transcoder. IEEE Trans. Circ. Syst. Video Technol. 20(2), 189–200 (2010)

    Article  Google Scholar 

  10. Martínez, J.L., Fernández-Escribano, G., Kalva, H., et al.: Motion vector refinement in a Wyner–Ziv to H.264 transcoder for mobile telephony. IET Image Process. 3(6), 335–339 (2009)

    Article  Google Scholar 

  11. Corrales-García, A., Martínez, J.L., Fernández-Escribano, G., et al.: Variable and constant bitrate in a DVC to H.264/AVC transcoder. Signal Process Image Commun. 26(6), 310–323 (2011)

    Article  Google Scholar 

  12. Corrales-García, A., Rodríguez-Sánchez, R., Martínez, J. L., et al.: A GPU-Based DVC to H.264/AVC Transcoder. In: Proceedings of HAIS, pp. 233–240 (2010)

  13. Corrales-García, A., Rodríguez-Sánchez, R., Martínez, J.L., et al.: Multimedia communications using a fast and flexible DVC to H.264/AVC/SVC transcoder. J. Signal Process. Syst. 79(3), 211–232 (2015)

    Article  Google Scholar 

  14. Kao, C., Huang, T., Chen, H. H., et al.: Perceptully lossless video re-encoding for cloud transcoding. In: Proceedings of China SIP, pp. 301–305 (2014)

  15. Rong, S., Qing, L., Xu, Y., et al.: Mobile video communications based on cloud transcoding. Proceedings. 1(3), 1431–1433 (2017)

    Google Scholar 

  16. Lei, T.C.W., Tseng, F.S.: Light-weight video codec at terminal for video coding in the cloud. J. Signal Process Syst. 91, 627–639 (2019)

    Article  Google Scholar 

  17. Zhu, L., Zhang, Y., Li, N., et al.: Machine learning based fast H.264/AVC to HEVC transcoding exploiting block partition similarity. J. Vis. Commun. Image Rep. 38, 824–837 (2016)

    Article  Google Scholar 

  18. Zhang, D., Tong, J., Zang, D.: Fast CU partition for H.264/AVC to HEVC Transcoding Based on Fisher Discriminant Analysis. In: Proceedings of VCIP, pp. 1–4 (2016)

  19. Lin, H., He, X., Qing, L., et al.: Machine learning-based H.264/AVC to HEVC transcoding via motion information reuse and coding mode similarity analysis. IET Image Process 13(1), 34–43 (2019)

    Article  Google Scholar 

  20. Martínez, J.L., Corrales-García, A., Cuenca, P., et al.: Applying data mining techniques in a Wyner–Ziv to H.264 video transcoder. In: Proceedings of IWANN, pp. 497–504 (2011)

  21. Qing, L., Hua, L., He, X., et al.: A video transcoding method from DVC to HEVC based on Naive Bayes. China patent, CN201811479353.9 (2019)

  22. Hua, L., Tang, T., Qing, L., et al.: Fast transcoding of DVC-HEVC based on naive Bayes classification. J. Terahertz Sci. Electron. Inf. Technol. 18(2), 235–240 (2020)

    Google Scholar 

  23. Qing, L., Hua, L., He, X., et al.: A video transcoding method from DVC to HEVC based on fisher discriminant. China patent, CN201810573870.6 (2018)

  24. Yang, J., Qing, L., He, X., et al.: A fast DVC to HEVC transcoding for mobile video communication. In: Proceedings of ITNEC, pp. 505–509 (2019)

  25. Van, L.P., Praeter, J.D., Wallendael, G.V., et al.: Efficient bit rate transcoding for high efficiency video coding. IEEE Trans. Multimedia 18(3), 364–378 (2016)

    Article  Google Scholar 

  26. Chang, C.-C., Lin, C.-J.: LIBSVM: a library for support vector machine. ACM Trans. Intell. Syst. Technol. 2(3), 1–27 (2011)

    Article  Google Scholar 

  27. Guyon, I., Weston, J., Barnhill, S., et al.: Gene selection for cancer classification using support vector machines. Mach. Learn. 46, 389–422 (2002)

    Article  Google Scholar 

  28. Yan, K., Zhang, D.: Feature selection and analysis on correlated gas sensor data with recursive feature elimination. Sens. Actuators, B Chem. 212, 353–363 (2015)

    Article  Google Scholar 

  29. Wu, J.: Beauty of Mathematics. Posts and Telecommunications Press, Beijing (2020)

    Google Scholar 

  30. Yang, J., Qing, L., Zeng, W., et al.: High-order statistical modeling based on a decision tree for distributed video coding. IEEE Trans. Circuits Syst. Video Technol. 29(5), 1488–1502 (2019)

    Article  Google Scholar 

  31. Bjøntegaard, G.: Calculation of average PSNR differences between RD-curves. document M33, ITU-T Video Coding Experts Group, Austin, TX, USA (2001)

Download references

Acknowledgements

This work was supported in part by Central Government Funds of Guiding Local Scientific and Technological Development for Sichuan Province under Grant No. 2021ZYD0030, the State Scholarship Fund of China Scholarship Council under Grant No. 202108515045, and the Scientific Research Project of Sichuan Science and Technology Department under Grant No.2021YJ0099.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Pi-Yan He.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yang, J., He, PY., Yang, H. et al. Machine learning-based fast transcoding for low-power video communication in Internet of Things. SIViP 16, 2183–2191 (2022). https://doi.org/10.1007/s11760-022-02182-7

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11760-022-02182-7

Keyword

Navigation