Skip to main content
Log in

Fast macroblock mode decision for H.264/AVC baseline profile video transcoder based on support vector machines

  • Regular Paper
  • Published:
Multimedia Systems Aims and scope Submit manuscript

Abstract

In video transcoding, accuracy and efficiency of macroblock mode decision are critical issues at the re-encoder side due to the changes in frame size, frame rate, and bit rate. In this paper, a fast macroblock mode decision scheme based on support vector machines is proposed for H.264/AVC baseline profile video transcoder. Features including motion vectors, residual data, pre-encoded macroblock modes, and quantization parameters are extracted from incoming bitstream in both of training stage and classification stage. Feature extraction methods are investigated for spatial resolution transcoder, temporal resolution transcoder, and bit-rate transcoder. After off-line training and simplification of support vectors, the obtained support vector machine classifier can determine macroblock mode in the re-encoder accurately. Extensive experiments are carried out on different types of transcoders and results show that the proposed method can save about 80% in computational complexity compared to full mode search algorithm implemented in the latest H.264/AVC reference software (JM17.1), while maximum peak signal-to-noise ratio is degraded by 0.2–1.1 dB depending on different sequences and bit rate.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15

Similar content being viewed by others

References

  1. Ahmad, I., Wei, X., Sun, Y., Zhang, Y.-Q.: Video transcoding: an overview of various techniques and research issues. IEEE Trans. Multimed. 7(5), 793–804 (2005)

    Article  Google Scholar 

  2. Xin, J., Lin, C.-W., Sun, M.-T.: Digital video transcoding. Proc. IEEE 93(1), 84–97 (2005)

    Article  Google Scholar 

  3. Kim, J.-H., Kim, B.-G.: Fast block mode decision algorithm in H.264/AVC video coding. J. Vis. Commun. Image. Represent. 19(3), 175–183 (2008)

    Article  Google Scholar 

  4. Richardson, I.E.G.: H.264 and MPEG-4 video compression: video coding for next-generation multimedia. Wiley, England (2003)

    Book  Google Scholar 

  5. Kuo, T.-Y., Chan, C.-H.: Fast variable block size motion estimation for H.264 using likelihood and correlation of motion Field. IEEE. Transact. Circ. Syst. Video Technol. 16(10), 1185–1195 (2006)

    Article  Google Scholar 

  6. Tsai, A.-C., Paul, A., Wang, J.-C., Wang, J.-F.: Intensity gradient technique for efficient intra-prediction in H.264/AVC. IEEE. Transact. Circ. Syst. Video Technol. 18(5), 694–698 (2008)

    Article  Google Scholar 

  7. Xin, J., Vetro, A., Sun, H., Su, Y.: Efficient MPEG-2 to H.264/AVC transcoding of intra-coded video. EURASIP J. Adv. Signal Process., 12 (2007)

  8. Lefol, D., Bull, D., Canagarajah, N., Redmill, D.: An efficient complexity-scalable video transcoder with mode refinement. Signal. Process. Image. Commun. 22, 421–433 (2007)

    Article  Google Scholar 

  9. Kalva, H., Petljanski, B.: Exploiting the directional features in MPEG-2 for H.264 intra transcoding. IEEE Trans. Consumer Electron. 52(2), 706–711 (2006)

    Article  Google Scholar 

  10. Bialkowski, J., Barkowsky, M., Kaup, A.: Fast video transcoding from H.263 to H.264/MPEG-4 AVC. Multime´d. Tools App. 35, 127–146 (2007)

    Article  Google Scholar 

  11. Lee, Y.-K., Lee, S.-S., Lee, Y.-L.: MPEG-4 to H.264 transcoding with frame rate reduction. Multime´d. Tools App. 35, 147–162 (2007)

    Article  Google Scholar 

  12. Li, C.-H., Wang, C.-N., Chiang, T.: A fast downsizing video transcoder based on H.264/AVC standard. In: 5th Pacific Rim Conference on Multimedia (PCM 2004), Lecture Notes in Computer Science vol. 3331. Springer, Japan, pp. 215–223 (2004)

  13. Lefol, D., Bull, D.: Mode refinement algorithm for H.264 inter frame requantization, IEEE International Conference on Image Processing (ICIP 2006), pp. 845–848, Atlanta, GA, USA, Oct. (2006)

  14. De Cock, J., Notebasert, S., Lambert, P., De Schrijver, D., Van de Walle, R.: Requantization transcoding in pixel and frequency domain for intra 16x16 in H.264/AVC, Advanced Concepts for Intelligent Vision Systems (ACIVS 2006), LNCS vol. 4179, pp. 533–544, Belgium, Sept. (2006)

  15. Notebasert, S., De Cock, J., De Wolf, K., Van de Walle, R.: Requantization transcoding of H.264/AVC bitstreams for intra 4 × 4 prediction modes. In: 2006 Pacific-Rim Conference on Multimedia (PCM 2006), Hangzhou, China. LNCS vol. 4261, pp. 808–817, Nov. (2006)

  16. Liu, Z.-G., Wang, Y.: Intra Mode Selection in Downsizing Video Transcoder Based on H.264. Int. J. Imaging Syst. Technol. 19(4), 340–349 (2009)

    Article  Google Scholar 

  17. Lamy-Bergot, C., Renan, E., Gadat, B., Lavaux, D.: Data supervision for adaptively transcoded video surveillance over wireless links. In: IEEE 9th International Conference on Intelligent Transport Systems Telecommunications (ITST 2009), France, pp. 415–419 LILLE, Oct. 2009

  18. Fernández-Escribano, G., Bialkowski, J., Gámez, J.A., Kalva, H., Cuenca, P., Orozco-Barbosa, L., Kaup, A.: Low-complexity heterogeneous video transcoding using data mining. IEEE Trans. Multime´d. 10(2), 286–299 (2008)

    Article  Google Scholar 

  19. Jing, X., Siu, W.-C., Chau, L.-P., Constantinides, A.G.: Efficient inter mode decision for H.263 to H.264 video transcoding using support vector machines. IEEE International Symposium on Circuits and Systems (ISCAS 2009), pp. 2349–2352, Taipei May 2009

  20. Liu, Y., Zheng, Y.F.: Soft SVM and its application in video-object extraction. IEEE Trans. Signal Process. 55(7), 3272–3282 (2007)

    Article  MathSciNet  Google Scholar 

  21. Yuan, J., Wang, H., Xiao, L., Zheng, W., Jianmin, L., Lin, F., Zhang, B.: A formal study of shot boundary detection. IEEE Trans. Circuits Syst. Video Technol. 17(2), 168–186 (2007)

    Article  Google Scholar 

  22. Zhou, J., Gao, D., Zhang, D.: Moving vehicle detection for automatic traffic monitoring. IEEE Trans. Veh. Technol. 56(1), 51–59 (2007)

    Article  Google Scholar 

  23. Shanableh, T., Ghanbari, M.: Heterogeneous video transcoding to lower spatio-temporal resolutions and different encoding formats. IEEE Trans. Multime´d. 2(2), 101–110 (2000)

    Article  Google Scholar 

  24. Björk, N., Christopoulos, C.: Transcoder architectures for video coding. IEEE Trans. Consumer Electron. 44(1), 88–98 (1998)

    Article  Google Scholar 

  25. Youn, J., Sun, M.-T., Lin, C.-W.: Motion vector refinement for high-performance transcoding. IEEE Trans. Multimed. 1(1), 30–40 (1999)

    Article  Google Scholar 

  26. Chen, M.-J., Chu, M.-C., Pan, C.-W.: Efficient motion-estimation algorithm for reduced frame-rate video transcoder. IEEE Lett. Circuits Syst. Video Technol. 12(4), 269–275 (2002)

    Article  Google Scholar 

  27. Lee, W.J., Ho, W.J.: Adaptive frame-skipping for video transcoding, IEEE International Conference on Image Processing (ICIP 2003) 1:165–168 (2003)

  28. Downs, T., Gates, K.E., Masters, A.: Exact simplification of support vector solutions. J. Mach. Learn. Res. 2, 293–297 (2001)

    Google Scholar 

  29. Nguyen, D.D., Ho, T.B.: A bottom-up method for simplifying support vector solutions. IEEE Trans. Neural Netw. 17, 792–796 (2006)

    Article  Google Scholar 

  30. Wu, T.-F., Lin, C.-J., Weng, R.C.: Probability estimates for multi-class classification by pairwise coupling. J. Mach. Learn. Res. 5, 975–1005 (2004)

    MathSciNet  MATH  Google Scholar 

  31. Tao, Q., Wu, G.-W., Wang, Fei.-Yue., Wang, J.: Posterior probability support vector machines for unbalanced data. IEEE Trans. Neural Netw. 16(6), 1561–1573 (2005)

    Article  Google Scholar 

  32. Gönen, M., Tanugur, A.G., Alpaydın, E.: Multiclass posterior probability support vector machines. IEEE Trans. Neural Netw. 19(1), 130–139 (2008)

    Article  Google Scholar 

  33. Chang, C.-C., Lin, C.-J. LIBSVM: a Library for support vector machines. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm/

Download references

Acknowledgement

This work is supported by the National Natural Science Foundation of China (No. 61103118, No. 61073162), and the Shandong Natural Science Foundation of China (No. ZR2011FQ021).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhao-Guang Liu.

Additional information

Communicated by Thomas Haenselmann.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Liu, ZG., Yang, Y. & Ji, XH. Fast macroblock mode decision for H.264/AVC baseline profile video transcoder based on support vector machines. Multimedia Systems 18, 359–372 (2012). https://doi.org/10.1007/s00530-011-0256-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00530-011-0256-7

Keywords