A first approach to speeding-up the inter mode selection in MPEG-2/H.264 transcoders using machine learning

Multimedia Tools and Applications

Abstract

The H.264 standard achieves much higher coding efficiency than the MPEG-2 standard, owing to its improved inter and intra prediction modes, which come at the cost of higher computational complexity. Transcoding MPEG-2 video to H.264 is important for enabling gradual migration to H.264. However, given the significant differences between the MPEG-2 and H.264 coding algorithms, transcoding is much more complex, and new approaches are necessary. In this paper, we introduce and evaluate a low-complexity macroblock (MB) partition mode decision algorithm, to be used as part of highly efficient inter-frame prediction in an MPEG-2 to H.264 transcoder. The proposed tools compute an optimal MB coding mode decision with significantly reduced computational complexity. Specifically, we achieve the computational savings by using the following MB information obtained from MPEG-2: the MB coding modes, the coded block pattern (CBPC), and the mean and variance of the sixteen 4 × 4 sub-blocks of the MPEG-2 residual MBs. We use data mining algorithms to develop decision trees for H.264 coding mode decisions. The decision trees are built using rate-distortion (RD) optimized mode decisions and yield highly efficient mode decisions with significantly reduced computational complexity. The proposed transcoder is 35% faster than the RD-optimized H.264 reference transcoder without significant PSNR degradation (0.05 dB on average). It also performs over 0.4 dB better on average than SAE cost-based H.264 transcoding.
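As an illustration of the residual features named in the abstract, the following minimal Python sketch computes the mean and variance of the sixteen 4 × 4 sub-blocks of a 16 × 16 residual macroblock. The function name `subblock_stats`, the list-of-lists representation, the raster ordering of sub-blocks, and the use of population variance (dividing by 16) are assumptions made for this example; the abstract does not specify them, and this is not the authors' implementation.

```python
from typing import List, Tuple

Block = List[List[int]]  # hypothetical representation: 16 rows of 16 residual samples


def subblock_stats(residual_mb: Block) -> List[Tuple[float, float]]:
    """Return (mean, variance) for each 4x4 sub-block of a 16x16 residual MB.

    Sub-blocks are visited in raster order (left to right, top to bottom).
    Population variance is used; this is an assumption of the sketch.
    """
    if len(residual_mb) != 16 or any(len(row) != 16 for row in residual_mb):
        raise ValueError("expected a 16x16 macroblock")

    stats = []
    for by in range(0, 16, 4):          # top row of the current sub-block
        for bx in range(0, 16, 4):      # left column of the current sub-block
            samples = [
                residual_mb[y][x]
                for y in range(by, by + 4)
                for x in range(bx, bx + 4)
            ]
            mean = sum(samples) / 16
            variance = sum((s - mean) ** 2 for s in samples) / 16
            stats.append((mean, variance))
    return stats


# Usage: a flat residual has the same mean and zero variance in every sub-block.
flat_mb = [[3] * 16 for _ in range(16)]
stats = subblock_stats(flat_mb)
```

The resulting 32 values (16 means and 16 variances), together with the MPEG-2 MB coding mode and coded block pattern, would form the kind of feature vector a decision-tree learner could be trained on against RD-optimized mode decisions.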


Author information

Corresponding author

Correspondence to Gerardo Fernández-Escribano.

About this article

Cite this article

Fernández-Escribano, G., Kalva, H., Cuenca, P. et al. A first approach to speeding-up the inter mode selection in MPEG-2/H.264 transcoders using machine learning. Multimed Tools Appl 35, 225–240 (2007). https://doi.org/10.1007/s11042-007-0124-9

  • DOI: https://doi.org/10.1007/s11042-007-0124-9