Skip to main content
Log in

Fast algorithms for designing nearly optimal lookup tables for complexity control of the H.264 encoder

  • Original Paper
  • Published:
Signal, Image and Video Processing Aims and scope Submit manuscript

Abstract

The H.264/AVC standard significantly improves video compression performance over earlier standards at the cost of increased complexity. In this paper, we present two offline algorithms for generating a lookup table of parameter settings that can be used by a complexity control algorithm for controlling the speed of the H.264 encoder. Our algorithms to design the lookup table of parameter settings take fewer than 2% of the number of encodings required by an exhaustive search of all possible parameter settings and find parameter settings that offer high peak signal-to-noise ratio (PSNR) with low encoding time at a given bitrate. Our parameter settings are fairly robust over different videos and bitrates. We focus on low-resolution videos at bitrates less than 300 kb/s. We compare the performance of our algorithms to both exhaustive search and a multiobjective optimization algorithm. Our parameter settings improve the average encoding speed over the default parameter setting of the x264 encoder on both PC and cell phone platforms by up to 37.4 and 94.1%, respectively, with PSNR difference of up to 0.3 dB.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. JM ver. 10.2 http://iphome.hhi.de/suehring/tml/index.htm

  2. Mobile ASL http://mobileasl.cs.washington.edu

  3. x264 http://developers.videolan.org/x264.html

  4. YUV QCIF samples http://www.tkn.tu-berlin.de/research/evalvid/qcif.html

  5. Advanced video coding for generic audiovisual services. ITU-T Recommendation H.264 (2005)

  6. Agrawal, P., Chen, S., Ramanathan, P., Sivalingam, K.: Battery power sensitive video processing in wireless networks. In: The Ninth IEEE International Symposium on Personal, Indoor And Mobile Radio Communications, vol. 1, pp. 116–120 (1998)

  7. Akyol, E., Mukherjee, D., Liu, Y.: Complexity control for real-time video coding. In: Proceedings of IEEE International Conference on Image Processing (ICIP), pp. 77–80 (2007)

  8. Breiman L., Friedman J., Olshen R., Stone C.: Classification and Regression Trees. Wadsworth and Brooks, Monterey (1984)

    MATH  Google Scholar 

  9. Chen, Z., Zhou, P., He, Y.: Fast integer pel and fractional pel motion estimation for JVT. JVT-F017 (2002)

  10. Chon, J., Cherniavsky, N., A.Riskin, E., E.Ladner, R.: Enabling access through real-time sign language communication over cell phones. In: Proceedings of the 43rd Asilomar Conference on Signals, Systems, and Computers. Pacific Grove, CA (2009)

  11. Chou P.A., Lookabaugh T.D., Gray R.M.: Optimal pruning with applications to tree-structured source coding and modeling. IEEE Trans. Inf. Theory 35(2), 299–315 (1989)

    Article  MathSciNet  Google Scholar 

  12. Coello Coello C.A., Toscano Pulido G., Salazar Lechuga M.: Handling multiple objectives with particle swarm optimization. IEEE Trans. Evol. Comput. 8(3), 256–279 (2004)

    Article  Google Scholar 

  13. Collette Y., Siarry P.: Multiobjective Optimization. Principles and Case Studies. Springer, New York (2003)

    Google Scholar 

  14. CS MSU Graphics Media Lab Video Group: MPEG-4 AVC/H.264 video codec comparison (2005). http://www.compression.ru/video/index.htm

  15. He Z., Cheng W., Chen X.: Energy minimization of portable video communication devices based on power-rate-distortion optimization. IEEE Trans. Circuits Syst. Video Technol. 18(5), 596–608 (2008)

    Article  Google Scholar 

  16. He Z., Liang Y., Chen L., Ahmad I., Wu D.: Power-rate-distortion analysis for wireless video communication under energy constraints. IEEE Trans. Circuits Syst. Video Technol. 15(5), 645–658 (2005)

    Article  Google Scholar 

  17. Hu, Y., Li, Q., Ma, S., Kuo, C.C.: Joint rate-distortion-complexity optimization for H.264 motion search. In: 2006 IEEE International Conference on Multimedia and Expo, pp. 1949–1952 (2006)

  18. Ivanov Y.V., Bleakley C.J.: Real-time H.264 video encoding in software with fast mode decision and dynamic complexity control. ACM Trans. Multimedia Comput. Commun. Appl. 6, 5:1–5:21 (2010)

    Article  Google Scholar 

  19. Jeon, B., Lee, J.: Fast mode decision for H.264. JVT-J033 (2003)

  20. Kaminsky E., Grois D., Hadar O.: Dynamic computational complexity and bit allocation for optimizing H.264/AVC video compression. J. Vis. Commun. Image Represent. 19, 56–74 (2008)

    Article  Google Scholar 

  21. Kannangara C.S., Richardson I.E.G., Miller A.J.: Computational complexity management of a real-time H.264/AVC encoder. IEEE Trans. Circuits Syst. Video Technol. 18(9), 1191–1200 (2008)

    Article  Google Scholar 

  22. Kannangara C.S., Richardson I.E., Bystrom M., Zhao Y.: Complexity control of H.264/AVC based on mode-conditional cost probability distributions. IEEE Trans. Multimed. 11, 433–442 (2009)

    Article  Google Scholar 

  23. Kiang S.Z., Baker R.L., Sullivan G.J., Chiu C.Y.: Recursive optimal pruning with applications to tree structured vector quantizers. IEEE Trans. Image Process. 1(2), 162–169 (1992)

    Article  Google Scholar 

  24. Kwon D.N., Driessen P.F., Basso A., Agathoklis P.: Performance and computational complexity optimization in configurable hybrid video coding system. IEEE Trans. Circuits Syst. Video Technol. 16(1), 31–42 (2006)

    Article  Google Scholar 

  25. Lechuga M.S., Pulido G.T.: Multi-objective particle swarm optimization (MOPSO). http://delta.cs.cinvestav.mx/~ccoello/EMOO/mopso.tar.gz

  26. Li D., Sun Y., Feng Z.: Joint power allocation and rate control for real-time video transmission over wireless system. In: Proceedings of IEEE Globecom, vol. 4, pp. 2164–2168 (2005)

  27. Lu M.T., Yao J.J., Chen H.H.: A complexity-aware video adaptation mechanism for live streaming systems. EURASIP J. Appl. Signal Process. 2007(1), 215 (2007)

    Google Scholar 

  28. Merritt, L., Vanam, R.: Improved rate control and motion estimation for H.264 encoder. In: Proceedings of IEEE International Conference on Image Processing (ICIP), vol. 5, pp. 309–312 (2007)

  29. Pan F., Lin X., Rahardja S., Lim K.P., Li Z.G., Wu D., Wu S.: Fast mode decision algorithm for intraprediction in H.264/AVC video coding. IEEE Trans. Circuits Syst. Video Technol. 15(7), 813–822 (2005)

    Article  Google Scholar 

  30. Reed, C.M., Delhorne, L.A., Durlach, N.I., Fischer, S.D.: A study of the tactual and visual reception of fingerspelling. J. Speech Hear. Res. (33), 786–797 (1990)

  31. Rhee C.E., Jung J.S., Lee H.J.: A real-time H.264/AVC encoder with complexity-aware time allocation. IEEE Trans. Circuits Syst. Video Technol. 20(12), 1848–1862 (2010)

    Article  Google Scholar 

  32. Riskin E.A.: Optimal bit allocation via the generalized BFOS algorithm. IEEE Trans. Inf. Theory 37(2), 400–402 (1991)

    Article  MathSciNet  Google Scholar 

  33. Stottrup-Andersen, J., Forchhammer, S., Aghito, S.M.: Rate-distortion-complexity optimization of fast motion estimation in H.264/MPEG-4 AVC. In: Proceedings of IEEE International Conference on Image Processing (ICIP), pp. 111–114 (2004)

  34. Su, L., Lu, Y., Wu, F., Li, S., Gao, W.: Real-time video coding under power constraint based on H.264 codec. SPIE Vis. Commun. Image Process. 6508, 1–12 (2007)

  35. Su, L., Lu, Y., Wu, F., Li, S., Gao, W.: Complexity-constrained h.264 video encoding. IEEE Trans. Circuits Syst. Video Technol. 19, 477–490 (2009). doi:10.1109/TCSVT.2009.2014017. http://dl.acm.org/citation.cfm?id=1641645.1641648

  36. Tourapis, A.M.: Enhanced predictive zonal search for single and multiple frame motion estimation. In: Proceedings of VCIP, pp. 1069–1079 (2002)

  37. Vanam, R.: Rate-distortion-complexity optimization of video encoders with applications to sign language video compression. Ph.D. thesis, University of Washington, Seattle, WA (2010)

  38. Vanam, R., Riskin, E.A., Hemami, S.S., Ladner, R.E.: Distortion-complexity optimization of the H.264/MPEG-4 AVC encoder using the GBFOS algorithm. In: Proceedings of the IEEE Data Compression Conference, pp. 303–312. Snowbird, Utah (2007)

  39. Vanam, R., Riskin, E.A., Ladner, R.E.: H.264/MPEG-4 AVC encoder parameter selection algorithms for complexity distortion tradeoff. In: Proceedings of the IEEE Data Compression Conference, pp. 372–381. Snowbird, Utah (2009)

  40. Wiegand T., Sullivan G.J., Bjøntegaard G., Luthra A.: Overview of the H.264/AVC video coding standard. IEEE Trans. Circuits Syst. Video Technol. 13(7), 560–576 (2003)

    Article  Google Scholar 

  41. Woelders W.W., Frowein H.W., Nielsen J., Questa P., Sandini G.: New developments in low-bit rate videotelephony for people who are deaf. J. Speech Lang. Hear. Res. 40, 1425–1433 (1997)

    Google Scholar 

  42. Wu, M., Forchhammer, S., Aghito, S.M.: Complexity control of fast motion estimation in H.264/MPEG-4 AVC with rate-distortion-complexity optimization. In: Chen, C., Schonfeld, D., Luo, J., (eds.) Visual Communications and Image Processing 2007, Proceedings of SPIE, vol. 6508, pp. 650,824–650,824-11 (2007)

  43. Yin, P., Tourapis, A.M., Boyce, J.: Fast mode decision and motion estimation for JVT/H.264. In: Proceedings of IEEE International Conference on Image Processing (ICIP), vol. 3, pp. 853–856 (2003)

  44. Zhang L., Gao W.: Reusable architecture and complexity-controllable algorithm for the integer/fractional motion estimation of H.264. IEEE Trans. Consumer Electron. 18(9), 1191–1200 (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rahul Vanam.

Additional information

This work is supported by National Science Foundation under grant numbers CCF-0514353, CCF-0514357, and IIS-0811884. Preliminary versions of this work appeared in the IEEE Data Compression Conference 2007 and IEEE Data Compression Conference 2009. The first author performed this work while at the University of Washington.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Vanam, R., Riskin, E.A., Ladner, R.E. et al. Fast algorithms for designing nearly optimal lookup tables for complexity control of the H.264 encoder. SIViP 7, 991–1003 (2013). https://doi.org/10.1007/s11760-012-0288-2

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11760-012-0288-2

Keywords

Navigation