Abstract
The H.264/AVC standard significantly improves video compression performance over earlier standards at the cost of increased complexity. In this paper, we present two offline algorithms for generating a lookup table of parameter settings that can be used by a complexity control algorithm for controlling the speed of the H.264 encoder. Our algorithms to design the lookup table of parameter settings take fewer than 2% of the number of encodings required by an exhaustive search of all possible parameter settings and find parameter settings that offer high peak signal-to-noise ratio (PSNR) with low encoding time at a given bitrate. Our parameter settings are fairly robust over different videos and bitrates. We focus on low-resolution videos at bitrates less than 300 kb/s. We compare the performance of our algorithms to both exhaustive search and a multiobjective optimization algorithm. Our parameter settings improve the average encoding speed over the default parameter setting of the x264 encoder on both PC and cell phone platforms by up to 37.4 and 94.1%, respectively, with PSNR difference of up to 0.3 dB.
Similar content being viewed by others
References
JM ver. 10.2 http://iphome.hhi.de/suehring/tml/index.htm
Mobile ASL http://mobileasl.cs.washington.edu
YUV QCIF samples http://www.tkn.tu-berlin.de/research/evalvid/qcif.html
Advanced video coding for generic audiovisual services. ITU-T Recommendation H.264 (2005)
Agrawal, P., Chen, S., Ramanathan, P., Sivalingam, K.: Battery power sensitive video processing in wireless networks. In: The Ninth IEEE International Symposium on Personal, Indoor And Mobile Radio Communications, vol. 1, pp. 116–120 (1998)
Akyol, E., Mukherjee, D., Liu, Y.: Complexity control for real-time video coding. In: Proceedings of IEEE International Conference on Image Processing (ICIP), pp. 77–80 (2007)
Breiman L., Friedman J., Olshen R., Stone C.: Classification and Regression Trees. Wadsworth and Brooks, Monterey (1984)
Chen, Z., Zhou, P., He, Y.: Fast integer pel and fractional pel motion estimation for JVT. JVT-F017 (2002)
Chon, J., Cherniavsky, N., A.Riskin, E., E.Ladner, R.: Enabling access through real-time sign language communication over cell phones. In: Proceedings of the 43rd Asilomar Conference on Signals, Systems, and Computers. Pacific Grove, CA (2009)
Chou P.A., Lookabaugh T.D., Gray R.M.: Optimal pruning with applications to tree-structured source coding and modeling. IEEE Trans. Inf. Theory 35(2), 299–315 (1989)
Coello Coello C.A., Toscano Pulido G., Salazar Lechuga M.: Handling multiple objectives with particle swarm optimization. IEEE Trans. Evol. Comput. 8(3), 256–279 (2004)
Collette Y., Siarry P.: Multiobjective Optimization. Principles and Case Studies. Springer, New York (2003)
CS MSU Graphics Media Lab Video Group: MPEG-4 AVC/H.264 video codec comparison (2005). http://www.compression.ru/video/index.htm
He Z., Cheng W., Chen X.: Energy minimization of portable video communication devices based on power-rate-distortion optimization. IEEE Trans. Circuits Syst. Video Technol. 18(5), 596–608 (2008)
He Z., Liang Y., Chen L., Ahmad I., Wu D.: Power-rate-distortion analysis for wireless video communication under energy constraints. IEEE Trans. Circuits Syst. Video Technol. 15(5), 645–658 (2005)
Hu, Y., Li, Q., Ma, S., Kuo, C.C.: Joint rate-distortion-complexity optimization for H.264 motion search. In: 2006 IEEE International Conference on Multimedia and Expo, pp. 1949–1952 (2006)
Ivanov Y.V., Bleakley C.J.: Real-time H.264 video encoding in software with fast mode decision and dynamic complexity control. ACM Trans. Multimedia Comput. Commun. Appl. 6, 5:1–5:21 (2010)
Jeon, B., Lee, J.: Fast mode decision for H.264. JVT-J033 (2003)
Kaminsky E., Grois D., Hadar O.: Dynamic computational complexity and bit allocation for optimizing H.264/AVC video compression. J. Vis. Commun. Image Represent. 19, 56–74 (2008)
Kannangara C.S., Richardson I.E.G., Miller A.J.: Computational complexity management of a real-time H.264/AVC encoder. IEEE Trans. Circuits Syst. Video Technol. 18(9), 1191–1200 (2008)
Kannangara C.S., Richardson I.E., Bystrom M., Zhao Y.: Complexity control of H.264/AVC based on mode-conditional cost probability distributions. IEEE Trans. Multimed. 11, 433–442 (2009)
Kiang S.Z., Baker R.L., Sullivan G.J., Chiu C.Y.: Recursive optimal pruning with applications to tree structured vector quantizers. IEEE Trans. Image Process. 1(2), 162–169 (1992)
Kwon D.N., Driessen P.F., Basso A., Agathoklis P.: Performance and computational complexity optimization in configurable hybrid video coding system. IEEE Trans. Circuits Syst. Video Technol. 16(1), 31–42 (2006)
Lechuga M.S., Pulido G.T.: Multi-objective particle swarm optimization (MOPSO). http://delta.cs.cinvestav.mx/~ccoello/EMOO/mopso.tar.gz
Li D., Sun Y., Feng Z.: Joint power allocation and rate control for real-time video transmission over wireless system. In: Proceedings of IEEE Globecom, vol. 4, pp. 2164–2168 (2005)
Lu M.T., Yao J.J., Chen H.H.: A complexity-aware video adaptation mechanism for live streaming systems. EURASIP J. Appl. Signal Process. 2007(1), 215 (2007)
Merritt, L., Vanam, R.: Improved rate control and motion estimation for H.264 encoder. In: Proceedings of IEEE International Conference on Image Processing (ICIP), vol. 5, pp. 309–312 (2007)
Pan F., Lin X., Rahardja S., Lim K.P., Li Z.G., Wu D., Wu S.: Fast mode decision algorithm for intraprediction in H.264/AVC video coding. IEEE Trans. Circuits Syst. Video Technol. 15(7), 813–822 (2005)
Reed, C.M., Delhorne, L.A., Durlach, N.I., Fischer, S.D.: A study of the tactual and visual reception of fingerspelling. J. Speech Hear. Res. (33), 786–797 (1990)
Rhee C.E., Jung J.S., Lee H.J.: A real-time H.264/AVC encoder with complexity-aware time allocation. IEEE Trans. Circuits Syst. Video Technol. 20(12), 1848–1862 (2010)
Riskin E.A.: Optimal bit allocation via the generalized BFOS algorithm. IEEE Trans. Inf. Theory 37(2), 400–402 (1991)
Stottrup-Andersen, J., Forchhammer, S., Aghito, S.M.: Rate-distortion-complexity optimization of fast motion estimation in H.264/MPEG-4 AVC. In: Proceedings of IEEE International Conference on Image Processing (ICIP), pp. 111–114 (2004)
Su, L., Lu, Y., Wu, F., Li, S., Gao, W.: Real-time video coding under power constraint based on H.264 codec. SPIE Vis. Commun. Image Process. 6508, 1–12 (2007)
Su, L., Lu, Y., Wu, F., Li, S., Gao, W.: Complexity-constrained h.264 video encoding. IEEE Trans. Circuits Syst. Video Technol. 19, 477–490 (2009). doi:10.1109/TCSVT.2009.2014017. http://dl.acm.org/citation.cfm?id=1641645.1641648
Tourapis, A.M.: Enhanced predictive zonal search for single and multiple frame motion estimation. In: Proceedings of VCIP, pp. 1069–1079 (2002)
Vanam, R.: Rate-distortion-complexity optimization of video encoders with applications to sign language video compression. Ph.D. thesis, University of Washington, Seattle, WA (2010)
Vanam, R., Riskin, E.A., Hemami, S.S., Ladner, R.E.: Distortion-complexity optimization of the H.264/MPEG-4 AVC encoder using the GBFOS algorithm. In: Proceedings of the IEEE Data Compression Conference, pp. 303–312. Snowbird, Utah (2007)
Vanam, R., Riskin, E.A., Ladner, R.E.: H.264/MPEG-4 AVC encoder parameter selection algorithms for complexity distortion tradeoff. In: Proceedings of the IEEE Data Compression Conference, pp. 372–381. Snowbird, Utah (2009)
Wiegand T., Sullivan G.J., Bjøntegaard G., Luthra A.: Overview of the H.264/AVC video coding standard. IEEE Trans. Circuits Syst. Video Technol. 13(7), 560–576 (2003)
Woelders W.W., Frowein H.W., Nielsen J., Questa P., Sandini G.: New developments in low-bit rate videotelephony for people who are deaf. J. Speech Lang. Hear. Res. 40, 1425–1433 (1997)
Wu, M., Forchhammer, S., Aghito, S.M.: Complexity control of fast motion estimation in H.264/MPEG-4 AVC with rate-distortion-complexity optimization. In: Chen, C., Schonfeld, D., Luo, J., (eds.) Visual Communications and Image Processing 2007, Proceedings of SPIE, vol. 6508, pp. 650,824–650,824-11 (2007)
Yin, P., Tourapis, A.M., Boyce, J.: Fast mode decision and motion estimation for JVT/H.264. In: Proceedings of IEEE International Conference on Image Processing (ICIP), vol. 3, pp. 853–856 (2003)
Zhang L., Gao W.: Reusable architecture and complexity-controllable algorithm for the integer/fractional motion estimation of H.264. IEEE Trans. Consumer Electron. 18(9), 1191–1200 (2008)
Author information
Authors and Affiliations
Corresponding author
Additional information
This work is supported by National Science Foundation under grant numbers CCF-0514353, CCF-0514357, and IIS-0811884. Preliminary versions of this work appeared in the IEEE Data Compression Conference 2007 and IEEE Data Compression Conference 2009. The first author performed this work while at the University of Washington.
Rights and permissions
About this article
Cite this article
Vanam, R., Riskin, E.A., Ladner, R.E. et al. Fast algorithms for designing nearly optimal lookup tables for complexity control of the H.264 encoder. SIViP 7, 991–1003 (2013). https://doi.org/10.1007/s11760-012-0288-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-012-0288-2