Abstract
Many modern video encoders use the Lagrangian rate-distortion optimization (RDO) algorithm for mode decisions during the compression procedure. For each encoding stage, this approach involves minimizing a cost, which is a function of rate, distortion and a multiplier called Lambda. This paper proposes to improve the RDO process by applying two modifications. The first modification is to increase the accuracy of rate estimation, which is achieved by computing a non-integer number of bits for arithmetic coding of the syntax elements. This leads to a more accurate cost computation and therefore a better mode decision. The second modification is to search and adjust the value of Lambda based on the characteristics of each coding stage. For the encoder used, this paper proposes to search multiple values of Lambda for the intra-4×4mode decision. Moreover, a simple shift in Lambda value is proposed for motion estimation. Each of these modifications offers a certain gain in RDO performance, and, when all are combined, an average bit-rate saving of up to 7.0% can be achieved for the H.264/AVC codec while the same concept is applicable to the H.265/HEVC codec as well. The extra added complexity is contained to a certain level, and is also adjustable according to the processing resources available.
Similar content being viewed by others
References
ITU-T Recommendation H.265. Series H: audiovisual and multimedia systems: infrastructure of audiovisual services — coding of moving video. High Efficiency Video Coding, 2013
ITU-T Recommendation H.264. Advanced video coding for generic audiovisual services. ISO/IEC, 2013
Sullivan G J, Wiegand T. Rate-distortion optimization for video compression. IEEE Signal Processing Magazine, 1998, 15(6): 74–90
Wiegand T, Schwarz H, Joch A, Kossentini F, Sullivan G J. Rateconstrained coder control and comparison of video coding standards. IEEE Transactions on Circuits and Systems for Video Technology, 2003, 13(7): 688–703
An C, Nguyen T Q. Iterative rate-distortion optimization of H.264 with constant bit rate constraint. IEEE Transactions on Image Processing, 2008, 17(9): 1605–1615
You J, Choi C, Jeong J. Modified rate distortion optimization using inter-block dependence for H.264/AVC intra coding. IEEE Transactions on Consumer Electronics, 2008, 54(3): 1383–1388
Li B, Xu J, Zhang D, Li H. QP refinement according to Lagrange multiplier for high efficiency video coding. In: Proceedings of 2013 IEEE International Symposium on Circuits and Systems (ISCAS). 2013, 477–480
Wu S, Liu P, Huang Y, Liu Q, Ikenaga T. On bit allocation and Lagrange Multiplier adjustment for rate-distortion optimized H. 264 rate control. In: Proceedings of IEEE International Workshop on Multimedia Signal Processing. 2009, 1–6
Lee Y M, Sun Y T, Lin Y. SATD-based intra mode decision for H.264/AVC video coding. IEEE Transactions on Circuits and Systems for Video Technology, 2010, 20(3): 463–469
Ghandi M M, Ghanbari M. A Lagrangian optimized rate control algorithm for the H. 264/AVC encoder. In: Processings of 2004 International Conference on Image Processing., 2004, 123–126
Zhang J, Yi X, Ling N, Shang W. Context adaptive Lagrange multiplier (CALM) for rate-distortion optimal motion estimation in video coding. IEEE Transactions on Circuits and Systems for Video Technology,, 2010, 20(6): 820–828
Li X, Oertel N, Hutter A, Kaup A. Laplace distribution based Lagrangian rate distortion optimization for hybrid video coding. IEEE Transactions on Circuits and Systems for Video Technology,, 2009, 19(2): 193–205
Marpe D, Schwarz H, Wiegand T. Context-based adaptive binary arithmetic coding in the H.264/AVC video compression standard. IEEE Transactions on Circuits and Systems for Video Technology,, 2003, 13(7): 620–636
Sze V, Budagavi M. High throughput CABAC entropy coding in HEVC. IEEE Transactions on Circuits and Systems for Video Technology,, 2012, 22(12): 1778–1791
Richardson I E. The H.264 Advanced Video Compression Standard. New York: John Wiley & Sons,, 2011
Ugur K, Andersson K, Fuldseth A, Bjontegaard G, Endresen L P, Lainema J, Wu Z. High performance, low complexity video coding and the emerging HEVC standard. IEEE Transactions on Circuits and Systems for Video Technology,, 2010, 20(12): 1688–1697
Han W J, Min J, Kim I K, Alshina E, Alshin A, Lee T, Park J H. Improved video compression efficiency through flexible unit representation and corresponding extension of coding tools. IEEE Transactions on Circuits and Systems for Video Technology,, 2010, 20(12): 1709–1720
Wiegand T. WD3: working draft 3 of high-efficiency video coding. Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG16 WP3 and ISO. IEC JTC1/SC29/WG11,, 2011
Shannon C E, Weaver W. The Mathematical Theory of Communication. Urbana: The University of Illinois Press,, 1949
Senzaki K, Chono K, Aoki H, Tajime J, Senda Y. BD-PSNR/Rate computation tool for five data points, In: Proceedings of the Meeting of Joint Collaborative Team on Video Coding. 2010
Im S K, Ghandi M M, Lam C T. Non-integer bit estimation for ratedistortion optimized video coding. In: Processings of 2012 IEEE International Conference on Consumer Electronics., 2012, 88–89
Li Z, Tourapis A M. Motion estimation with entropy coding considerations in H. 264/AVC. In: Processings of the 15th IEEE International Conference on Image Processing., 2008, 2140–2143
Author information
Authors and Affiliations
Corresponding author
Additional information
Sio Kei Im received his degree in computer science in 1998 and the master degree in enterprise information system in 1999, both from King’s College, University of London, UK. He received the PhD degree in electronic engineering from Queen Mary University of London (QMUL), UK in 2007. He gained the position of Lecturer within the Computing Programme at Macao Polytechnic Institute (MPI) in 2001. In 2005, he became the operations manager of the MPIQMUL information systems research centre jointly operated byMPI and QMUL where he carried out signal processing work. In 2009, he was promoted to be the associate professor at the Macao Polytechnic Institute. Currently, he is a visiting scholar at School of Engineering, UCLA and a member of IET.
Mohammad Mahdi Ghandi received his BS and MS degrees in electronics engineering from the University of Tehran, Iran in 1998 and 2001 respectively. After two years of industrial experience in image and video coding, he, as a senior research officer, joined the Video Networking Group at the University of Essex, UK in 2003, where he published several papers in the field of video communications. He was also granted a PhD degree from this university. Since 2006, he returned back to TV broadcasting industry as a designer of compression systems while maintaining some of his academic activities. His research interests include reliable image and video transmission, advanced video codecs, and video transcoding.
Rights and permissions
About this article
Cite this article
Im, S.K., Ghandi, M.M. Improved rate-distortion optimized video coding using non-integer bit estimation and multiple Lambda search. Front. Comput. Sci. 10, 157–166 (2016). https://doi.org/10.1007/s11704-015-5066-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11704-015-5066-1