Skip to main content
Log in

Improved rate-distortion optimized video coding using non-integer bit estimation and multiple Lambda search

  • Research Article
  • Published:
Frontiers of Computer Science Aims and scope Submit manuscript

Abstract

Many modern video encoders use the Lagrangian rate-distortion optimization (RDO) algorithm for mode decisions during the compression procedure. For each encoding stage, this approach involves minimizing a cost, which is a function of rate, distortion and a multiplier called Lambda. This paper proposes to improve the RDO process by applying two modifications. The first modification is to increase the accuracy of rate estimation, which is achieved by computing a non-integer number of bits for arithmetic coding of the syntax elements. This leads to a more accurate cost computation and therefore a better mode decision. The second modification is to search and adjust the value of Lambda based on the characteristics of each coding stage. For the encoder used, this paper proposes to search multiple values of Lambda for the intra-4×4mode decision. Moreover, a simple shift in Lambda value is proposed for motion estimation. Each of these modifications offers a certain gain in RDO performance, and, when all are combined, an average bit-rate saving of up to 7.0% can be achieved for the H.264/AVC codec while the same concept is applicable to the H.265/HEVC codec as well. The extra added complexity is contained to a certain level, and is also adjustable according to the processing resources available.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. ITU-T Recommendation H.265. Series H: audiovisual and multimedia systems: infrastructure of audiovisual services — coding of moving video. High Efficiency Video Coding, 2013

    Google Scholar 

  2. ITU-T Recommendation H.264. Advanced video coding for generic audiovisual services. ISO/IEC, 2013

    Google Scholar 

  3. Sullivan G J, Wiegand T. Rate-distortion optimization for video compression. IEEE Signal Processing Magazine, 1998, 15(6): 74–90

    Article  Google Scholar 

  4. Wiegand T, Schwarz H, Joch A, Kossentini F, Sullivan G J. Rateconstrained coder control and comparison of video coding standards. IEEE Transactions on Circuits and Systems for Video Technology, 2003, 13(7): 688–703

    Article  Google Scholar 

  5. An C, Nguyen T Q. Iterative rate-distortion optimization of H.264 with constant bit rate constraint. IEEE Transactions on Image Processing, 2008, 17(9): 1605–1615

    Article  MathSciNet  Google Scholar 

  6. You J, Choi C, Jeong J. Modified rate distortion optimization using inter-block dependence for H.264/AVC intra coding. IEEE Transactions on Consumer Electronics, 2008, 54(3): 1383–1388

    Article  Google Scholar 

  7. Li B, Xu J, Zhang D, Li H. QP refinement according to Lagrange multiplier for high efficiency video coding. In: Proceedings of 2013 IEEE International Symposium on Circuits and Systems (ISCAS). 2013, 477–480

    Google Scholar 

  8. Wu S, Liu P, Huang Y, Liu Q, Ikenaga T. On bit allocation and Lagrange Multiplier adjustment for rate-distortion optimized H. 264 rate control. In: Proceedings of IEEE International Workshop on Multimedia Signal Processing. 2009, 1–6

    Google Scholar 

  9. Lee Y M, Sun Y T, Lin Y. SATD-based intra mode decision for H.264/AVC video coding. IEEE Transactions on Circuits and Systems for Video Technology, 2010, 20(3): 463–469

    Article  Google Scholar 

  10. Ghandi M M, Ghanbari M. A Lagrangian optimized rate control algorithm for the H. 264/AVC encoder. In: Processings of 2004 International Conference on Image Processing., 2004, 123–126

    Google Scholar 

  11. Zhang J, Yi X, Ling N, Shang W. Context adaptive Lagrange multiplier (CALM) for rate-distortion optimal motion estimation in video coding. IEEE Transactions on Circuits and Systems for Video Technology,, 2010, 20(6): 820–828

    Article  Google Scholar 

  12. Li X, Oertel N, Hutter A, Kaup A. Laplace distribution based Lagrangian rate distortion optimization for hybrid video coding. IEEE Transactions on Circuits and Systems for Video Technology,, 2009, 19(2): 193–205

    Article  Google Scholar 

  13. Marpe D, Schwarz H, Wiegand T. Context-based adaptive binary arithmetic coding in the H.264/AVC video compression standard. IEEE Transactions on Circuits and Systems for Video Technology,, 2003, 13(7): 620–636

    Article  Google Scholar 

  14. Sze V, Budagavi M. High throughput CABAC entropy coding in HEVC. IEEE Transactions on Circuits and Systems for Video Technology,, 2012, 22(12): 1778–1791

    Article  Google Scholar 

  15. Richardson I E. The H.264 Advanced Video Compression Standard. New York: John Wiley & Sons,, 2011

    Google Scholar 

  16. Ugur K, Andersson K, Fuldseth A, Bjontegaard G, Endresen L P, Lainema J, Wu Z. High performance, low complexity video coding and the emerging HEVC standard. IEEE Transactions on Circuits and Systems for Video Technology,, 2010, 20(12): 1688–1697

    Article  Google Scholar 

  17. Han W J, Min J, Kim I K, Alshina E, Alshin A, Lee T, Park J H. Improved video compression efficiency through flexible unit representation and corresponding extension of coding tools. IEEE Transactions on Circuits and Systems for Video Technology,, 2010, 20(12): 1709–1720

    Article  Google Scholar 

  18. Wiegand T. WD3: working draft 3 of high-efficiency video coding. Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG16 WP3 and ISO. IEC JTC1/SC29/WG11,, 2011

    Google Scholar 

  19. Shannon C E, Weaver W. The Mathematical Theory of Communication. Urbana: The University of Illinois Press,, 1949

    MATH  Google Scholar 

  20. Senzaki K, Chono K, Aoki H, Tajime J, Senda Y. BD-PSNR/Rate computation tool for five data points, In: Proceedings of the Meeting of Joint Collaborative Team on Video Coding. 2010

    Google Scholar 

  21. Im S K, Ghandi M M, Lam C T. Non-integer bit estimation for ratedistortion optimized video coding. In: Processings of 2012 IEEE International Conference on Consumer Electronics., 2012, 88–89

    Chapter  Google Scholar 

  22. Li Z, Tourapis A M. Motion estimation with entropy coding considerations in H. 264/AVC. In: Processings of the 15th IEEE International Conference on Image Processing., 2008, 2140–2143

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sio Kei Im.

Additional information

Sio Kei Im received his degree in computer science in 1998 and the master degree in enterprise information system in 1999, both from King’s College, University of London, UK. He received the PhD degree in electronic engineering from Queen Mary University of London (QMUL), UK in 2007. He gained the position of Lecturer within the Computing Programme at Macao Polytechnic Institute (MPI) in 2001. In 2005, he became the operations manager of the MPIQMUL information systems research centre jointly operated byMPI and QMUL where he carried out signal processing work. In 2009, he was promoted to be the associate professor at the Macao Polytechnic Institute. Currently, he is a visiting scholar at School of Engineering, UCLA and a member of IET.

Mohammad Mahdi Ghandi received his BS and MS degrees in electronics engineering from the University of Tehran, Iran in 1998 and 2001 respectively. After two years of industrial experience in image and video coding, he, as a senior research officer, joined the Video Networking Group at the University of Essex, UK in 2003, where he published several papers in the field of video communications. He was also granted a PhD degree from this university. Since 2006, he returned back to TV broadcasting industry as a designer of compression systems while maintaining some of his academic activities. His research interests include reliable image and video transmission, advanced video codecs, and video transcoding.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Im, S.K., Ghandi, M.M. Improved rate-distortion optimized video coding using non-integer bit estimation and multiple Lambda search. Front. Comput. Sci. 10, 157–166 (2016). https://doi.org/10.1007/s11704-015-5066-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11704-015-5066-1

Keywords

Navigation