Skip to main content

Spatio-temporal Visual Distortion and Rate Optimization for Video Coding

  • Conference paper
Advances in Multimedia Information Processing – PCM 2012 (PCM 2012)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7674))

Included in the following conference series:

  • 3464 Accesses

Abstract

Rate-distortion optimization (RDO) plays a significant role in video coding. However, in most RDO methods, the distortion measurement metrics consider only the spatial distortion of statistical pixel errors. People have concerns about not only the information of independent pixels, but also the spatial and temporal correlations between them. In order to make the distortion assessment more consistent with human perception, temporal information of the successive images and the characteristics of human visual perception should be considered as well. In this paper, we propose a rate-distortion model based on spatio-temporal video structural similarity (stVSSIM) index, which takes both spatial and temporal visual quality into account. Meanwhile, to obtain a reasonable trade-off between bit-rate and visual quality dynamically, a perceptual adaptive Lagrange multiplier selection method is presented. Simulation results show that the proposed method averagely reduces 20% bit-rate under the equal visual quality and the adaptive Lagrange multiplier can further improve the results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Sullivan, G.J., Wiegand, T.: Rate-distortion Optimization for Video Compression. IEEE Signal Process. Mag. 15(6), 74–90 (1998)

    Article  Google Scholar 

  2. Wiegand, T., Girod, B.: Lagrangian Multiplier Selection in Hybrid Video Coder Control. In: IEEE Int. Conf. of Image Process., vol. 3, pp. 542–545 (2001)

    Google Scholar 

  3. He, Z., Kim, Y.K., Mitra, S.K.: Low-delay Rate Control for DCT Video Coding via ρ-domain Source Modeling. IEEE Trans. on Circuits Syst. for Video Tech. 11, 928–940 (2001)

    Article  Google Scholar 

  4. Chen, L., Garbacea, I.: Adaptive Lambda Estimation in Lagrangian Rate-distortion Optimization for Video Coding. In: Visual Commun. Image Process., VCIP, vol. 6077, pp. 1–8 (2006)

    Google Scholar 

  5. Li, X., Oertel, N., Hutter, A., Kaup, A.: Laplace Distribution Based Lagrangian Rate Distortion Optimization for Hybrid Video Coding. IEEE Trans. on Circuits Syst. for Video Tech. 19(2), 193–205 (2009)

    Article  Google Scholar 

  6. Wang, Z., Bovik, A.C.: Mean Squared Error: Love It or Leave It? A New Look at Signal Fidelity Measures. IEEE Signal Process. Mag. 26(1), 98–117 (2009)

    Article  Google Scholar 

  7. Wang, Z., Bovik, A.C., Sheikh, H.R., Simoncelli, E.P.: Image Quality Assessment: From Error Visibility to Structural Similarity. IEEE Trans. on Image Process. 13(4), 600–612 (2004)

    Article  Google Scholar 

  8. Yang, C., Leung, R., Po, L., Mai, Z.: An SSIM-optimal H.264/AVC Inter Frame Encoder. In: IEEE Proc. Intelligent Computing and Intelligent Systems, vol. 4, pp. 291–295 (2009)

    Google Scholar 

  9. Huang, Y., Ou, T., Su, P., Chen, H.: Perceptual Rate-distortion Optimization Using Structural Similarity Index as Quality Metric. IEEE Trans. on Circuits Syst. for Video Tech. 20 (2010) 1051-8215

    Google Scholar 

  10. Wang, S., Rehman, A., Wang, Z., Ma, S., Gao, W.: Rate-SSIM Optimization for Video Coding. In: IEEE Proc. Acoustics, Speech and Signal Processing, pp. 833–836 (2011)

    Google Scholar 

  11. Wang, X., Su, L., Huang, Q., Liu, C.: Visual Perception Based Lagrangian Rate Distortion Optimization for Video Coding. In: IEEE Int. Conf. of Image Process., pp. 1653–1656 (2011)

    Google Scholar 

  12. Moorthy, A.K., Bovik, A.C.: Efficient Motion Weighted Spatio-temporal Video SSIM Index. In: SPIE Proc. Human Vision and Electronic Imaging, vol. 7527, p. 72571 (2010)

    Google Scholar 

  13. Gish, H., Pierce, J.N.: Asymptotically Efficient Quantizing. IEEE Trans. on Information Theory 14, 676–683 (1968)

    Article  Google Scholar 

  14. Liu, H., Jiang, S., Huang, Q., Xu, C.: A Generic Virtual Content Insertion System Based on Visual Attention Analysis. In: ACM Multimedia, pp. 379–388 (2008)

    Google Scholar 

  15. Wang, M., Ling, N.: Lagrangian Multiplier Based Joint Three-layer Rate Control for H.264/AVC. IEEE Signal Process. Letters 16(8), 679–682 (2009)

    Article  Google Scholar 

  16. H.264 reference software JM14.1, http://iphome.hhi.de/suehring/tml/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hu, F., Su, L., Qi, H., Huang, Q. (2012). Spatio-temporal Visual Distortion and Rate Optimization for Video Coding. In: Lin, W., et al. Advances in Multimedia Information Processing – PCM 2012. PCM 2012. Lecture Notes in Computer Science, vol 7674. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34778-8_60

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-34778-8_60

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-34777-1

  • Online ISBN: 978-3-642-34778-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics