Abstract
Rate-distortion optimization (RDO) plays a significant role in video coding. However, in most RDO methods, the distortion measurement metrics consider only the spatial distortion of statistical pixel errors. People have concerns about not only the information of independent pixels, but also the spatial and temporal correlations between them. In order to make the distortion assessment more consistent with human perception, temporal information of the successive images and the characteristics of human visual perception should be considered as well. In this paper, we propose a rate-distortion model based on spatio-temporal video structural similarity (stVSSIM) index, which takes both spatial and temporal visual quality into account. Meanwhile, to obtain a reasonable trade-off between bit-rate and visual quality dynamically, a perceptual adaptive Lagrange multiplier selection method is presented. Simulation results show that the proposed method averagely reduces 20% bit-rate under the equal visual quality and the adaptive Lagrange multiplier can further improve the results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Sullivan, G.J., Wiegand, T.: Rate-distortion Optimization for Video Compression. IEEE Signal Process. Mag. 15(6), 74–90 (1998)
Wiegand, T., Girod, B.: Lagrangian Multiplier Selection in Hybrid Video Coder Control. In: IEEE Int. Conf. of Image Process., vol. 3, pp. 542–545 (2001)
He, Z., Kim, Y.K., Mitra, S.K.: Low-delay Rate Control for DCT Video Coding via ρ-domain Source Modeling. IEEE Trans. on Circuits Syst. for Video Tech. 11, 928–940 (2001)
Chen, L., Garbacea, I.: Adaptive Lambda Estimation in Lagrangian Rate-distortion Optimization for Video Coding. In: Visual Commun. Image Process., VCIP, vol. 6077, pp. 1–8 (2006)
Li, X., Oertel, N., Hutter, A., Kaup, A.: Laplace Distribution Based Lagrangian Rate Distortion Optimization for Hybrid Video Coding. IEEE Trans. on Circuits Syst. for Video Tech. 19(2), 193–205 (2009)
Wang, Z., Bovik, A.C.: Mean Squared Error: Love It or Leave It? A New Look at Signal Fidelity Measures. IEEE Signal Process. Mag. 26(1), 98–117 (2009)
Wang, Z., Bovik, A.C., Sheikh, H.R., Simoncelli, E.P.: Image Quality Assessment: From Error Visibility to Structural Similarity. IEEE Trans. on Image Process. 13(4), 600–612 (2004)
Yang, C., Leung, R., Po, L., Mai, Z.: An SSIM-optimal H.264/AVC Inter Frame Encoder. In: IEEE Proc. Intelligent Computing and Intelligent Systems, vol. 4, pp. 291–295 (2009)
Huang, Y., Ou, T., Su, P., Chen, H.: Perceptual Rate-distortion Optimization Using Structural Similarity Index as Quality Metric. IEEE Trans. on Circuits Syst. for Video Tech. 20 (2010) 1051-8215
Wang, S., Rehman, A., Wang, Z., Ma, S., Gao, W.: Rate-SSIM Optimization for Video Coding. In: IEEE Proc. Acoustics, Speech and Signal Processing, pp. 833–836 (2011)
Wang, X., Su, L., Huang, Q., Liu, C.: Visual Perception Based Lagrangian Rate Distortion Optimization for Video Coding. In: IEEE Int. Conf. of Image Process., pp. 1653–1656 (2011)
Moorthy, A.K., Bovik, A.C.: Efficient Motion Weighted Spatio-temporal Video SSIM Index. In: SPIE Proc. Human Vision and Electronic Imaging, vol. 7527, p. 72571 (2010)
Gish, H., Pierce, J.N.: Asymptotically Efficient Quantizing. IEEE Trans. on Information Theory 14, 676–683 (1968)
Liu, H., Jiang, S., Huang, Q., Xu, C.: A Generic Virtual Content Insertion System Based on Visual Attention Analysis. In: ACM Multimedia, pp. 379–388 (2008)
Wang, M., Ling, N.: Lagrangian Multiplier Based Joint Three-layer Rate Control for H.264/AVC. IEEE Signal Process. Letters 16(8), 679–682 (2009)
H.264 reference software JM14.1, http://iphome.hhi.de/suehring/tml/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hu, F., Su, L., Qi, H., Huang, Q. (2012). Spatio-temporal Visual Distortion and Rate Optimization for Video Coding. In: Lin, W., et al. Advances in Multimedia Information Processing – PCM 2012. PCM 2012. Lecture Notes in Computer Science, vol 7674. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34778-8_60
Download citation
DOI: https://doi.org/10.1007/978-3-642-34778-8_60
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34777-1
Online ISBN: 978-3-642-34778-8
eBook Packages: Computer ScienceComputer Science (R0)