Abstract
A depth image represents the relative distance from a camera to objects in three-dimensional (3-D) space, and it is widely used as 3-D information in computer vision and computer graphics. Depth is generally stored in an image format, where it is uniformly quantized in the disparity/intensity domain, which means it is non-uniformly quantized in the depth domain. As a result, conventional bi-prediction, which is applied in the disparity/intensity domain, cannot track the values of a linearly moving object. To solve this problem, we propose a non-linear bi-directional prediction for depth coding. Experimental results demonstrate that the proposed non-linear bi-directional prediction method achieves a PSNR gain of 0.68 dB over the conventional method when hierarchical-B picture coding is used.
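The mismatch described in the abstract can be illustrated with a small numerical sketch. It is not the authors' method. It assumes the common inverse-depth mapping between metric depth and an 8-bit intensity level, and the clipping planes `z_near` and `z_far`, the function names, and the sample depths are all hypothetical values chosen for illustration. An object moves linearly in depth across three frames, and the middle frame is predicted once by averaging intensity levels and once by averaging in the depth domain.

```python
# Illustrative sketch only: the mapping and all names below are assumptions,
# not taken from the paper.

def depth_to_level(z, z_near, z_far, levels=255):
    """Map metric depth z to an intensity level that is uniform in 1/z."""
    return levels * (1.0 / z - 1.0 / z_far) / (1.0 / z_near - 1.0 / z_far)


def level_to_depth(v, z_near, z_far, levels=255):
    """Inverse of depth_to_level: recover metric depth from a level."""
    return 1.0 / (v / levels * (1.0 / z_near - 1.0 / z_far) + 1.0 / z_far)


def linear_bipred(v0, v1):
    """Conventional bi-prediction: average the two reference levels."""
    return 0.5 * (v0 + v1)


def depth_domain_bipred(v0, v1, z_near, z_far):
    """Average in the depth domain, then map back to an intensity level."""
    z_mid = 0.5 * (level_to_depth(v0, z_near, z_far)
                   + level_to_depth(v1, z_near, z_far))
    return depth_to_level(z_mid, z_near, z_far)


# Hypothetical scene: the object is at depth 2, 3, 4 in three successive frames.
Z_NEAR, Z_FAR = 1.0, 10.0
v0 = depth_to_level(2.0, Z_NEAR, Z_FAR)      # past reference frame
v1 = depth_to_level(4.0, Z_NEAR, Z_FAR)      # future reference frame
target = depth_to_level(3.0, Z_NEAR, Z_FAR)  # true level in the middle frame

err_linear = abs(linear_bipred(v0, v1) - target)
err_depth = abs(depth_domain_bipred(v0, v1, Z_NEAR, Z_FAR) - target)

if __name__ == "__main__":
    print("error of intensity-domain average:", err_linear)
    print("error of depth-domain average:   ", err_depth)
```

Under these assumptions the intensity-domain average overestimates the true level, because the level is a convex function of depth, while the depth-domain average matches it. This only shows why a non-linear predictor can help for linear motion in depth; the actual predictor used in the paper may differ.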
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Oh, KJ., Ho, YS. (2009). Non-linear Bi-directional Prediction for Depth Coding. In: Muneesawang, P., Wu, F., Kumazawa, I., Roeksabutr, A., Liao, M., Tang, X. (eds) Advances in Multimedia Information Processing - PCM 2009. PCM 2009. Lecture Notes in Computer Science, vol 5879. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10467-1_46
DOI: https://doi.org/10.1007/978-3-642-10467-1_46
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-10466-4
Online ISBN: 978-3-642-10467-1
eBook Packages: Computer Science (R0)