Abstract
In this paper, we propose an MPEG-4 based multiview video encoder. The main view of the sequences is encoded using the MPEG-4 encoder and the auxiliary views are encoded by joint motion and disparity compensation. The output of the encoder contains multiple bitstreams and the main bitstream can be decoded by a standard MPEG-4 decoder. Extensive experimental results show that our proposed multiview coding can achieve significant performance gain over the conventional multiview video encoder, which is implemented by applying the concept of the MPEG-2 Multi-View Profile (MVP) on the MPEG-4 platform. The improvements come from a more efficient reference structure and the joint estimation of disparity and motion fields in our proposed multiview coding system. In addition, in the case of five-view encoding, we also compare four different prediction structures in order to find the best structure under certain scenarios. The proposed encoder is very promising for the applications of video-conferencing and 3D telepresence.
This project is funded by the University Academic Research Fund (URC) RG3/02 – Depth-Based Video Segmentation of Nanyang Technological University, Singapore.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Smolic, A., Kimata, H.: Applications and requirements for 3DAV. ISO/IEC JTC1/SC29/WG11 N5877 (July 2003)
Siegel, M.W., Sethuraman, S., McVeigh, J.S., Jordan, A.G.: Compression and interpolation of 3D-stereoscopic and multi-view video. In: Proceedings of the SPIE Stereoscopic Displays and Virtual Reality Systems IV, February 1997, vol. 3012, pp. 227–238 (1997)
Puri, A., Kollarits, R.V., Haskell, B.G.: Stereoscopic video compression using temporal scalability. Visual Communication: Image Processing 2501, 745–756 (1995)
Chen, X., Luthra, A.: MPEG-2 multi-view profile and its application in 3DTV. SPIE/IS&T Multimedia Hardware Architectures, 212–223 (February 1997)
Yan, L., Zhaoyang, Z., Ping, A.: Stereo video coding based on frame estimation and interpolation. IEEE Trans. on Broadcasting 49(1), 14–21 (2003)
Yang, W., Ngan, K.N.: MPEG-4 based stereoscopic video sequences encoder. In: IEEE International Conference on Acoustics, Speech, and Signal Processing ICASSP (May 2004)
Kim, H.S., Sohn, K.H.: Feature-based disparity estimation for intermediate view reconstruction of multiview images. In: International Conference on Imaging Science, Systems, and Technology, June 2001, vol. 2, pp. 1–8 (2001)
Li, W., Ohm, J.-R., van der Schaar, M., Jiang, H., Li, S.: MPEG- 4 video verification model version 18.0. ISO/IEC JTC1/SC29/WG11 N3908 (January 2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yang, W., Ngan, K.N., Cai, J. (2004). Efficient Multiview Video Coding Based on MPEG-4. In: Aizawa, K., Nakamura, Y., Satoh, S. (eds) Advances in Multimedia Information Processing - PCM 2004. PCM 2004. Lecture Notes in Computer Science, vol 3333. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30543-9_22
Download citation
DOI: https://doi.org/10.1007/978-3-540-30543-9_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23985-7
Online ISBN: 978-3-540-30543-9
eBook Packages: Computer ScienceComputer Science (R0)