Abstract
The inconsistency between the temporal motion vector and the disparity vector fields causes a drastic loss in coding performance during multiview video coding. In order to resolve this inconsistency problem, we investigate a new approach for motion vector coding. In this paper, degradation in accuracy using a conventional method is experimentally analyzed using mathematical tools. Then, we introduce a new framework for motion vector coding, including modified Inter-mode and Adaptive Direct-mode, in which an advanced motion vector prediction method is employed. The main idea of the proposed motion vector prediction method is to separate each motion vector field from the others. For this purpose, we introduce three alternative motion vectors; a virtual disparity vector, a virtual temporal motion vector and a scaled motion vector. The experimental results show that the proposed method can resolve the inconsistency problem and consequently improve the overall rate-distortion performance. The average bit-rate savings is about 7.3% compared to JMVC 6.0. At maximum, the gain obtained is about 20%, corresponding to a PSNR of 1.42 dB.
Similar content being viewed by others
References
Bjontegaard G (2001) “Calculation of average PSNR differences between RD-curves,” ITU-T video coding experts group document VCEG-M33, Mar
Cyganek B, Siebert JP (2009) An introduction to 3D computer vision techniques and algorithms. Wiley, Chichenster
Fecker U, Barkowsky M, Kaup A (2008) Histogram-based prefiltering for luminance and chrominance compensation of multiview video. IEEE Trans Circ Syst Video Tech 18(9):1258–1267
Fehn C (2004) “Depth-image-based rendering (DIBR), compression and transmission for a new approach on 3D-TV.” In: Proc. SPIE Stereoscopic Display and Virtual Reality Systems XI, San Jose, CA, USA, Jan
Guo X, Lu Y, Wu F, Gao W (2006) Inter-view direct mode for multiview video coding. IEEE Trans Circ Syst Video Tech 16(12):1527–1532
Hur JH, Cho SH, Lee YL (2007) Adaptive local illumination change compensation method for H.264/AVC-based multiview video coding. IEEE Trans Circ Syst Video Tech 17(11):1496–1505
Kim JH, Lai P, Lopez J, Ortega A, Su Y, Yin P, Gomila C (2007) New coding tools for illumination and focus mismatch compensation in multiview video coding. IEEE Trans Circ Syst Video Tech 17(11):1519–1535
Kim D, Min D, Sohn K (2008) A stereoscopic video generation method using stereoscopic display characterization and motion analysis. IEEE Trans Broadcast 52(2):188–197
Kitahara M, Kimata H, Shimizu S, Kamikura K, Yashimata Y, Yamamoto K, Yendo T, Fujii T, Tanimoto M (2006) “Multi-view video coding using view interpolation and reference picture selection.” In: Proc. IEEE International Conference on Multimedia and Exposition, Toronto, Canada, Jul
Konieczny J, Domanski M (2010) “Depth-based inter-view prediction of motion vectors for improved multiview video coding.” In: Proc. IEEE 3DTV-Conference: The True Vision–Capture, Transmission and Display of 3D Video, Tampere, Finland, Jun
Koo HS, Jeon YJ, Jeon BM (2006) “Motion skip mode for MVC,” ITU-T and ISO/IEC JTC1, JVT-U091, Hangzhou, China, Oct
Kubota A, Smolic A, Magnor M, Tanimoto M, Chen T, Zhang C (2007) Multiview imaging and 3DTV. IEEE Signal Process Mag 24(6):10–21
Lee JY, Wey H, Park D-S, Kim C-Y (2011) “Temporal and inter-view skip modes for multi-view video coding.” In: Proc. IEEE 3DTV-Conference: The True Vision–Capture, Transmission and Display of 3D Video, Antalya, Turkey, May
Lee SH, Yang JH, Cho NI (2010) A motion vector prediction method for multi-view video coding. J Vis Commun Image Represent 21(7):677–681
Merkel P, Smolic A, Muller K, Wiegand T (2007) Efficient prediction structures for multiview video coding. IEEE Trans Circ Syst Video Tech 17(11):1461–1473
Ryu S, Seo J, Kim DH, Lee JY, Wey H, Sohn K (2011) “Adaptive competition for motion vector prediction in multi-view video coding.” In: Proc. IEEE 3DTV-Conference: The True Vision–Capture, Transmission and Display of 3D Video, Antalya, Turkey, May
Ryu S, Seo J, Kim DH, Lee JY, Wey H, Sohn K (2012) “An independent motion and disparity vector prediction method for multiview video coding.” In: Proc. SPIE Conference on Stereoscopic and Applications XXIII, San Francisco, CA, USA, Jan. Accepted
Ryu S, Seo J, Liu X, Lee JY, Wey H, Sohn K (2011) “Analysis of motion vector predictor in multiview video coding system.” In: Proc. IEEE International Symposium on Parallel and Distributed Processing with Applications Workshops, Busan, South Korea, May
Schwarz H, Marpe D, Wiegand T (2005) “Hierarchical B pictures,” ISO/IEC JTC1/SC29/WG11 and ITU-T Q6/SG16, Doc. JVT-P014, Poznan, Poland, Jul
Schwarz H, Marpe D, Wiegand T (2010) “Description of exploration experiments in 3D video coding,” ISO/IEC JTC1/SC29/WG11 MPEG2010/N11274, Dresden, Germany, Apr
Smolic A, Mueller K, Merkel P, Kauff P, Wiegand T (2009) “An overview of available and emerging 3D video formats and depth enhanced stereo as efficient generic solution.” In: Proc. Picture Coding Symposium, Chicago, Illinois, USA, May
Vetro A, Chen Y, Shimizu S, Pandit P, Lim CS (2009) Joint video team of ITU-T VCEG and ISO/IEC MPEG WD1 reference software for MVC (JMVC) 6.0, Doc. JVT-AF14, Geneva, Switzerland, Nov
Vetro A, Wiegand T, Sullivan GJ (2011) Overview of the stereo and multiview video coding extensions of the H.264/MPEG-4 AVC standard. Proc IEEE 99(4):626–642
Vetro A, Yea S, Smolic A (2008) “Towards a 3D video format for auto-stereoscopic displays.” In: Proc. SPIE Conference on Applications of Digital Image Processing XXXI, San Diego, USA, Sep
Xu F, Er G, Xie X, Dai Q (2008) “2D-to-3D conversion based on motion and color mergence.” In: Proc. 3DTV-Conference: The True Vision–Capture, Transmission and Display of 3D Video, Istanbul, Turkey, May
Yamamoto K, Kitahara M, Kimata H, Yendo T, Fujii T, Tanimoto M, Shimizu S, Kamikura K, Yashima Y (2007) Multi-view video coding using view interpolation and color correction. IEEE Trans Circ Syst Video Tech 17(11):1436–1449
Yamamoto K, Yendo T, Fujii T, Tanimoto M, Kitahara M, Kimata H, Shimizu S, Kamikura K, Yashima Y (2006) “Multi-view video coding using view-interpolated reference images.” In: Proc. Picture Coding Symposium, Beijing, China, Apr
Yang H, Chang Y, Huo J (2009) Fine-granular motion matching for inter-view motion skip mode in multiview video coding. IEEE Trans Circ Syst Video Tech 19(6):887–892
Yea S, Vetro A (2008) “View synthesis prediction for rate-overhead reduction in FTV.” In: Proc. IEEE 3DTV-Conference: The True Vision–Capture, Transmission and Display of 3D Video, Istanbul, Turkey, May
Yea S, Vetro A (2009) View synthesis prediction for multiview video coding. Signal Process Image Comm 24(1):89–100
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Ryu, S., Seo, J., Lee, J.Y. et al. Advanced motion vector coding framework for multiview video sequences. Multimed Tools Appl 67, 49–70 (2013). https://doi.org/10.1007/s11042-011-0930-y
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-011-0930-y