Skip to main content
Log in

Multi-View Stereo Reconstruction and Scene Flow Estimation with a Global Image-Based Matching Score

  • Published:
International Journal of Computer Vision Aims and scope Submit manuscript

Abstract

We present a new variational method for multi-view stereovision and non-rigid three-dimensional motion estimation from multiple video sequences. Our method minimizes the prediction error of the shape and motion estimates. Both problems then translate into a generic image registration task. The latter is entrusted to a global measure of image similarity, chosen depending on imaging conditions and scene properties. Rather than integrating a matching measure computed independently at each surface point, our approach computes a global image-based matching score between the input images and the predicted images. The matching process fully handles projective distortion and partial occlusions. Neighborhood as well as global intensity information can be exploited to improve the robustness to appearance changes due to non-Lambertian materials and illumination changes, without any approximation of shape, motion or visibility. Moreover, our approach results in a simpler, more flexible, and more efficient implementation than in existing methods. The computation time on large datasets does not exceed thirty minutes on a standard workstation. Finally, our method is compliant with a hardware implementation with graphics processor units. Our stereovision algorithm yields very good results on a variety of datasets including specularities and translucency. We have successfully tested our motion estimation algorithm on a very challenging multi-view video sequence of a non-rigid scene.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  • Barron, J., Fleet, D., and Beauchemin, S. 1994. Performance of optical flow techniques. The International Journal of Computer Vision, 12(1):43–77.

    Article  Google Scholar 

  • Boykov, Y. and Kolmogorov, V. 2003. Computing geodesics and minimal surfaces via graph cuts. In International Conference on Computer Vision, 1:26–33.

  • Broadhurst, A., Drummond, T., and Cipolla, R. 2001. A probabilistic framework for space carving. In International Conference on Computer Vision, 1:388–393.

  • Carceroni, R. and Kutulakos, K. 2002. Multi-view scene capture by surfel sampling: From video streams to non-rigid 3D motion, shape and reflectance. The International Journal of Computer Vision, 49(2–3):175–214.

    Article  Google Scholar 

  • Caselles, V., Kimmel, R., and Sapiro, G. 1997. Geodesic active contours. The International Journal of Computer Vision, 22(1):61–79.

    Article  Google Scholar 

  • Duan, Y., Yang, L., Qin, H., and Samaras, D. 2004. Shape reconstruction from 3D and 2D data using PDE-based deformable surfaces. In European Conference on Computer Vision, 3:238–251.

  • Faugeras, O. and Hermosillo, G. 2004. Well-posedness of two non-rigid multimodal image registration methods. Siam Journal of Applied Mathematics, 64(5):1550–1587.

    Article  MathSciNet  Google Scholar 

  • Faugeras, O. and Keriven, R. 1998. Variational principles, surface evolution, PDE’s, level set methods and the stereo problem. IEEE Transactions on Image Processing, 7(3):336–344.

    Article  MathSciNet  Google Scholar 

  • Goldlücke, B. and Magnor, M. 2004a. Space-time isosurface evolution for temporally coherent 3D reconstruction. In International Conference on Computer Vision and Pattern Recognition, 1:350–355.

  • Goldlücke, B. and Magnor, M. 2004b. Weighted minimal hypersurfaces and their applications in computer vision. In European Conference on Computer Vision, 2:366–378.

  • Hermosillo, G., Chefd’hotel, C., and Faugeras, O. 2002. Variational methods for multimodal image matching. The International Journal of Computer Vision, 50(3):329–343.

    Article  Google Scholar 

  • Hernández Esteban, C. and Schmitt, F. 2004. Silhouette and stereo fusion for 3D object modeling. Computer Vision and Image Understanding, 96(3):367–392.

    Article  Google Scholar 

  • Jin, H., Soatto, S., and Yezzi, A. 2005. Multi-view stereo reconstruction of dense shape and complex appearance. The International Journal of Computer Vision, 63(3):175–189.

    Article  Google Scholar 

  • Kanade, T. and Okutomi, M. 1994. A stereo matching algorithm with an adaptive window: Theory and experiment. IEEE Transactions on Pattern Analysis and Machine Intelligence, 16(9):920–932.

    Article  Google Scholar 

  • Kass, M., Witkin, A., and Terzopoulos, D. 1987. Snakes: Active contour models. The International Journal of Computer Vision, 1(4):321–331.

    Article  Google Scholar 

  • Kolmogorov, V. and Zabih, R. 2002. Multi-camera scene reconstruction via graph cuts. In European Conference on Computer Vision, 3:82–96.

  • Kolmogorov, V. and Zabih, R. 2004. What energy functions can be minimized via graph cuts? IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(2):147–159.

    Article  Google Scholar 

  • Kutulakos, K. 2000. Approximate n-View Stereo. In European Conference on Computer Vision, 1:67–83.

  • Kutulakos, K. and Seitz, S. 2000. A theory of shape by space carving. The International Journal of Computer Vision, 38(3):199–218.

    Article  Google Scholar 

  • Lhuillier, M. and Quan, L. 2003. Surface reconstruction by integrating 3D and 2D data of multiple views. In International Conference on Computer Vision, 2:1313–1320.

  • Neumann, J. and Aloimonos, Y. 2002. Spatio-temporal stereo using multi-resolution subdivision surfaces. The International Journal of Computer Vision, 47:181–193.

    Article  Google Scholar 

  • Osher, S. and Sethian, J. 1988. Fronts propagating with curvature-dependent speed: Algorithms based on Hamilton–Jacobi formulations. Journal of Computational Physics, 79(1):12–49.

    Article  MathSciNet  Google Scholar 

  • Parzen, E. 1962. On estimation of a probability density function and mode. Annals Mathematical Statistics, 33:1065–1076.

    MathSciNet  Google Scholar 

  • Roche, A., Malandain, G., Pennec, X., and Ayache, N. 1998. The correlation ratio as new similarity metric for multimodal image registration. In W. Wells, A. Colchester, and S. Delp (Eds.), Medical Image Computing and Computer-Assisted Intervention-MICCAI’98. Cambridge, MA, USA, pp. 1115–1124.

  • Scharstein, D. and Szeliski, R. 1998. Stereo matching with nonlinear diffusion. The International Journal of Computer Vision, 28(2):155–174.

    Article  Google Scholar 

  • Scharstein, D. and Szeliski, R. 2002. A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. The International Journal of Computer Vision, 47(1):7–42.

    Article  Google Scholar 

  • Segal, M., Korobkin, C., van Widenfelt, R., Foran, J., and Haeberli P. 1992. Fast shadows and lighting effects using texture mapping. Computer Graphics, 26(2):249–252.

    Article  Google Scholar 

  • Seitz, S. and Dyer, C. 1999. Photorealistic scene reconstruction by voxel coloring. The International Journal of Computer Vision, 35(2):151–173.

    Article  Google Scholar 

  • Soatto, S., Yezzi, A., and Jin, H. 2003. Tales of shape and radiance in multi-view stereo. In International Conference on Computer Vision, 2:974–981.

  • Solem, J. and Overgaard, N. 2005. A geometric formulation of gradient descent for variational problems with moving surfaces. In International Conference on Scale Space and PDE Methods in Computer Vision, pp. 419–430.

  • Strecha, C., Fransens, R., and Van Gool, L. 2004. Wide-baseline stereo from multiple views: A probabilistic account. In International Conference on Computer Vision and Pattern Recognition, 2:552–559.

  • Strecha, C., Tuytelaars, T., and Van Gool, L. 2003. Dense matching of multiple wide-baseline views. In International Conference on Computer Vision, 2:1194–1201.

  • Szeliski, R. 1999. Prediction error as a quality metric for motion and stereo. In International Conference on Computer Vision, 2:781–788.

  • Treuille, A., Hertzmann, A., and Seitz, S. 2004. Example-based stereo with general brdfs. In European Conference on Computer Vision, 2:457–469.

  • Vedula, S., Baker, S., and Kanade, T. 2002. Spatio-temporal view interpolation. In ACM Eurographics Workshop on Rendering, pp. 65–76.

  • Vedula, S., Baker, S., Rander, P., Collins, R., and Kanade, T. 2005. Three-dimensional scene flow. IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(3):475–480.

    Article  Google Scholar 

  • Viola, P. and Wells III, W. M. 1997. Alignement by maximization of mutual information. The International Journal of Computer Vision, 24(2):137–154.

    Article  Google Scholar 

  • Yang, R., Pollefeys, M., and Welch, G. 2003. Dealing with textureless regions and specular highlights: A progressive space carving scheme using a novel photo-consistency measure. In International Conference on Computer Vision, 1:576–584.

  • Zhang, Y. and Kambhamettu, C. 2001. On 3D scene flow and structure estimation. In International Conference on Computer Vision and Pattern Recognition, 2:778–785.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jean-Philippe Pons.

Electronic supplementary material

Supplementary material (474 KB)

Supplementary material (544 KB)

Supplementary material (646 KB)

Supplementary material (496 KB)

Supplementary material (324 KB)

Supplementary material (346 KB)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Pons, JP., Keriven, R. & Faugeras, O. Multi-View Stereo Reconstruction and Scene Flow Estimation with a Global Image-Based Matching Score. Int J Comput Vision 72, 179–193 (2007). https://doi.org/10.1007/s11263-006-8671-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11263-006-8671-5

Keywords

Navigation