Abstract
3d reconstruction from a single image is an inherently ambiguous problem. Yet when we look at a picture, we can often infer 3d information about the scene: humans perform single-image 3d reconstruction by exploiting a variety of monocular depth cues, for example by recognizing objects and surfaces and reasoning about how those surfaces are connected to each other. In this paper, we focus on automatic 3d reconstruction of indoor scenes, specifically ones (sometimes called “Manhattan worlds”) that consist mainly of orthogonal planes. We use a Markov random field (MRF) model to identify the planes and edges in the scene, as well as their orientations. An iterative optimization algorithm is then applied to infer the most probable positions of all the planes, thereby obtaining a 3d reconstruction. Our approach is fully automatic: given an input image, no human intervention is needed to obtain an approximate 3d reconstruction.
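The two stages described above can be made concrete with a small, hedged sketch. The listing below is illustrative only and is not the authors' implementation: it uses iterated conditional modes as a simple stand-in for the MRF inference over per-region orientation labels, and an ordinary least-squares system as a stand-in for the iterative optimization of plane positions. All names and quantities (the unary costs, the region adjacency list, the shared boundary rays) are hypothetical placeholders.

import numpy as np

# Illustrative sketch only; not the authors' model or learned parameters.

MANHATTAN_NORMALS = np.eye(3)  # the three mutually orthogonal plane normals

def icm_plane_labels(unary, edges, pairwise_weight=0.5, iters=10):
    """Assign one of the three Manhattan orientations to each planar region.

    unary : (R, 3) array of per-region costs for each orientation
            (hypothetically derived from local image features); lower is better.
    edges : list of (i, j) pairs of adjacent regions.
    Iterated conditional modes is used here as a simple stand-in for the
    MRF inference described in the abstract.
    """
    labels = unary.argmin(axis=1)
    for _ in range(iters):
        for r in range(unary.shape[0]):
            cost = unary[r].astype(float)
            for (i, j) in edges:
                if r in (i, j):
                    other = labels[j] if r == i else labels[i]
                    # Potts-style smoothness: penalize disagreeing with a neighbor.
                    cost += pairwise_weight * (np.arange(3) != other)
            labels[r] = int(cost.argmin())
    return labels

def solve_plane_offsets(labels, boundary_rays):
    """Recover each plane's offset d in n . X = d, up to an overall scale.

    boundary_rays : list of (i, j, x) where x is the viewing-ray direction
                    through a pixel on the boundary between regions i and j.
                    Both planes must contain the same 3d point t * x, giving
                    the linear constraint d_i (n_j . x) = d_j (n_i . x).
    The first plane's offset is fixed to 1 to remove the scale ambiguity.
    """
    R = len(labels)
    rows = [np.eye(R)[0]]          # gauge constraint: d_0 = 1
    rhs = [1.0]
    for (i, j, x) in boundary_rays:
        ni_x = MANHATTAN_NORMALS[labels[i]] @ np.asarray(x, dtype=float)
        nj_x = MANHATTAN_NORMALS[labels[j]] @ np.asarray(x, dtype=float)
        row = np.zeros(R)
        row[i], row[j] = nj_x, -ni_x
        rows.append(row)
        rhs.append(0.0)
    d, *_ = np.linalg.lstsq(np.array(rows), np.array(rhs), rcond=None)
    return d

The gauge constraint d_0 = 1 reflects the fact that single-view reconstruction is determined only up to an overall scale; the relative offsets of the remaining planes follow from requiring adjacent planes to meet along their shared boundary rays.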
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
Cite this paper
Delage, E., Lee, H., Ng, A.Y. (2007). Automatic Single-Image 3d Reconstructions of Indoor Manhattan World Scenes. In: Thrun, S., Brooks, R., Durrant-Whyte, H. (eds) Robotics Research. Springer Tracts in Advanced Robotics, vol 28. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-48113-3_28
DOI: https://doi.org/10.1007/978-3-540-48113-3_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-48110-2
Online ISBN: 978-3-540-48113-3