Visual Autonomy via 2D Matching in Rendered 3D Models

Tenorio, D.; Rivera, V.; Medina, J.; Leondar, A.; Gaumer, M.; Dodds, Z.

doi:10.1007/978-3-319-27857-5_34

D. Tenorio²⁵,
V. Rivera²⁵,
J. Medina²⁵,
A. Leondar²⁵,
M. Gaumer²⁵ &
…
Z. Dodds²⁵

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 9474))

Included in the following conference series:

International Symposium on Visual Computing

2788 Accesses

Abstract

As they decrease in price and increase in fidelity, visually-textured 3D models offer a foundation for robotic spatial reasoning that can support a huge variety of platforms and tasks. This work investigates the capabilities, strengths, and drawbacks of a new sensor, the Matterport 3D camera, in the context of several robot applications. By using hierarchical 2D matching into a database of images rendered from a visually-textured 3D model, this work demonstrates that – when similar cameras are used – 2D matching into visually-textured 3D maps yields excellent performance on both global-localization and local-servoing tasks. When the 2D-matching spans very different camera transforms, however, we show that performance drops significantly. To handle this situation, we propose and prototype a map-alignment phase, in which several visual representations of the same spatial environment overlap: one to support the image-matching needed for visual localization, and the other carrying a global coordinate system needed for task accomplishment, e.g., point-to-point positioning.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Matterport. matterport.com. Accessed 20 October 2015
Oliver, A., Kang, S., Wunsche, B.C., MacDonald, B.: Using the Kinect as a navigation sensor for mobile robotics. In: Proceedings of IVCNZ 2012, pp 509–514, Dunedin, New Zealand. ACM, NY
Google Scholar
Olesk, A., Wang, J.: Geometric and error analysis for 3D map-matching. In: Proceedings of IGNSS Symposium, 14 p, Queensland, Australia (2009)
Google Scholar
Pinto, M., Moreira, A.P., Matos, A., Sobreira, H., Santos, F.: Fast 3D map matching localisation algorithm. J. Autom. Control Eng. 1(2), 110–114 (2013)
Article Google Scholar
Endres, F., Hess, J., Engelhard, N., Sturm, J.: In: Proceedings of ICRA 2012, pp. 1691–1696, 14–18 May 2012
Google Scholar
Wu, C., Clipp, B., Li, X., Frahm, J.-M.: 3D model matching with viewpoint-invariant patches (VIP). In: Proceedings of CVPR 2008, pp. 1–8, 23–28 June 2008
Google Scholar
Unity. unity3d.com. Accessed 20 August 2015
Wang, Z., Bovik, A.C., Sheikh, H.R., Simoncelli, E.P.: Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13(4), 600–612 (2004)
Article Google Scholar
Rosebrock, A.L Pyimagesearch. www.pyimagesearch.com/
The OpenCV Library [http://opencv.org/]; Bradski, G. The OpenCV Library Dr. Dobbs Journal, November 2000
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
Article Google Scholar
Bay, H., Ess, A., Tuytelaars, T., Van Gool, L.: SURF: speeded up robust features. Comput. Vis. Image Underst. (CVIU) 110(3), 346–359 (2008)
Article Google Scholar
Rublee, E., et al.: ORB: an efficient alternative to SIFT or SURF. In: 2011 IEEE International Conference on Computer Vision (ICCV). IEEE (2011)
Google Scholar
Quigley, M., Gerkey, B., Conley, K., Faust, J., Foote, T., Leibs, J., Berger, E., Wheeler, R., Ng, A.Y.: ROS: an open-source robot operating system. In: Proceedings of Open-Source Software Workshop of the International Conference on Robotics and Automation (ICRA) (2009)
Google Scholar
Watson, O., Touretzky, D.S.: Navigating with the Tekkotsu Pilot. In: Proceedings of FLAIRS-24, pp. 591–596, Palm Beach, FL, USA, 18–20 May 2011
Google Scholar
Hartley, R., Zisserman, A.: Multiple View Geometry. Cambridge University Press, Cambridge (2004)
Book MATH Google Scholar
Haralick, R., Lee, C., Ottenberg, K., Nolle, M.: Review and analysis of solutions of the three point perspective pose estimation problem. Int. J. Comput. Vis. 13(3), 331–356 (1994)
Article Google Scholar
Lynen, S., Sattler, T., Bosse, M., Hesch, J., Pollefeys, M., Siegwart, R.: Get Out of My Lab: Large-scale, Real-Time Visual-Inertial Localization. Science and Systems, Rovotics (2011)
Google Scholar

Download references

Acknowledgments

We acknowledge and thank the generous support of the NSF, though CISE REU Site project #1359170, as well as from Harvey Mudd College and its Computer Science department.

Author information

Authors and Affiliations

Harvey Mudd College CS Department, Claremont, CA, 91711, USA
D. Tenorio, V. Rivera, J. Medina, A. Leondar, M. Gaumer & Z. Dodds

Authors

D. Tenorio
View author publications
You can also search for this author in PubMed Google Scholar
V. Rivera
View author publications
You can also search for this author in PubMed Google Scholar
J. Medina
View author publications
You can also search for this author in PubMed Google Scholar
A. Leondar
View author publications
You can also search for this author in PubMed Google Scholar
M. Gaumer
View author publications
You can also search for this author in PubMed Google Scholar
Z. Dodds
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Z. Dodds .

Editor information

Editors and Affiliations

University of Nevada, Reno, Nevada, USA
George Bebis
NASA Ames Research Center, Moffett Field, California, USA
Richard Boyle
Lawrence Berkeley National Laboratory, Berkeley, California, USA
Bahram Parvin
Desert Research Institute, Reno, Nevada, USA
Darko Koracin
University of Houston, Houston, Texas, USA
Ioannis Pavlidis
IBM T.J. Watson Research Center, Yorktown Heights, New York, USA
Rogerio Feris
Purdue University, West Lafayette, Indiana, USA
Tim McGraw
Side Effects Software, Santa Monica, California, USA
Mark Elendt
The DiVE, Durham, North Carolina, USA
Regis Kopper
Texas A&M University, College Station, Texas, USA
Eric Ragan
Kent State University, Kent, Ohio, USA
Zhao Ye
Lawrence Berkeley National Laboratory, Berkeley, California, USA
Gunther Weber

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tenorio, D., Rivera, V., Medina, J., Leondar, A., Gaumer, M., Dodds, Z. (2015). Visual Autonomy via 2D Matching in Rendered 3D Models. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2015. Lecture Notes in Computer Science(), vol 9474. Springer, Cham. https://doi.org/10.1007/978-3-319-27857-5_34

Download citation

DOI: https://doi.org/10.1007/978-3-319-27857-5_34
Published: 18 December 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-27856-8
Online ISBN: 978-3-319-27857-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics