Skip to main content

Visual Autonomy via 2D Matching in Rendered 3D Models

  • Conference paper
  • First Online:
Book cover Advances in Visual Computing (ISVC 2015)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 9474))

Included in the following conference series:

  • 2788 Accesses

Abstract

As they decrease in price and increase in fidelity, visually-textured 3D models offer a foundation for robotic spatial reasoning that can support a huge variety of platforms and tasks. This work investigates the capabilities, strengths, and drawbacks of a new sensor, the Matterport 3D camera, in the context of several robot applications. By using hierarchical 2D matching into a database of images rendered from a visually-textured 3D model, this work demonstrates that – when similar cameras are used – 2D matching into visually-textured 3D maps yields excellent performance on both global-localization and local-servoing tasks. When the 2D-matching spans very different camera transforms, however, we show that performance drops significantly. To handle this situation, we propose and prototype a map-alignment phase, in which several visual representations of the same spatial environment overlap: one to support the image-matching needed for visual localization, and the other carrying a global coordinate system needed for task accomplishment, e.g., point-to-point positioning.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Matterport. matterport.com. Accessed 20 October 2015

  2. Oliver, A., Kang, S., Wunsche, B.C., MacDonald, B.: Using the Kinect as a navigation sensor for mobile robotics. In: Proceedings of IVCNZ 2012, pp 509–514, Dunedin, New Zealand. ACM, NY

    Google Scholar 

  3. Olesk, A., Wang, J.: Geometric and error analysis for 3D map-matching. In: Proceedings of IGNSS Symposium, 14 p, Queensland, Australia (2009)

    Google Scholar 

  4. Pinto, M., Moreira, A.P., Matos, A., Sobreira, H., Santos, F.: Fast 3D map matching localisation algorithm. J. Autom. Control Eng. 1(2), 110–114 (2013)

    Article  Google Scholar 

  5. Endres, F., Hess, J., Engelhard, N., Sturm, J.: In: Proceedings of ICRA 2012, pp. 1691–1696, 14–18 May 2012

    Google Scholar 

  6. Wu, C., Clipp, B., Li, X., Frahm, J.-M.: 3D model matching with viewpoint-invariant patches (VIP). In: Proceedings of CVPR 2008, pp. 1–8, 23–28 June 2008

    Google Scholar 

  7. Unity. unity3d.com. Accessed 20 August 2015

  8. Wang, Z., Bovik, A.C., Sheikh, H.R., Simoncelli, E.P.: Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13(4), 600–612 (2004)

    Article  Google Scholar 

  9. Rosebrock, A.L Pyimagesearch. www.pyimagesearch.com/

  10. The OpenCV Library [http://opencv.org/]; Bradski, G. The OpenCV Library Dr. Dobbs Journal, November 2000

  11. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)

    Article  Google Scholar 

  12. Bay, H., Ess, A., Tuytelaars, T., Van Gool, L.: SURF: speeded up robust features. Comput. Vis. Image Underst. (CVIU) 110(3), 346–359 (2008)

    Article  Google Scholar 

  13. Rublee, E., et al.: ORB: an efficient alternative to SIFT or SURF. In: 2011 IEEE International Conference on Computer Vision (ICCV). IEEE (2011)

    Google Scholar 

  14. Quigley, M., Gerkey, B., Conley, K., Faust, J., Foote, T., Leibs, J., Berger, E., Wheeler, R., Ng, A.Y.: ROS: an open-source robot operating system. In: Proceedings of Open-Source Software Workshop of the International Conference on Robotics and Automation (ICRA) (2009)

    Google Scholar 

  15. Watson, O., Touretzky, D.S.: Navigating with the Tekkotsu Pilot. In: Proceedings of FLAIRS-24, pp. 591–596, Palm Beach, FL, USA, 18–20 May 2011

    Google Scholar 

  16. Hartley, R., Zisserman, A.: Multiple View Geometry. Cambridge University Press, Cambridge (2004)

    Book  MATH  Google Scholar 

  17. Haralick, R., Lee, C., Ottenberg, K., Nolle, M.: Review and analysis of solutions of the three point perspective pose estimation problem. Int. J. Comput. Vis. 13(3), 331–356 (1994)

    Article  Google Scholar 

  18. Lynen, S., Sattler, T., Bosse, M., Hesch, J., Pollefeys, M., Siegwart, R.: Get Out of My Lab: Large-scale, Real-Time Visual-Inertial Localization. Science and Systems, Rovotics (2011)

    Google Scholar 

Download references

Acknowledgments

We acknowledge and thank the generous support of the NSF, though CISE REU Site project #1359170, as well as from Harvey Mudd College and its Computer Science department.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Z. Dodds .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Tenorio, D., Rivera, V., Medina, J., Leondar, A., Gaumer, M., Dodds, Z. (2015). Visual Autonomy via 2D Matching in Rendered 3D Models. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2015. Lecture Notes in Computer Science(), vol 9474. Springer, Cham. https://doi.org/10.1007/978-3-319-27857-5_34

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-27857-5_34

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-27856-8

  • Online ISBN: 978-3-319-27857-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics