Learning to navigate in a virtual world using optic flow and stereo disparity signals

  • Original Article
  • Published in Artificial Life and Robotics

Abstract

Navigating in a complex world is challenging in that the rich, real environment provides a very large number of sensory states that can immediately precede a collision. Biological organisms such as rodents are able to solve this problem, effortlessly navigating in closed spaces by encoding in neural representations distance toward walls or obstacles for a given direction. This paper presents a method that can be used by virtual (simulated) or robotic agents, which uses states similar to neural representations to learn collision avoidance. Unlike other approaches, our reinforcement learning approach uses a small number of states defined by discretized distances along three constant directions. These distances are estimated either from optic flow or binocular stereo information. Parameterized templates for optic flow or disparity information are compared against the input flow or input disparity to estimate these distances. Simulations in a virtual environment show learning of collision avoidance. Our results show that learning with only stereo information is superior to learning with only optic flow information. Our work motivates the usage of abstract state descriptions for the learning of visual navigation. Future work will focus on the fusion of optic flow and stereo information, and transferring these models to robotic platforms.
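As a rough illustration of the state construction described above (a sketch, not the authors' implementation): each of the three fixed view directions yields a distance estimate, here recovered from disparity via the standard pinhole-stereo relation, and the three distances are discretized into a single small state index. The focal length, baseline, and bin edges below are illustrative assumptions, not values from the paper.

```python
import numpy as np

def distance_from_disparity(disparity_px, focal_px=500.0, baseline_m=0.1):
    """Standard pinhole stereo relation Z = f * B / d.
    Focal length and baseline are assumed values, not the paper's."""
    return focal_px * baseline_m / disparity_px

def encode_state(distances_m, bin_edges=(0.5, 1.0, 2.0, 4.0)):
    """Discretize distances along three fixed view directions
    (e.g. left, ahead, right) into one small state index."""
    bins = np.digitize(distances_m, bin_edges)  # each in 0..len(bin_edges)
    n = len(bin_edges) + 1                      # bins per direction
    return int(bins[0] * n * n + bins[1] * n + bins[2])

# Disparities measured along three directions -> distances -> state index.
d = [distance_from_disparity(dp) for dp in (70.0, 16.0, 35.0)]
state = encode_state(d)
```

With four bin edges per direction this yields at most 5³ = 125 states, illustrating how such an abstract description keeps the state space small enough for tabular reinforcement learning.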



Acknowledgments

All authors are supported in part by CELEST, an NSF Science of Learning Center (SMA-0835976). FR acknowledges support from the Office of Naval Research (ONR N00014-11-1-0535 and ONR MURI N00014-10-1-0936). MV acknowledges support from the National Aeronautics and Space Administration (NASA NNX12AH31G).

Author information

Correspondence to Florian Raudies.

Appendix

We provide pseudo-code for the proposed bio-inspired models that estimate the distances of walls from stereo or optic flow information (Tables 3, 4, 5).

Table 3 Pseudo-code for the stereo-based template model
Table 4 Pseudo-code for the flow-based template model
Table 5 Pseudo-code for Q-learning with ε-greedy action selection
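The tabular Q-learning update with ε-greedy action selection (Table 5) can be sketched as follows. The state and action counts, learning rate, discount factor, and exploration rate are illustrative assumptions, not the paper's settings.

```python
import numpy as np

rng = np.random.default_rng(0)

N_STATES, N_ACTIONS = 125, 3            # e.g. 5^3 distance states; turn left / go straight / turn right
alpha, gamma, epsilon = 0.1, 0.95, 0.1  # illustrative hyperparameters

Q = np.zeros((N_STATES, N_ACTIONS))     # tabular action-value estimates

def select_action(state):
    """ε-greedy: explore uniformly with probability ε, otherwise exploit."""
    if rng.random() < epsilon:
        return int(rng.integers(N_ACTIONS))
    return int(np.argmax(Q[state]))

def q_update(s, a, r, s_next):
    """One-step Q-learning (Watkins) update toward the TD target."""
    td_target = r + gamma * np.max(Q[s_next])
    Q[s, a] += alpha * (td_target - Q[s, a])
```

In a collision-avoidance setting the reward `r` would typically be negative on collision and zero (or small and positive) otherwise, so the table gradually assigns low values to actions that lead toward near-wall states.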

About this article


Cite this article

Raudies, F., Eldridge, S., Joshi, A. et al. Learning to navigate in a virtual world using optic flow and stereo disparity signals. Artif Life Robotics 19, 157–169 (2014). https://doi.org/10.1007/s10015-014-0153-1


Keywords

Navigation