Deep Reinforcement Learning Methods for Navigational Aids

Fakhri, Bijan; Keech, Aaron; Schlosser, Joel; Brooks, Ethan; Venkateswara, Hemanth; Panchanathan, Sethuraman; Kira, Zsolt

doi:10.1007/978-3-030-04375-9_6

Bijan Fakhri¹⁵,
Aaron Keech¹⁷,
Joel Schlosser¹⁷,
Ethan Brooks¹⁷,
Hemanth Venkateswara¹⁵,
Sethuraman Panchanathan¹⁵ &
…
Zsolt Kira^16,17

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11010))

Included in the following conference series:

International Conference on Smart Multimedia

1251 Accesses

Abstract

Navigation is one of the most complex daily activities we engage in. Partly due to its complexity, navigational abilities are vulnerable to many conditions including Topographical Agnosia, Alzheimer’s Disease, and vision impairments. While navigation using solely vision remains a difficult problem in the field of assistive technology, emerging methods in Deep Reinforcement Learning and Computer Vision show promise in producing vision-based navigational aids for those with navigation impairments. To this effect, we introduce GraphMem, a Neural Computing approach to navigation tasks and compare it to several state of the art Neural Computing methods in a one-shot, 3D, first-person maze solving task. Comparing GraphMem to current methods in navigation tasks unveils insights into navigation and represents a first step towards employing these emerging techniques in navigational assistive technology.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Abadi, et al.: TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems (2016)
Google Scholar
Allamanis, M., Brockschmidt, M., Khademi, M.: Learning to Represent Programs with Graphs. In: ICLR (2018)
Google Scholar
Bengio, Y.: Continuous control with deep reinforcement learning. Foundations and Trends\(\textregistered \). Mach. Learn. 2(1), 1–127 (2009)
Google Scholar
Bourne, R.R.A., et al.: Magnitude, temporal trends, and projections of the global prevalence of blindness and distance and near vision impairment: a systematic review and meta-analysis. Lancet Global Health 5(9), e888–e897 (2017)
Article Google Scholar
Brockman, G., et al.: OpenAI Gym (2016)
Google Scholar
Duan, Y., Schulman, J., Chen, X., Bartlett, P., Sutskever, I., Abbeel, P.: RL\(\hat{}2\): fast reinforcement learning via slow reinforcement learning. arXiv, pp. 1–14 (2016)
Google Scholar
Duthey, B.: Background Paper 6.11 Alzheimer Disease and other Dementias, Update on 2004. World Health Organization, pp. 1–77, February 2013
Google Scholar
Graves, A., Wayne, G., Danihelka, I.: Neural Turing Machines. arXiv, pp. 1–26, October 2014
Google Scholar
Graves, A., et al.: Hybrid computing using a neural network with dynamic external memory. Nat. Res. (2016)
Google Scholar
Gulcehre, C., Chandar, S., Bengio, Y.: Memory Augmented Neural Networks with Wormhole Connections. arXiv, pp. 1–27 (2017)
Google Scholar
Gupta, S., Davidson, J., Levine, S., Sukthankar, R., Malik, J.: Cognitive mapping and planning for visual navigation. In: CVPR (2017)
Google Scholar
Heess, N., Hunt, J.J., Lillicrap, T.P., Silver, D.: Memory-based control with recurrent neural networks (2015)
Google Scholar
Hochreiter, S., Urgen Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Article Google Scholar
Jang, E., Gu, S., Poole, B.: Categorical Reparameterization with Gumbel-Softmax. In: International Conference on Learning Representations, pp. 1–13 (2017)
Google Scholar
Johnson, M., Hofmann, K., Hutton, T., Bignell, D.: The Malmo platform for artificial intelligence experimentation. In: IJCAI International Joint Conference on Artificial Intelligence, January 2016, pp. 4246–4247 (2016)
Google Scholar
Karpathy, A., Li, F.F.: Deep visual-semantic alignments for generating image descriptions. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 39, no. 4, pp. 3128–3137 (2015)
Google Scholar
Kempka, M., Wydmuch, M., Runc, G., Toczek, J., Jaskowski, W.: ViZDoom: a doom-based AI research platform for visual reinforcement learning. In: IEEE Conference on Computational Intelligence and Games, CIG (2017)
Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: Advances In Neural Information Processing Systems, pp. 1–9 (2012)
Google Scholar
Mirowski, et al.: Learning to Navigate in Cities Without a Map (2018)
Google Scholar
Mirowski, P., et al.: Learning to navigate in complex environments. ICLR (2017)
Google Scholar
Mnih, V., et al.: Human-level control through deep reinforcement learning. Nature 518(7540), 529–533 (2015)
Article Google Scholar
Mnih, et al.: Asynchronous methods for deep reinforcement learning. arXiv preprint, vol. 48, pp. 1–28. arXiv:1602.01783v1 [cs.LG] (2016)
Mnih, V., et al.: Playing Atari with deep reinforcement learning. arXiv (2013). https://www.cs.toronto.edu/~vmnih/docs/dqn.pdf
Monacelli, A.M., Cushman, L.A., Kavcic, V., Duffy, C.J.: Spatial disorientation in Alzheimer’s disease: the remembrance of things passed. Neurology 61(11), 1491–1497 (2003)
Article Google Scholar
Moser, M.B., Rowland, D.C., Moser, E.I.: Place cells, grid cells, and memory. Cold Spring Harb. Perspect. Biol. 7(2), a021808 (2015)
Article Google Scholar
Oh, J., Chockalingam, V., Singh, S., Lee, H.: Control of Memory, Active Perception, and Action in Minecraft. arXiv:1605.09128 [cs] (2016)
Sanchez-Gonzalez, A., et al.: Graph networks as learnable physics engines for inference and control (2018)
Google Scholar
Santoro, A., Bartunov, S., Botvinick, M., Wierstra, D., Lillicrap, T.: One-shot Learning with Memory-Augmented Neural Networks. arXiv:1605.06065 [cs], May 2016
Schulman, J., Levine, S., Jordan, M., Abbeel, P.: Trust region policy optimization. In: ICML 2015, p. 16, February 2015
Google Scholar
Wolbers, T., Hegarty, M.: What determines our navigational abilities? Trends Cogn. Sci. 14(3), 138–146 (2010)
Article Google Scholar
Xie, L., Wang, S., Markham, A., Trigoni, N.: Towards Monocular Vision based Obstacle Avoidance Through Deep Reinforcement Learning. Robotics: Science and Systems Workshop 2017: New Frontiers for Deep Learning in Robotics (2017)
Google Scholar
Zhang, J., Springenberg, J.T., Boedecker, J., Burgard, W.: Deep reinforcement learning with successor features for navigation across similar environments. In: IEEE International Conference on Intelligent Robots and Systems, September 2017, pp. 2371–2378 (2017)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computing, Informatics, and Decision Systems Engineering, Arizona State University, Tempe, AZ, 85281, USA
Bijan Fakhri, Hemanth Venkateswara & Sethuraman Panchanathan
School of Interactive Computing, Georgia Tech, 85 5th St. NW, Atlanta, GA, USA
Zsolt Kira
Georgia Tech Research Institute, 250 15th St. NW, Atlanta, GA, USA
Aaron Keech, Joel Schlosser, Ethan Brooks & Zsolt Kira

Authors

Bijan Fakhri
View author publications
You can also search for this author in PubMed Google Scholar
Aaron Keech
View author publications
You can also search for this author in PubMed Google Scholar
Joel Schlosser
View author publications
You can also search for this author in PubMed Google Scholar
Ethan Brooks
View author publications
You can also search for this author in PubMed Google Scholar
Hemanth Venkateswara
View author publications
You can also search for this author in PubMed Google Scholar
Sethuraman Panchanathan
View author publications
You can also search for this author in PubMed Google Scholar
Zsolt Kira
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hemanth Venkateswara .

Editor information

Editors and Affiliations

University of Alberta, Edmonton, AB, Canada
Anup Basu
Dipartimento di Ingegneria, Università degli Studi di Firenze, Florence, Italy
Stefano Berretti

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Fakhri, B. et al. (2018). Deep Reinforcement Learning Methods for Navigational Aids. In: Basu, A., Berretti, S. (eds) Smart Multimedia. ICSM 2018. Lecture Notes in Computer Science(), vol 11010. Springer, Cham. https://doi.org/10.1007/978-3-030-04375-9_6

Download citation

DOI: https://doi.org/10.1007/978-3-030-04375-9_6
Published: 08 December 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-04374-2
Online ISBN: 978-3-030-04375-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics