Skip to main content

Locally-Connected Interrelated Network: A Forward Propagation Primitive

  • Conference paper
  • First Online:

Part of the book series: Springer Proceedings in Advanced Robotics ((SPAR,volume 17))

Abstract

End-to-end learning for planning is a promising approach for finding good robot strategies in situations where the state transition, observation, and reward functions are initially unknown. Many neural network architectures for this approach have shown positive results. Across these networks, seemingly small components have been used repeatedly in different architectures, which means improving the efficiency of these components has great potential to improve the overall performance of the network. This paper aims to improve one such component: The forward propagation module. In particular, we propose Locally-Connected Interrelated Network (LCI-Net)—a novel type of locally connected layer with unshared but interrelated weights—to improve the efficiency of information propagation and learning stochastic transition models for planning. LCI-Net is a small differentiable neural network module that can be plugged into various existing architectures. For evaluation purposes, we apply LCI-Net to QMDP-Net; QMDP-Net is a neural network for solving POMDP problems whose transition, observation, and reward functions are learned. Simulation tests on benchmark problems involving 2D and 3D navigation and grasping indicate promising results: Changing only the forward propagation module alone with LCI-Net improves QMDP-Net generalization capability by a factor of up to 10.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   229.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   299.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   299.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Neural networks, types, and functional programming. http://colah.github.io/posts/2015-09-NN-Types-FP/. Accessed 03 Sep 2019

  2. François-Lavet, V., Bengio, Y., Precup, D., Pineau, J.: Combined reinforcement learning via abstract representations. In: AAAI, vol. 33, pp. 3582–3589 (2019)

    Google Scholar 

  3. Gupta, S., Davidson, J., Levine, S., Sukthankar, R., Malik, J.: Cognitive mapping and planning for visual navigation. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017). https://doi.org/10.1109/CVPR.2017.769

  4. Haarnoja, T., Ajay, A., Levine, S., Abbeel, P.: Backprop KF: learning discriminative deterministic state estimators. In: NIPS Conference (2016)

    Google Scholar 

  5. Hausknecht, M., Stone, P.: Deep recurrent Q-learning for partially observable MDPs. In: AAAI 2015 Fall Symposium (2015)

    Google Scholar 

  6. Howard, A., Roy, N.: The robotics data set repository (radish) (2003). http://radish.sourceforge.net/

  7. Jonkowski, R., Brock, O.: End-to-End Learnable Histogram Filters (2017)

    Google Scholar 

  8. Kaelbling, L.P., Littman, M.L., Cassandra, A.R.: Planning and acting in partially observable stochastic domains. Artif. Intell. 101(1–2), 99–134 (1998)

    Article  MathSciNet  Google Scholar 

  9. Karkus, P., Hsu, D., Lee, W.S.: QMDP-net: deep learning for planning under partial observability. In: NIPS Conference (2017)

    Google Scholar 

  10. Karkus, P., Hsu, D., Lee, W.S.: Particle filter networks with application to visual localization. In: CoRL Conference (2018)

    Google Scholar 

  11. Karkus, P., Ma, X., Hsu, D., Kaelbling, L.P., Lee, W.S., Lozano-Perez, T.: Differentiable algorithm networks for composable robot learning. In: Robotics: Science and Systems (2019)

    Google Scholar 

  12. Lee, L., Parisotto, E., Chaplot, D.S., Xing, E., Salakhutdinov, R.: Gated path planning networks. In: ICML Conference (2018)

    Google Scholar 

  13. Littman, M.L., Cassandra, A.R., Kaelbling, L.P.: Learning policies for partially observable environments: scaling up. In: ICML (1995)

    Google Scholar 

  14. Mirowski, P., Pascanu, R., Viola, F., Soyer, H., Ballard, A.J., Banino, A., Denil, M., Goroshin, R., Sifre, L., Kavukcuoglu, K., Kumaran, D., Hadsell, R.: Learning to navigate in complex environments. In: ICLR Conference (2016)

    Google Scholar 

  15. Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A.A., Veness, J., Bellemare, M.G., Graves, A., Riedmiller, M., Fidjeland, A.K., Ostrovski, G., Petersen, S., Beattie, C., Sadik, A., Antonoglou, I., King, H., Kumaran, D., Wierstra, D., Legg, S., Hassabis, D.: Human-level control through deep reinforcement learning. Nature 518, 529 EP, February 2015. https://doi.org/10.1038/nature14236

  16. Oh, J., Guo, X., Lee, H., Lewis, R.L., Singh, S.: Action-conditional video prediction using deep networks in atari games. In: NIPS Conference, pp. 2863–2871 (2015)

    Google Scholar 

  17. Oh, J., Singh, S., Lee, H.: Value prediction network. In: NIPS Conference, pp. 6118–6128 (2017)

    Google Scholar 

  18. Okada, M., Rigazio, L., Aoshima, T.: Path Integral Networks: End-to-End Differentiable Optimal Control (2017)

    Google Scholar 

  19. Shankar, T., Dwivedy, S.K., Guha, P.: Reinforcement learning via recurrent convolutional neural networks. In: ICPR Conference. pp. 2592–2597, December 2016

    Google Scholar 

  20. Sondik, E.: The optimal control of partially observable Markov processes. Ph.D. thesis, Stanford University (1971)

    Google Scholar 

  21. Tamar, A., Wu, Y., Thomas, G., Levine, S., Abbeel, P.: Value iteration networks. In: IJCAI Conference, August 2017

    Google Scholar 

  22. Wahlström, N., Schön, T.B., Deisenroth, M.P.: Learning deep dynamical models from image pixels. In: The 17th IFAC Symposium on System Identification (SYSID) (2015)

    Google Scholar 

Download references

Acknowledgements

Nicholas Collins is supported by an Australian Government Research Training Program (RTP) scholarship provided by the University of Queensland.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nicholas Collins .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Collins, N., Kurniawati, H. (2021). Locally-Connected Interrelated Network: A Forward Propagation Primitive. In: LaValle, S.M., Lin, M., Ojala, T., Shell, D., Yu, J. (eds) Algorithmic Foundations of Robotics XIV. WAFR 2020. Springer Proceedings in Advanced Robotics, vol 17. Springer, Cham. https://doi.org/10.1007/978-3-030-66723-8_8

Download citation

Publish with us

Policies and ethics