Reward-Based Learning of a Memory-Required Task Based on the Internal Dynamics of a Chaotic Neural Network

Matsuki, Toshitaka; Shibata, Katsunari

doi:10.1007/978-3-319-46687-3_42

Conference paper
First Online: 29 September 2016

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 9947)

Abstract

We expect that dynamic higher functions such as “thinking” will emerge, growing out of exploration, in a reinforcement learning (RL) framework that uses a chaotic neural network (NN). In this framework, the chaotic internal dynamics serve as exploration, which eliminates the need to inject external exploration noise. A special RL method that introduces “traces” has been proposed for this framework. Separately, reservoir computing has shown an excellent ability to learn dynamic patterns, and Hoerzer et al. showed that such learning can be driven by rewards and exploration noise instead of explicit teacher signals. In this paper, aiming to bring this learning ability into our new RL framework, we show that the memory-required task from the work of Hoerzer et al. can be learned without external exploration noise by exploiting the chaotic internal dynamics, with the exploration level adjusted flexibly and autonomously. The task could also be learned using “traces”, though problems remain.
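
The idea that internal dynamics can stand in for injected exploration noise can be illustrated with a small toy simulation. The sketch below is not the network or learning rule used in the paper; it only assumes a generic random recurrent network updated as x ← tanh(W x), with hypothetical choices for the size (100 units), the number of steps, and two coupling gains. With weak coupling the activity dies out, while with strong coupling the network keeps fluctuating on its own, with no external input or noise term.

```python
import math
import random


def simulate(gain, n=100, steps=200, seed=0):
    """Iterate x <- tanh(W x) with no external input and no noise term.

    W has i.i.d. Gaussian entries with standard deviation gain / sqrt(n)
    (an illustrative assumption, not the paper's setup).
    Returns the root-mean-square activity of the units at every step.
    """
    rng = random.Random(seed)
    scale = gain / math.sqrt(n)
    w = [[rng.gauss(0.0, scale) for _ in range(n)] for _ in range(n)]
    x = [rng.uniform(-0.1, 0.1) for _ in range(n)]  # small random initial state
    rms = []
    for _ in range(steps):
        # Synchronous update of all units from the previous state.
        x = [math.tanh(sum(wij * xj for wij, xj in zip(row, x))) for row in w]
        rms.append(math.sqrt(sum(v * v for v in x) / n))
    return rms


weak = simulate(gain=0.5)    # weak coupling: activity decays toward zero
strong = simulate(gain=1.5)  # strong coupling: activity sustains itself
print(f"final RMS activity, gain 0.5: {weak[-1]:.3e}")
print(f"final RMS activity, gain 1.5: {strong[-1]:.3e}")
```

In a reward-based scheme of the kind the abstract describes, fluctuations like those in the strong-coupling run would play the role that an explicit noise source plays in the setup of Hoerzer et al.; how the exploration level is then adjusted during learning is specific to the paper and is not modeled here.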

References

Shibata, K., Okabe, Y.: Reinforcement learning when visual signals are directly given as inputs. In: Proceedings of ICNN 1997, vol. 3, pp. 1716–1720 (1997)
Shibata, K.: Emergence of intelligence through reinforcement learning with a neural network. In: Mellouk, A. (ed.) Advances in Reinforcement Learning, pp. 99–120. InTech, Rijeka (2011)
Shibata, K., Utsunomiya, H.: Discovery of pattern meaning from delayed rewards by reinforcement learning with a recurrent neural network. In: Proceedings of IJCNN, pp. 1445–1452 (2011)
Shibata, K., Goto, K.: Emergence of flexible prediction-based discrete decision making and continuous motion generation through actor-q-learning. In: Proceedings of ICDL-Epirob, ID 15 (2013)
Sawatsubashi, Y., et al.: Emergence of discrete and abstract state representation in continuous input task. In: Robot Intelligence Technology and Applications, pp. 13–22 (2012)
Shibata, K., Sakashita, Y.: Reinforcement learning with internal-dynamics-based exploration using a chaotic neural network. In: Proceedings of International Joint Conference on Neural Networks (IJCNN) (2015)
Goto, Y., Shibata, K.: Emergence of higher exploration in reinforcement learning using a chaotic neural network. In: Hirose, A., Ozawa, S., Doya, K., Ikeda, K., Lee, M., Liu, D. (eds.) ICONIP 2016. LNCS, pp. 40–48. Springer, Heidelberg (2016)
Jaeger, H.: The “echo state” approach to analysing and training recurrent neural networks. GMD report 148, p. 43 (2001)
Maass, W., Natschläger, T., Markram, H.: Real-time computing without stable states: a new framework for neural computation based on perturbations. Neural Comput. 14(11), 2531–2560 (2002)
Sussillo, D., Abbott, L.F.: Generating coherent patterns of activity from chaotic neural networks. Neuron 63(4), 544–557 (2009)
Hoerzer, G.M., Legenstein, R., Maass, W.: Emergence of complex computational structures from chaotic neural networks through reward-modulated Hebbian learning. Cereb. Cortex 24(3), 677–690 (2014)

Acknowledgement

The authors wish to thank Prof. Hiromichi Suetani for introducing FORCE Learning and the work of Hoerzer et al. to us. This work was supported by JSPS KAKENHI Grant Number 15K00360.

Author information

Authors and Affiliations

Oita University, 700 Dannoharu, Oita, Japan
Toshitaka Matsuki & Katsunari Shibata

Corresponding author

Correspondence to Toshitaka Matsuki.

Editor information

Editors and Affiliations

The University of Tokyo, Tokyo, Japan
Akira Hirose
Kobe University, Kobe, Japan
Seiichi Ozawa
Okinawa Institute of Science and Technology Graduate University, Onna, Japan
Kenji Doya
Nara Institute of Science and Technology, Ikoma, Japan
Kazushi Ikeda
Kyungpook National University, Daegu, Korea (Republic of)
Minho Lee
Chinese Academy of Sciences, Beijing, China
Derong Liu

Cite this paper

Matsuki, T., Shibata, K. (2016). Reward-Based Learning of a Memory-Required Task Based on the Internal Dynamics of a Chaotic Neural Network. In: Hirose, A., Ozawa, S., Doya, K., Ikeda, K., Lee, M., Liu, D. (eds) Neural Information Processing. ICONIP 2016. Lecture Notes in Computer Science, vol 9947. Springer, Cham. https://doi.org/10.1007/978-3-319-46687-3_42

DOI: https://doi.org/10.1007/978-3-319-46687-3_42
Published: 29 September 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46686-6
Online ISBN: 978-3-319-46687-3
eBook Packages: Computer Science, Computer Science (R0)
