Abstract
A “concept” is a discrete and abstract form of state representation, and is considered useful for efficient action planning. However, concepts are thought to emerge in the brain, a parallel processing and learning system, through learning from a variety of experiences, and so they are difficult to develop by hand-coding. In this paper, as a preliminary step toward “concept formation”, we investigate whether a discrete and abstract state representation is formed through learning in a task with multi-step state transitions, using the Actor-Q learning method and a recurrent neural network. After learning, the agent twice repeated a sequence in which it pushed a button to open a door and moved to the next room, finally arriving at the third room to receive a reward. In two hidden neurons, a discrete and abstract state representation that did not depend on the door-opening pattern was observed. The result of another learning run with two recurrent neural networks, one for the Q-values and one for the Actors, suggested that the state representation emerged in order to generate appropriate Q-values.
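The architecture described in the abstract, a recurrent network that maps continuous input and its own hidden state to Q-values for discrete actions and to continuous actor outputs, can be illustrated with a minimal sketch. This is an assumption-laden simplification (a plain Elman-style RNN with arbitrary sizes and random weights), not the chapter's actual network or training procedure:

```python
import numpy as np

class ActorQRNN:
    """Minimal sketch of an Actor-Q style recurrent network.

    Assumptions (not from the chapter): Elman-style recurrence, tanh
    activations, arbitrary layer sizes, untrained random weights.
    """

    def __init__(self, n_in, n_hidden, n_q, n_actor, seed=0):
        rng = np.random.default_rng(seed)
        self.W_in = rng.normal(0.0, 0.1, (n_hidden, n_in))
        self.W_rec = rng.normal(0.0, 0.1, (n_hidden, n_hidden))
        self.W_q = rng.normal(0.0, 0.1, (n_q, n_hidden))
        self.W_a = rng.normal(0.0, 0.1, (n_actor, n_hidden))
        self.h = np.zeros(n_hidden)

    def reset(self):
        # clear the context at the start of an episode
        self.h = np.zeros_like(self.h)

    def step(self, x):
        # the hidden state carries context across multi-step state
        # transitions; it is where the discrete, abstract representation
        # would be observed after learning
        self.h = np.tanh(self.W_in @ x + self.W_rec @ self.h)
        q = self.W_q @ self.h            # one Q-value per discrete action
        a = np.tanh(self.W_a @ self.h)   # continuous actor outputs
        return q, a

# one forward step on a continuous sensory input
net = ActorQRNN(n_in=3, n_hidden=2, n_q=2, n_actor=2)
q, a = net.step(np.array([1.0, 0.0, 0.5]))
```

In the chapter's second experiment the Q-value and Actor heads are split into two separate recurrent networks; the single shared hidden layer above corresponds only to the first setup.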
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
Cite this chapter
Sawatsubashi, Y., Samusudin, M.F.b., Shibata, K. (2013). Emergence of Discrete and Abstract State Representation through Reinforcement Learning in a Continuous Input Task. In: Kim, JH., Matson, E., Myung, H., Xu, P. (eds) Robot Intelligence Technology and Applications 2012. Advances in Intelligent Systems and Computing, vol 208. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37374-9_2
DOI: https://doi.org/10.1007/978-3-642-37374-9_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37373-2
Online ISBN: 978-3-642-37374-9
eBook Packages: Engineering (R0)