Adaptive co-construction of state and action spaces in reinforcement learning

Nagayoshi, Masato; Murao, Hajime; Tamaki, Hisashi

doi:10.1007/s10015-011-0883-2

Original Article
Published: 29 June 2011

Volume 16, pages 48–52 (2011)

Artificial Life and Robotics

Masato Nagayoshi¹,
Hajime Murao² &
Hisashi Tamaki³

Abstract

Reinforcement learning (RL) attracts much attention as a technique for realizing computational intelligence such as adaptive and autonomous decentralized systems. In general, however, it is not easy to put RL to practical use. One difficulty is the problem of designing suitable state and action spaces for an agent. Previously, we proposed an adaptive state space construction method called a “state space filter” and an adaptive action space construction method called “switching RL,” each of which is applied after the other space has been fixed. In this article, we reconstitute these two construction methods as a single method by treating them together as a way of mimicking an infant’s perceptual and motor development: perceptual differentiation progresses as the infant becomes older and more experienced, and motor development progresses with gross motor skills developing before fine motor skills. The proposed method is based on introducing and referring to “entropy.” A computational experiment was conducted using a so-called “path planning problem” with continuous state and action spaces, and the results confirmed the validity of the proposed method.
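
The abstract does not say how entropy is computed or used, so the following is only an illustrative sketch and not the authors' procedure. It assumes a Boltzmann (softmax) policy over a discrete set of action values and measures how undecided that policy is in a given state. The function names, the temperature parameter, and the threshold rule in `is_undecided` are all hypothetical.

```python
import math


def boltzmann_probs(q_values, temperature=1.0):
    """Softmax action-selection probabilities (assumed policy form)."""
    m = max(q_values)  # subtract the maximum for numerical stability
    weights = [math.exp((q - m) / temperature) for q in q_values]
    total = sum(weights)
    return [w / total for w in weights]


def normalized_entropy(probs):
    """Shannon entropy of a distribution, scaled to the range [0, 1]."""
    n = len(probs)
    if n < 2:
        return 0.0
    h = -sum(p * math.log(p) for p in probs if p > 0.0)
    return h / math.log(n)


def is_undecided(q_values, threshold=0.9, temperature=1.0):
    """Hypothetical rule: True when the policy is still close to uniform.

    The threshold and the way such a flag would drive state or action
    space adjustment are assumptions made only for this illustration.
    """
    probs = boltzmann_probs(q_values, temperature)
    return normalized_entropy(probs) > threshold
```

With equal action values the normalized entropy is at its maximum of 1, and it falls toward 0 as one action comes to dominate. An entropy-referenced method could monitor a quantity of this kind during learning, although the criterion used in the article may differ.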

References

Sutton RS, Barto AG (1998) Reinforcement learning. Bradford Books, MIT Press, Cambridge
Nagayoshi M, Murao H, Tamaki H (2006) A state space filter for reinforcement learning. Proceedings AROB 11th’ 06, pp 615–618 (GS1–3)
Nagayoshi M, Murao H, Tamaki H (2010) A reinforcement learning with switching controllers for continuous action space. Artif Life Robotics 15:97–100

Author information

Authors and Affiliations

Niigata College of Nursing, 240 Shinnan, Joetsu, 943-0147, Japan
Masato Nagayoshi
Faculty of Cross-Cultural Studies, Kobe University, Kobe, Japan
Hajime Murao
Graduate School of Engineering, Kobe University, Kobe, Japan
Hisashi Tamaki

Corresponding author

Correspondence to Masato Nagayoshi.

Additional information

This work was presented in part at the 16th International Symposium on Artificial Life and Robotics, Oita, Japan, January 27–29, 2011.

About this article

Cite this article

Nagayoshi, M., Murao, H. & Tamaki, H. Adaptive co-construction of state and action spaces in reinforcement learning. Artif Life Robotics 16, 48–52 (2011). https://doi.org/10.1007/s10015-011-0883-2

Received: 11 February 2011
Accepted: 11 February 2011
Published: 29 June 2011
Issue Date: June 2011
DOI: https://doi.org/10.1007/s10015-011-0883-2
