Abstract
In this article, we examine the learning performance of Q-learning under different conditions using Voronoi Q-value elements (VQEs), which store reward-based Q-values and determine how an agent acts in a given state of a single-agent environment. To test our hypotheses, we performed computational experiments in which the VQEs, used to estimate the optimal Q-values of state-action pairs for continuous-valued inputs, were arranged on a lattice rotated by various angles, the agent's four actions were rotated by various angles, and the VQEs were also placed at random. The results show that the learning performance changes with the relative angle between the VQE arrangement and the action directions.
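As a point of reference, the following minimal sketch, written for this summary and not taken from the article, illustrates the kind of VQE-based Q-learning the experiments evaluate: continuous two-dimensional states are mapped to the nearest VQE site, Q-values are stored per site and per action (four actions), and the sites can be laid out on a lattice rotated by an arbitrary angle. All class and function names, parameter values, and the lattice-generation helper are illustrative assumptions.

# A minimal sketch (not the authors' implementation) of Q-learning over
# Voronoi Q-value elements (VQEs): each VQE is a site in the continuous
# 2-D state space, and a continuous state is mapped to its nearest site.
# Site placement (lattice spacing, rotation angle) is an assumed setup.
import numpy as np

class VoronoiQLearner:
    def __init__(self, sites, n_actions=4, alpha=0.1, gamma=0.9, epsilon=0.1):
        self.sites = np.asarray(sites)               # (N, 2) VQE positions
        self.q = np.zeros((len(self.sites), n_actions))
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.n_actions = n_actions

    def nearest_site(self, state):
        # Index of the VQE whose Voronoi cell contains the state
        return int(np.argmin(np.linalg.norm(self.sites - np.asarray(state), axis=1)))

    def select_action(self, state):
        # Epsilon-greedy choice over the Q-values of the nearest VQE
        i = self.nearest_site(state)
        if np.random.rand() < self.epsilon:
            return np.random.randint(self.n_actions)
        return int(np.argmax(self.q[i]))

    def update(self, state, action, reward, next_state):
        # Standard one-step Q-learning update on the discretised cells
        i, j = self.nearest_site(state), self.nearest_site(next_state)
        target = reward + self.gamma * np.max(self.q[j])
        self.q[i, action] += self.alpha * (target - self.q[i, action])

def lattice_sites(n=10, spacing=1.0, angle_deg=0.0):
    # VQEs on an n x n lattice, optionally rotated by angle_deg (assumed setup)
    xs, ys = np.meshgrid(np.arange(n) * spacing, np.arange(n) * spacing)
    pts = np.stack([xs.ravel(), ys.ravel()], axis=1)
    t = np.deg2rad(angle_deg)
    rot = np.array([[np.cos(t), -np.sin(t)], [np.sin(t), np.cos(t)]])
    return pts @ rot.T

Varying angle_deg in lattice_sites (and, analogously, rotating the four action directions) changes the relative alignment between the Voronoi cells and the actions, which is the relationship the experiments investigate.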
Cite this article
Aung, K.T., Fuchida, T. A comparison of learning performance in two-dimensional Q-learning by the difference of Q-values alignment. Artif Life Robotics 16, 473–477 (2012). https://doi.org/10.1007/s10015-011-0961-5