
The QV family compared to other reinforcement learning algorithms


Abstract:

This paper describes several new online model-free reinforcement learning (RL) algorithms. We designed three new RL algorithms, namely QV2, QVMAX, and QVMAX2, all based on the QV-learning algorithm; unlike QV-learning, QVMAX and QVMAX2 are off-policy RL algorithms, while QV2 is a new on-policy RL algorithm. We experimentally compare these algorithms to a large number of other RL algorithms, namely Q-learning, Sarsa, R-learning, Actor-Critic, QV-learning, and ACLA. We show experiments on five maze problems of varying complexity, as well as experimental results on the cart pole balancing problem. The results show that for different problems there can be large performance differences between the algorithms, and that no single RL algorithm always performs best, although on average QV-learning scores highest.
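As context for the abstract, the QV-learning update on which the new algorithms are built can be sketched as follows. This is a hedged, minimal tabular illustration of the commonly published form of QV-learning, in which both a state-value function V and an action-value function Q are learned online, with both trained toward the same TD target r + γV(s'); the function name, table layout, and learning rates here are illustrative assumptions, not the paper's implementation.

```python
def qv_learning_step(Q, V, s, a, r, s_next,
                     alpha=0.1, beta=0.1, gamma=0.99):
    """One tabular QV-learning update (illustrative sketch).

    Both V and Q are moved toward the same TD target
    r + gamma * V(s_next); Q bootstraps from V rather
    than from itself, which is what distinguishes
    QV-learning from plain Q-learning or Sarsa.
    """
    target = r + gamma * V[s_next]          # shared TD target
    V[s] += beta * (target - V[s])          # state-value update
    Q[(s, a)] += alpha * (target - Q[(s, a)])  # action-value update
    return Q, V


# Tiny usage example on a two-state transition with reward 1.0:
V = {0: 0.0, 1: 0.0}
Q = {(0, "a"): 0.0}
qv_learning_step(Q, V, s=0, a="a", r=1.0, s_next=1)
```

The off-policy variants described in the abstract (QVMAX, QVMAX2) would instead, roughly speaking, use a max over action values in the bootstrapping target, analogous to the max operator in Q-learning.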
Date of Conference: 30 March 2009 - 02 April 2009
Date Added to IEEE Xplore: 15 May 2009
Print ISBN:978-1-4244-2761-1

Conference Location: Nashville, TN, USA
