Abstract:
This paper introduces a kernel adaptive filter, kernel Temporal Difference (TD)(λ), implemented with stochastic gradient descent on temporal differences to estimate the state-action value function in reinforcement learning; the case λ = 0 is studied here. Experimental results show the method's applicability to learning motor-state decoding during a center-out reaching task performed by a monkey. The results are compared to a time delay neural network (TDNN) trained with backpropagation of the temporal difference error. In the experiments, kernel TD(0) converges faster and reaches a better solution than the neural network.
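The abstract describes the algorithm only at a high level. As a rough illustration of the family it belongs to, the sketch below implements generic TD(0) learning with a growing Gaussian-kernel expansion: the TD error drives a stochastic-gradient update that adds a new kernel center at each visited state. The class and parameter names (KernelTD0, eta, sigma), the Gaussian kernel choice, and the unbounded center growth are assumptions for illustration, not details taken from the paper.

import numpy as np

# Minimal sketch of TD(0) with a growing kernel expansion,
# assuming a Gaussian kernel (an assumption, not the paper's spec).
class KernelTD0:
    def __init__(self, eta=0.1, gamma=0.9, sigma=1.0):
        self.eta = eta        # learning rate (step size)
        self.gamma = gamma    # discount factor
        self.sigma = sigma    # Gaussian kernel bandwidth
        self.centers = []     # stored input samples (kernel centers)
        self.weights = []     # one coefficient per center

    def _kernel(self, x, c):
        d = np.asarray(x, dtype=float) - np.asarray(c, dtype=float)
        return np.exp(-np.dot(d, d) / (2.0 * self.sigma ** 2))

    def value(self, x):
        # f(x) = sum_i w_i * k(c_i, x)
        return sum(w * self._kernel(x, c)
                   for w, c in zip(self.weights, self.centers))

    def update(self, x, r, x_next):
        # TD(0) error: delta = r + gamma * V(x') - V(x)
        delta = r + self.gamma * self.value(x_next) - self.value(x)
        # Stochastic-gradient step in the kernel space: allocate a
        # new center at x with coefficient eta * delta.
        self.centers.append(np.asarray(x, dtype=float))
        self.weights.append(self.eta * delta)
        return delta

# Example usage on a single transition (x, r, x'):
# agent = KernelTD0(eta=0.2, gamma=0.8)
# agent.update(x=[0.0, 1.0], r=1.0, x_next=[0.5, 0.5])

Because the kernel expansion grows with the data, the function class adapts its capacity online, in contrast to the fixed-topology TDNN the paper uses as a baseline; this difference is one plausible reason for the faster convergence reported in the abstract.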
Published in: 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society
Date of Conference: 30 August 2011 - 03 September 2011
Date Added to IEEE Xplore: 01 December 2011
PubMed ID: 22255624