
On the mean-square rate of convergence of temporal-difference learning algorithms

Abstract:

In this paper, the mean-square rate of convergence of temporal-difference learning algorithms is analyzed. The analysis is carried out for the case of a discounted cost function associated with a Markov chain with a finite-dimensional state space. Under mild conditions, it is shown that these algorithms converge at the rate O(n^{-1/2}). The results are illustrated with examples related to random coefficient autoregression models and M/G/1 queues.

Published in: Proceedings of the 2002 American Control Conference (IEEE Cat. No.CH37301)

Date of Conference: 08-10 May 2002

Date Added to IEEE Xplore: 07 November 2002

Print ISBN: 0-7803-7298-0

Print ISSN: 0743-1619

DOI: 10.1109/ACC.2002.1023226

Conference Location: Anchorage, AK, USA
