Machine Learning Proceedings 1995

Machine Learning Proceedings 1995

Proceedings of the Twelfth International Conference on Machine Learning, Tahoe City, California, July 9–12, 1995
1995, Pages 387-395
Machine Learning Proceedings 1995

Instance-Based Utile Distinctions for Reinforcement Learning with Hidden State

https://doi.org/10.1016/B978-1-55860-377-6.50055-4Get rights and content

Abstract

We present Utile Suffix Memory, a reinforcement learning algorithm that uses short-term memory to overcome the state aliasing that results from hidden state. By combining the advantages of previous work in instance-based (or “memory- based”) learning and previous work with statistical tests for separating noise from task structure, the method learns quickly, creates only as much memory as needed for the task at hand, and handles noise well.

Utile Suffix Memory uses a tree-structured representation, and is related to work on Prediction Suffix Trees [Ron et al., 1994], Parti-game [Moore, 1993], G-algorithm [Chapman and Kaelbling, 1991], and Variable Resolution Dynamic Programming [Moore, 1991].

References (0)

Cited by (0)

View full text