Abstract
A perception scheme for Reinforcement Learning (RL) is developed as a function approximator. The main motivation for the development of this scheme is the need for generalization when the problem to be solved has continuous state variables. We propose a solution to the generalization problem in RL algorithms using a k-nearest-neighbor pattern classification (k-NN). By means of the k-NN technique we investigate the effect of collective decision making as a mechanism of perception and action-selection and a sort of back-propagation of its proportional influence in the action-selection process as the factor that moderate the learning of each decision making unit. A very well known problem is presented as a case study to illustrate the results of this k-NN based perception scheme.
This work has been partially funded by the Spanish Ministry of Science and Technology, project DPI2006-15346-C03-02.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Reinforcement Learning. In: Sutton, R.S (ed.) SECS, vol. 173, Kluwer Academic Publishers (1992). Reprinted from of Machine Learning, vol. 8(3-4) (1992)
Sutton, R.S.: Reinforcement learning and artificial intelligence (2006)
Sutton, R.S.: Reinforcement learning architectures. In: Proc. Int. Symp. on Neural Information Processing, Kyushu Inst.of Technology, Japan (1992)
Sutton, R.S., Barto, A.G.: Reinforcement Learning. An Introduction. MIT Press, Cambridge (1998)
Cover, T.M., Hart, P.E.: Nearest neighbor pattern classification. IEEE Transactions on Information Theory, IT 13(1), 21–27 (1967)
Dudani, S.A.: The distance-weighted k-nearest-neighbor rule. IEEE Transactions on Systems, Man and Cybernetics SMC-6(4), 325–327 (1976)
Watkins, C.J., Dayan, P.: Technical note Q-learning. Machine Learning 8, 279 (1992)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Martín H., J.A., de Lope, J. (2007). A k-NN Based Perception Scheme for Reinforcement Learning. In: Moreno Díaz, R., Pichler, F., Quesada Arencibia, A. (eds) Computer Aided Systems Theory – EUROCAST 2007. EUROCAST 2007. Lecture Notes in Computer Science, vol 4739. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75867-9_18
Download citation
DOI: https://doi.org/10.1007/978-3-540-75867-9_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-75866-2
Online ISBN: 978-3-540-75867-9
eBook Packages: Computer ScienceComputer Science (R0)