A Novel Feature Sparsification Method for Kernel-Based Approximate Policy Iteration

Huang, Zhenhua; Liu, Chunming; Xu, Xin; Lian, Chuanqiang; Wu, Jun

doi:10.1007/978-3-642-31346-2_28

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 7367)

Abstract

In this paper, we present a novel feature sparsification approach for a class of kernel-based approximate policy iteration algorithms called KLSPI. We first introduce the relative approximation error into the sparsification process, building on approximate linear dependence (ALD) analysis, and use it as the criterion for selecting kernel-based features. Integrating the new sparsification method with KLSPI yields an improved KLSPI algorithm. Experimental results on the Inverted Pendulum problem demonstrate that the proposed method obtains a smaller kernel dictionary than the previous ALD method. Furthermore, because more representative samples make up the kernel dictionary, the precision of value function approximation increases. The improved KLSPI algorithm also achieves better learning efficiency and policy quality than the original one. These results support the feasibility and validity of the new method.
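
For readers unfamiliar with ALD-based sparsification, the following is a minimal sketch of the standard ALD dictionary test that the abstract builds on, not the authors' implementation. A sample joins the dictionary when its kernel feature cannot be approximated by a linear combination of the current dictionary features to within a threshold. The abstract does not define the relative approximation error, so the `relative=True` option (dividing the ALD error by k(x, x)) is only one plausible reading and is an assumption. The names `rbf_kernel` and `ald_dictionary`, the Gaussian kernel, the threshold value, and the random test data are likewise illustrative choices.

```python
import numpy as np


def rbf_kernel(x, y, width=1.0):
    """Gaussian (RBF) kernel; the width is an illustrative choice."""
    diff = np.asarray(x, dtype=float) - np.asarray(y, dtype=float)
    return float(np.exp(-np.dot(diff, diff) / (2.0 * width ** 2)))


def ald_dictionary(samples, kernel=rbf_kernel, threshold=0.1, relative=False):
    """Build a kernel dictionary with the approximate linear dependence test.

    A sample is added when the squared error of approximating its feature
    vector by the dictionary features exceeds `threshold`.
    ASSUMPTION: with relative=True the error is divided by k(x, x). This
    normalisation is a guess at a "relative" criterion, not the paper's
    definition, and it requires k(x, x) > 0.
    """
    dictionary = []
    for x in samples:
        x = np.asarray(x, dtype=float)
        if not dictionary:
            dictionary.append(x)  # the first sample always enters
            continue
        # Kernel matrix of the dictionary and kernel vector against x.
        K = np.array([[kernel(a, b) for b in dictionary] for a in dictionary])
        k_vec = np.array([kernel(a, x) for a in dictionary])
        # Best coefficients c = K^{-1} k_vec (small jitter for stability).
        coeffs = np.linalg.solve(K + 1e-10 * np.eye(len(dictionary)), k_vec)
        kxx = kernel(x, x)
        delta = kxx - float(k_vec @ coeffs)  # ALD approximation error
        if relative:
            delta /= kxx
        if delta > threshold:
            dictionary.append(x)
    return dictionary


# Usage on hypothetical 2-D state samples.
rng = np.random.default_rng(0)
samples = rng.uniform(-1.0, 1.0, size=(200, 2))
dictionary = ald_dictionary(samples, threshold=0.1)
print(len(dictionary), "of", len(samples), "samples kept")
```

With a Gaussian kernel, k(x, x) = 1 for every sample, so the relative and absolute tests in this sketch coincide; the distinction only matters for kernels whose self-similarity varies across the state space.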

Author information

Authors and Affiliations

Institute of Automation, National University of Defense Technology, Changsha, 410073, P.R. China
Zhenhua Huang, Chunming Liu, Xin Xu, Chuanqiang Lian & Jun Wu

Editor information

Editors and Affiliations

Department of Mechanical & Automation Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong
Jun Wang
School of Electrical and Computer Engineering, Oklahoma State University, 74078, Stillwater, OK, USA
Gary G. Yen
Department of Electrical and Computer Engineering, University of Cyprus, 75 Kallipoleos Avenue, 1678, Nicosia, Cyprus
Marios M. Polycarpou

About this paper

Cite this paper

Huang, Z., Liu, C., Xu, X., Lian, C., Wu, J. (2012). A Novel Feature Sparsification Method for Kernel-Based Approximate Policy Iteration. In: Wang, J., Yen, G.G., Polycarpou, M.M. (eds) Advances in Neural Networks – ISNN 2012. ISNN 2012. Lecture Notes in Computer Science, vol 7367. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31346-2_28

DOI: https://doi.org/10.1007/978-3-642-31346-2_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31345-5
Online ISBN: 978-3-642-31346-2
eBook Packages: Computer Science, Computer Science (R0)
