Skip to main content

A Novel Feature Sparsification Method for Kernel-Based Approximate Policy Iteration

  • Conference paper
  • 2639 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7367))

Abstract

In this paper, we present a novel feature sparsification approach for a class of kernel-based approximate policy iteration algorithms called KLSPI. We firstly introduce the relative approximation error in the sparsification process based on the approximate linear dependence (ALD) analysis. The relative approximation error is used as the criterion for selecting the kernel-based features. An improved KLSPI algorithm is also proposed by integrating the new sparsification method with KLSPI. Experimental results on the Inverted Pendulum problem demonstrate that the proposed sparsification method can obtain a smaller size of kernel dictionary than the previous ALD method. Furthermore, by using the more representative samples as the kernel dictionary, the precision of value function approximation has been increased. The improved KLSPI algorithm can also achieve better learning efficiency and policy quality than the original one. The feasibility and validity of the new method are proven.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Sutton, R., Barto, A.: Reinforcement learning:an introduction. MIT Press, Cambridge (1998)

    Google Scholar 

  2. Boyan, J.: Technical update: Least-squares temporal difference learning. Mach. Learn. 49(2-3), 233–246 (2002)

    Article  MATH  Google Scholar 

  3. Xu, X., et al.: Kernel-based Least-Squares Policy Iteration for Reinforcement Learning. IEEE Trans. on Neural Networks 18, 973–992 (2007)

    Article  Google Scholar 

  4. Xu, X., et al.: Efficient Reinforcement Learning Using Recursive Least-Squares Methods. Journal of Artificial Intelligence Research 16, 259–292 (2002)

    MathSciNet  MATH  Google Scholar 

  5. Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines. Cambridge Univ. Press, Cambridge (2000)

    Google Scholar 

  6. Kaelbling, L.P., et al.: Reinforcement learning: A survey. J. Artif. Intell. Res. 4, 237–285 (1996)

    Google Scholar 

  7. Lagoudakis, M.G., Parr, R.: Least-squares policy iteration. J.Mach. Learn. Res. 4, 1107–1149 (2003)

    MathSciNet  Google Scholar 

  8. Ormoneit, S.: Kernel-based reinforcement learning. Machine Learning 49(2), 161–178 (2002)

    Article  MathSciNet  MATH  Google Scholar 

  9. Schölkopf, B., Smola, A.: Learning With Kernels. MIT Press, Cambridge (2002)

    Google Scholar 

  10. Tsitsiklis, J.N., Roy, B.V.: An analysis of temporal difference learning with function approximation. IEEE Trans. Autom. Control. 42(5), 674–690 (1997)

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Huang, Z., Liu, C., Xu, X., Lian, C., Wu, J. (2012). A Novel Feature Sparsification Method for Kernel-Based Approximate Policy Iteration. In: Wang, J., Yen, G.G., Polycarpou, M.M. (eds) Advances in Neural Networks – ISNN 2012. ISNN 2012. Lecture Notes in Computer Science, vol 7367. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31346-2_28

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-31346-2_28

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-31345-5

  • Online ISBN: 978-3-642-31346-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics