Architecture of Knowledge-based Function Approximator

Osman, Hassab Elgawi

doi:10.1007/978-1-84882-171-2_27

Hassab Elgawi Osman⁴

Included in the following conference series:

International Conference on Innovative Techniques and Applications of Artificial Intelligence

473 Accesses

Abstract

This paper proposes a new architecture to build a hybrid value function estimation based on a combination of temporal-different (TD) and on-line variant of Random Forest (RF). We call this implementation Random-TD. The approach iteratively improves its value function by exploiting only relevant parts of action space. We evaluate the potential of the proposed procedure in terms of a reduction in the Bellman error. The results demonstrate that our approach can significantly improve the performance of TD methods and speed up learning process.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Achieving data-driven actionability by combining learning and planning

Article 07 February 2018

Efficient Deep Reinforcement Learning via Policy-Extended Successor Feature Approximator

Overview of Machine Learning Algorithms

References

Sutton, R., & Barto, A.: “Reinforcement Learning: An introduction,” Cambring, MA: MIT Press, 1998.
Google Scholar
Leo Breiman.: “Random Forests,” Machine Learning, 45(1):5.32, 2001.
Article Google Scholar
Hassab Elgawi Osman.: “Online Random Forests based on CorrFS and CorrBE,” In Proc.IEEE workshop on online classification, CVPR, 2008.
Google Scholar
Guvenir, H. A., & Uysal, I.: Bilkent University function approximation repository, 2000. Available online at http://funapp.cs.bilkent.edu.tr/DataSets/.
Google Scholar

Download references

Author information

Authors and Affiliations

Computational Intelligence and Systems Science, Tokyo Institute of Technology, Tokyo, Japan
Hassab Elgawi Osman

Authors

Hassab Elgawi Osman
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Technology, University of Portsmouth, Portsmouth, UK
Max Bramer BSc, PhD, CEng, CITP, FBCS, FIET, FRSA, FHEA
University of Greenwich, UK
Miltos Petridis DipEng, MBA, PhD, MBCS, AMBA
Department of Computer Science, University of Liverpool, Liverpool, UK
Frans Coenen BSc, PhD

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Osman, H.E. (2009). Architecture of Knowledge-based Function Approximator. In: Bramer, M., Petridis, M., Coenen, F. (eds) Research and Development in Intelligent Systems XXV. SGAI 2008. Springer, London. https://doi.org/10.1007/978-1-84882-171-2_27

Download citation

DOI: https://doi.org/10.1007/978-1-84882-171-2_27
Publisher Name: Springer, London
Print ISBN: 978-1-84882-170-5
Online ISBN: 978-1-84882-171-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics