A KBRL Inference Metaheuristic with Applications

Bucur, Laurentiu; Florea, Adina; Chera, Catalin

doi:10.1007/978-3-642-29694-9_27

Laurentiu Bucur²,
Adina Florea² &
Catalin Chera²

Part of the book series: Studies in Computational Intelligence ((SCI,volume 427))

4496 Accesses

Abstract

In this chapter we propose an inference metaheuristic for Kernel-Based Reinforcement Learning (KBRL) agents – agents that operate in a continuous-state MDP. The metaheuristic is proposed in the simplified case of greedy policy RL agents with no receding horizon which perform online learning in an environment where feedback is generated by an ergodic and stationary source. We propose two inference strategies: isotropic discrete choice and anisotropic optimization, the former focused on speed and the latter focused on generalization capability. We cast the problem of classification as a RL problem and test the proposed metaheuristic in two experiments: an image recognition experiment on the Yale Faces database and a synthetic data set experiment. We propose a set of inference filters which increase the vigilance of the agent and show that they can prevent the agent from taking erroneous actions in an unknown environment. Two parallel inference algorithms are tested and illustrated in a cluster and GPU implementation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Knowledge Gradient for Online Reinforcement Learning

An efficient hybrid multilayer perceptron neural network with grasshopper optimization

Article 30 July 2018

Assessing Policy, Loss and Planning Combinations in Reinforcement Learning Using a New Modular Architecture

References

GEEA – Centru de resurse GRID multi-corE de înalta pErformAnta pentru suportul cercetarii, http://cluster.grid.pub.ro/index.php/projects/projects-geea/
The OpenCL programming model, http://www.ks.uiuc.edu/Research/gpu/files/upcrc_opencl_lec1.pdf
Bucur, L.: The FCINT Computer Vision System (Software, 2011f), http://www.fcint.ro/portal/service/FCINT_ComputerVisionSystem/FCINT_ComputerVision.zip
Ormoneit, D., Sen, S.: Kernel-Based Reinforcement Learning. Machine Learning 49, 161–178 (2002)
Article MATH Google Scholar
Jong, N.K., Stone, P.: Kernel-Based Models for Reinforcement Learning. In: The ICML 2006 Workshop on Kernel Methods in Reinforcement Learning (June 2006)
Google Scholar
Bernstein, A., Shimkin, N.: Adaptive-resolution reinforcement learning with polynomial exploration in deterministic domains. Machine Learning 81(3), 359–397
Google Scholar
Kaelbing, L.P., Littman, M.L., Moore, A.: Reinforcement Learning: A Survey. Journal of Artificial Intelligence Research 4, 237–285 (1996)
Google Scholar
Brox, T., Rosenhahn, B., Cremers, D., Seidel, H.-P.: Nonparametric Density Estimation with Adaptive, Anisotropic Kernels for Human Motion Tracking. In: Elgammal, A., Rosenhahn, B., Klette, R. (eds.) Human Motion 2007. LNCS, vol. 4814, pp. 152–165. Springer, Heidelberg (2007)
Chapter Google Scholar
Taylor, J.S., Cristianini, N.: Kernel Methods for Pattern Analysis. Cambridge University Press (2004) ISBN 978-0-521-81396-6
Google Scholar
Bucur, L.: Experimental data and software for the Original Yale Faces image recognition experiment, https://docs.google.com/uc?id=0B7VYFkQ0d6D-OTU2NDExNjUtODNkNS00ZDFjLWI5OWItNTFhZTNkNzU3YTE0&export=download&authkey=COHq0rkJ&hl=en
The Extended Yale Faces Database, http://vision.ucsd.edu/~leekc/ExtYaleDatabase/ExtYaleB.html
Bucur, L.: Image recognition data sets and software for the HPC KBRL image recognition experiment, https://docs.google.com/leaf?id=0B7VYFkQ0d6D-Zjg0N2RmNTEtNjYxNS00NDgxLWIzYjUtZTcyM2Q5OGU0NmJh&hl=en_US
Bucur, L.: The FCINT Computer Vision System, http://www.fcint.ro/portal/service/FCINT_ComputerVisionSystem/FCINT_ComputerVision.zip
NVIDIA Corporation GPU Computing SDK, http://developer.nvidia.com/gpu-computing-sdk
NVIDIA GeForce 210 Technical specifications, http://www.nvidia.com/object/product_geforce_210_us.html
The OpenCV Library, http://opencv.willowgarage.com/wiki/

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, POLITEHNICA University of Bucharest, Splaiul Independenţei nr. 313, 060042, Bucharest, Romania
Laurentiu Bucur, Adina Florea & Catalin Chera

Authors

Laurentiu Bucur
View author publications
You can also search for this author in PubMed Google Scholar
Adina Florea
View author publications
You can also search for this author in PubMed Google Scholar
Catalin Chera
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Laurentiu Bucur .

Editor information

Editors and Affiliations

, Mathematics and Scientific Computing, National Physical Laboratory, Hampton Road, Teddington, TW11 0LW, United Kingdom
Xin-She Yang

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Bucur, L., Florea, A., Chera, C. (2013). A KBRL Inference Metaheuristic with Applications. In: Yang, XS. (eds) Artificial Intelligence, Evolutionary Computing and Metaheuristics. Studies in Computational Intelligence, vol 427. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29694-9_27

Download citation

DOI: https://doi.org/10.1007/978-3-642-29694-9_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-29693-2
Online ISBN: 978-3-642-29694-9
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics