A Dynamic Bayesian Network Framework for Learning from Observation

Ontañón, Santiago; Montaña, José Luis; Gonzalez, Avelino J.

doi:10.1007/978-3-642-40643-0_38

A Dynamic Bayesian Network Framework for Learning from Observation

Santiago Ontañón²⁶,
José Luis Montaña²⁷ &
Avelino J. Gonzalez²⁸

Conference paper

1631 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8109))

Abstract

Learning from Observation (a.k.a. learning from demonstration) studies how computers can learn to perform complex tasks by observing and thereafter imitating the performance of an expert. Most work on learning from observation assumes that the behavior to be learned can be expressed as a state-to-action mapping. However most behaviors of interest in real applications of learning from observation require remembering past states. We propose a Dynamic Bayesian Network approach to learning from observation that addresses such problem by assuming the existence of non-observable states.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Argall, B.D., Chernova, S., Veloso, M., Browning, B.: A survey of robot learning from demonstration. Robot. Auton. Syst. 57, 469–483 (2009)
Article Google Scholar
Bauer, M.A.: Programming by examples. Artificial Intelligence 12(1), 1–21 (1979)
Article MATH Google Scholar
Bengio, Y., Frasconi, P.: Input/output hmms for sequence processing. IEEE Transactions on Neural Networks 7, 1231–1249 (1996)
Article Google Scholar
Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal Statistical Society, Series B 39(1), 1–38 (1977)
MathSciNet MATH Google Scholar
Fernlund, H.K.G., Gonzalez, A.J., Georgiopoulos, M., DeMara, R.F.: Learning tactical human behavior through observation of human performance. IEEE Transactions on Systems, Man, and Cybernetics, Part B 36(1), 128–140 (2006)
Article Google Scholar
Floyd, M.W., Esfandiari, B., Lam, K.: A case-based reasoning approach to imitating robocup players. In: Proceedings of the Twenty-First International Florida Artificial Intelligence Research Society (FLAIRS), pp. 251–256 (2008)
Google Scholar
Ghahramani, Z.: Learning dynamic Bayesian networks. In: Caianiello, E.R. (ed.) Adaptive Processing of Sequences and Data Structures, International Summer School on Neural Networks. Tutorial Lectures, pp. 168–197. Springer, London (1998)
Chapter Google Scholar
Lozano-Pérez, T.: Robot programming. Proceedings of IEEE 71, 821–841 (1983)
Article Google Scholar
Moriarty, C.L., Gonzalez, A.J.: Learning human behavior from observation for gaming applications. In: FLAIRS Conference (2009)
Google Scholar
Ng, A.Y., Russell, S.: Algorithms for Inverse Reinforcement Learning. In: in Proc. 17th International Conf. on Machine Learning, pp. 663–670 (2000)
Google Scholar
Ontañón, S., Mishra, K., Sugandh, N., Ram, A.: On-line case-based planning. Computational Intelligence Journal 26(1), 84–119 (2010)
Article Google Scholar
Papoulis, A., Pillai, S.U.: Probability, Random Variables, and Stochastic Processes. McGraw-Hill Series in Electrical and Computer Engineering. McGraw-Hill (2002)
Google Scholar
Pomerleau, D.: Alvinn: An autonomous land vehicle in a neural network. In: Touretzky, D.S. (ed.) Advances in Neural Information Processing Systems, vol. 1. Morgan Kaufmann (1989)
Google Scholar
Rabiner, L.R.: A tutorial on hidden markov models and selected applications in speech recognition. Proceedings of the IEEE, 257–286 (1989)
Google Scholar
Sammut, C., Hurst, S., Kedzier, D., Michie, D.: Learning to fly. In: Proceedings of the Ninth International Workshop on Machine Learning (ML 1992), pp. 385–393 (1992)
Google Scholar
Schaal, S.: Learning from demonstration. In: NIPS, pp. 1040–1046 (1996)
Google Scholar
Sidani, T.: Automated Machine Learning from Observation of Simulation. Ph.D. thesis, University of Central Florida (1994)
Google Scholar

Download references

Author information

Authors and Affiliations

Drexel University, Philadelphia, PA, USA, 19104
Santiago Ontañón
University of Cantabria, Santander, Spain
José Luis Montaña
University of Central Florida, Orlando, FL, USA
Avelino J. Gonzalez

Authors

Santiago Ontañón
View author publications
You can also search for this author in PubMed Google Scholar
José Luis Montaña
View author publications
You can also search for this author in PubMed Google Scholar
Avelino J. Gonzalez
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Universidad Politécnica de Madrid, 28660, Madrid, Spain
Concha Bielza
Universidad de Almería, 04120, Almería, Spain
Antonio Salmerón
Universdad de A Coruña, 15071, A Coruña, Spain
Amparo Alonso-Betanzos
Universidad Complutense de Madrid, 28040, Madrid, Spain
J. Ignacio Hidalgo
Universidad de Jaén, 23071, Jaén, Spain
Luis Martínez
Universidad Pablo de Olavide, 41013, Sevilla, Spain
Alicia Troncoso
Universidad de Salamanca, 37008, Salamanca, Spain
Emilio Corchado & Juan M. Corchado &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ontañón, S., Montaña, J.L., Gonzalez, A.J. (2013). A Dynamic Bayesian Network Framework for Learning from Observation. In: Bielza, C., et al. Advances in Artificial Intelligence. CAEPIA 2013. Lecture Notes in Computer Science(), vol 8109. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40643-0_38

Download citation

DOI: https://doi.org/10.1007/978-3-642-40643-0_38
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40642-3
Online ISBN: 978-3-642-40643-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics