Learning from Demonstration Using Variational Bayesian Inference

Hussein, Mostafa; Mohammed, Yasser; Ali, Samia A.

doi:10.1007/978-3-319-19066-2_36

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 9101)

Included in the following conference series:

International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems

Abstract

Learning from demonstration (LfD) is an active area of research in robotics, and many approaches to it exist. One of the most widely used combines Gaussian Mixture Model learning for modeling with Gaussian Mixture Regression for behavior generation (GMM/GMR). Its advantages include easy learning using Expectation Maximization, the simplicity of serializing learned behaviors, and the ability to model internal correlations and constraints within the task.
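To make the GMR step concrete, the following is a minimal sketch of behavior generation from an already fitted GMM over joint (time, output) samples. It is not the authors' implementation: the component parameters in COMPONENTS are hand-set, hypothetical values standing in for what EM would estimate from demonstrations, and the one-dimensional input and output are an assumption made to keep the example short.

```python
import math

# Hypothetical, hand-set GMM over joint (t, y) samples. In practice these
# values would be estimated with EM from demonstration data.
# Each component: weight, means (mu_t, mu_y), covariance (s_tt, s_ty, s_yy).
COMPONENTS = [
    {"w": 0.5, "mu_t": 0.25, "mu_y": 0.0, "s_tt": 0.02, "s_ty": 0.01, "s_yy": 0.02},
    {"w": 0.5, "mu_t": 0.75, "mu_y": 1.0, "s_tt": 0.02, "s_ty": 0.01, "s_yy": 0.02},
]


def gaussian_pdf(x, mean, var):
    """Density of a one-dimensional Gaussian."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def gmr(t, components):
    """Return E[y | t] under the joint GMM (Gaussian Mixture Regression)."""
    # Unnormalized responsibility of each component for the input t.
    resp = [c["w"] * gaussian_pdf(t, c["mu_t"], c["s_tt"]) for c in components]
    total = sum(resp)
    y = 0.0
    for r, c in zip(resp, components):
        # Conditional mean of y given t for this Gaussian component.
        cond_mean = c["mu_y"] + c["s_ty"] / c["s_tt"] * (t - c["mu_t"])
        y += (r / total) * cond_mean
    return y


# Generate a trajectory by querying the model on a regular time grid.
trajectory = [gmr(i / 10.0, COMPONENTS) for i in range(11)]
```

The off-diagonal term s_ty is what lets each component encode a local correlation between input and output; with it set to zero, GMR would reduce to a responsibility-weighted blend of the component means.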

A critical parameter that affects the accuracy of the learned behavior in GMM/GMR is the number of components in the mixture. A handful of approaches for selecting this number can be found in the literature, including classical model selection methods such as the Bayesian Information Criterion and the Akaike Information Criterion, and more advanced methods such as Dirichlet Process modeling. These approaches are either wasteful of computational resources or hard to implement. This paper introduces an LfD approach that uses GMM with variational Bayesian inference (VB) to select the number of Gaussians that best fits the data. The proposed method is compared to classical model selection approaches and to a recently proposed symbolization-based method, and is shown to provide an appropriate balance between execution speed and model accuracy.
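For reference, the classical baselines named above can be sketched as follows. This is an illustration of standard BIC/AIC scoring for a full-covariance GMM, not code from the paper; the function names and the log-likelihood values in fits are hypothetical. Note that the scheme needs one complete EM fit per candidate number of components, which is plausibly the computational cost the abstract refers to.

```python
import math


def gmm_num_params(k, d):
    """Free parameters of a k-component, d-dimensional full-covariance GMM."""
    weights = k - 1                       # mixing weights sum to one
    means = k * d
    covariances = k * d * (d + 1) // 2    # symmetric d x d matrix per component
    return weights + means + covariances


def bic(log_likelihood, k, d, n):
    """Bayesian Information Criterion (lower is better)."""
    return gmm_num_params(k, d) * math.log(n) - 2.0 * log_likelihood


def aic(log_likelihood, k, d):
    """Akaike Information Criterion (lower is better)."""
    return 2.0 * gmm_num_params(k, d) - 2.0 * log_likelihood


def select_k(log_likelihoods, d, n, criterion="bic"):
    """Pick the candidate k with the lowest score.

    log_likelihoods maps each candidate k to the log-likelihood of a GMM
    that has already been fitted with that many components.
    """
    def score(k):
        if criterion == "bic":
            return bic(log_likelihoods[k], k, d, n)
        return aic(log_likelihoods[k], k, d)

    return min(log_likelihoods, key=score)


# Hypothetical log-likelihoods from four separate EM runs on n = 200
# two-dimensional samples (illustrative values, not results from the paper).
fits = {1: -520.0, 2: -410.0, 3: -405.0, 4: -403.0}
best_k = select_k(fits, d=2, n=200)
```

A variational Bayesian treatment avoids this outer loop over candidates: a single model is fitted with a generous number of components, and components that the data do not support are left with negligible mixing weights.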

Author information

Authors and Affiliations

Assiut University, Asyut, Egypt
Mostafa Hussein, Yasser Mohammed & Samia A. Ali

Corresponding author

Correspondence to Mostafa Hussein.

Editor information

Editors and Affiliations

Texas State University, San Marcos, Texas, USA
Moonis Ali
Dongguk University, Seoul, South Korea
Young Sig Kwon
Dongguk University, Seoul, South Korea
Chang-Hwan Lee
Dongguk University, Seoul, South Korea
Juntae Kim
Seoul National University, Seoul, South Korea
Yongdai Kim

About this paper

Cite this paper

Hussein, M., Mohammed, Y., Ali, S.A. (2015). Learning from Demonstration Using Variational Bayesian Inference. In: Ali, M., Kwon, Y., Lee, CH., Kim, J., Kim, Y. (eds) Current Approaches in Applied Artificial Intelligence. IEA/AIE 2015. Lecture Notes in Computer Science, vol 9101. Springer, Cham. https://doi.org/10.1007/978-3-319-19066-2_36

DOI: https://doi.org/10.1007/978-3-319-19066-2_36
Published: 01 May 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-19065-5
Online ISBN: 978-3-319-19066-2
eBook Packages: Computer Science, Computer Science (R0)