Statistical Learning for Humanoid Robots

Vijayakumar, Sethu; D'souza, Aaron; Shibata, Tomohiro; Conradt, Jörg; Schaal, Stefan

doi:10.1023/A:1013258808932

Statistical Learning for Humanoid Robots

Published: January 2002

Volume 12, pages 55–69, (2002)
Cite this article

Autonomous Robots Aims and scope Submit manuscript

Sethu Vijayakumar¹,
Aaron D'souza¹,
Tomohiro Shibata²,
Jörg Conradt³ &
…
Stefan Schaal¹

643 Accesses
52 Citations
3 Altmetric
Explore all metrics

Abstract

The complexity of the kinematic and dynamic structure of humanoid robots make conventional analytical approaches to control increasingly unsuitable for such systems. Learning techniques offer a possible way to aid controller design if insufficient analytical knowledge is available, and learning approaches seem mandatory when humanoid systems are supposed to become completely autonomous. While recent research in neural networks and statistical learning has focused mostly on learning from finite data sets without stringent constraints on computational efficiency, learning for humanoid robots requires a different setting, characterized by the need for real-time learning performance from an essentially infinite stream of incrementally arriving data. This paper demonstrates how even high-dimensional learning problems of this kind can successfully be dealt with by techniques from nonparametric regression and locally weighted learning. As an example, we describe the application of one of the most advanced of such algorithms, Locally Weighted Projection Regression (LWPR), to the on-line learning of three problems in humanoid motor control: the learning of inverse dynamics models for model-based control, the learning of inverse kinematics of redundant manipulators, and the learning of oculomotor reflexes. All these examples demonstrate fast, i.e., within seconds or minutes, learning convergence with highly accurate final peformance. We conclude that real-time learning for complex motor system like humanoid robots is possible with appropriately tailored algorithms, such that increasingly autonomous robots with massive learning abilities should be achievable in the near future.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

An, C.H., Atkeson, C., and Hollerbach, J. 1988. Model Based Control of a Robot Manipulator, MIT Press: Cambridge, MA.
Google Scholar
Atkeson, C., Moore, A., and Schaal, S. 1997. Locally weighted learning. Artificial Intelligence Review, 11:76–113.
Google Scholar
Bishop, C. 1995. Neural Networks for Pattern Recognition, Oxford University Press: London.
Google Scholar
Bullock, D., Grossberg, S., and Guenther, F.H. 1993. A selforganizing neural model of motor equivalent reaching and tool use by a multijoint arm. Journal of Cognitive Neuroscience, 5(4): 408–435.
Google Scholar
Cruse, H. and Brüwer, M. 1987. The human arm as a redundant manipulator: The control of path and joint angles. Biological Cybernetics, 57:137–144.
Google Scholar
Frank, I.E. and Friedman, J.H. 1993. A statistical view of some chemometric regression tools. Technometrics, 35:109–135.
Google Scholar
Jordan, M.I. and Rumelhart, D.E. 1992. Supervised learning with a distal teacher. Cognitive Science, 16(3):307–354.
Google Scholar
Kawato, M. 1990. Feedback-error-learning neural network for supervised motor learning. In Advanced Neural Computers, R. Eckmiller (Ed.), North-Holland/Elsevier: Amsterdam, pp. 365–372.
Google Scholar
Liegeois, A. 1977. Automatic supervisory control of the configuration and behavior of multibody mechnisms. IEEE Transactions on Systems, Man, and Cybernetics, 7(12):868–871.
Google Scholar
Ljung, L. and Soderstrom, T. 1986. Theory and Practice of Recursive Identification, MIT Press: Cambridge, MA.
Google Scholar
Sanger, T.D. 1989. Optimal unsupervised learning in a single layer liner feedforward neural network. Neural Networks, 2:459–473.
Google Scholar
Saunders, C., Stitson, M.O., Weston, J., Bottou, L., Schoelkopf, B., and Smola, A. 1998. Support vector machine—Reference manual. TR CSD-TR–98–03. Department of Computer Science, Royal Holloway, University of London.
Schaal, S. 1999. Is imitation learning the route to humanoid robots? Trends in Cognitive Sciences, 3:233–242.
Google Scholar
Schaal, S. and Atkeson, C.G. 1998. Constructive incremental learning from only local information. Neural Comp. 10:2047–2084.
Google Scholar
Schaal, S., Atkeson, C.G., and Vijayakumar, S. 2000. Real-time robot learning with locally weighted statistical learning. In Proc. International Conference on Robotics and Automation ICRA2000, pp. 288–293.
Schaal, S., Vijayakumar, S., and Atkeson, C.G. 1998. Local dimensionality reduction. Proc. Neural Information Processing Systems, 10:633–639.
Google Scholar
Shibata, T. and Schaal, S. 2001. Biomimetic gaze stabilization based on feedback-error-learning with nonparametric regression networks. Neural Networks, 14(2):201–216.
Google Scholar
Slotine, J.E. and Li, W. 1991. Applied Nonlinear Control, Prentice Hall: Englewood cliffs, NJ.
Google Scholar
Tevatia, G. and Schaal, S. 2000. Inverse kinematics for humanoid robots. In Proceedings of the International Conference on Robotics and Automation (ICRA2000), San Francisco, CA.
Vapnik, V. 1995. The Nature of Statistical Learning Theory, Springer: New York.
Google Scholar
Vijayakumar, S. and Schaal, S. 2000. Locally weighted projection regression: An O(n) algorithm for incremental real time learning in high dimensional space. In Proc. International Conference on Machine Learning ICML2000, pp. 1079–1086.
Wold, H. 1975. Soft modeling with latent variables: The nonlinear iterative partial least squares approach. Perspectives in Probability and Statistics: Papers in Honor of M.S. Bartlett, pp. 114–142. Academic Press: London.
Google Scholar

Download references

Author information

Authors and Affiliations

Computer Science & Neuroscience and Kawato Dynamic Brain Project, University of Southern California, Los Angeles, CA, 90089-2520, USA
Sethu Vijayakumar, Aaron D'souza & Stefan Schaal
Kawato Dynamic Brain Project, ERATO, Japan Science & Technology Corp., Kyoto, 619-0288, Japan
Tomohiro Shibata
University/ETH Zurich, Winterthurerstr. 190, CH-8057, Zurich, Switzerland
Jörg Conradt

Authors

Sethu Vijayakumar
View author publications
You can also search for this author in PubMed Google Scholar
Aaron D'souza
View author publications
You can also search for this author in PubMed Google Scholar
Tomohiro Shibata
View author publications
You can also search for this author in PubMed Google Scholar
Jörg Conradt
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Schaal
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Vijayakumar, S., D'souza, A., Shibata, T. et al. Statistical Learning for Humanoid Robots. Autonomous Robots 12, 55–69 (2002). https://doi.org/10.1023/A:1013258808932

Download citation

Issue Date: January 2002
DOI: https://doi.org/10.1023/A:1013258808932

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Statistical Learning for Humanoid Robots

Abstract

Access this article

Similar content being viewed by others

Hybrid Control for Learning Motor Skills

Probabilistic Learning

Incremental statistical learning for integrating motion primitives and language in humanoid robots

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Navigation

Statistical Learning for Humanoid Robots

Abstract

Access this article

Similar content being viewed by others

Hybrid Control for Learning Motor Skills

Probabilistic Learning

Incremental statistical learning for integrating motion primitives and language in humanoid robots

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation