Abstract
Research in learning from demonstrations has focused primarily on the question of how to utilize demonstrations to learn new behavior assuming that the demonstrator (teacher) explicitly teaches the learner. In this paper, we focus our attention on learning from unplanned demonstrations. In such cases, the learner has to take the initiative and decide for itself what actions of the—assumed unaware—demonstrator are to be learned. Moreover, the actions of the demonstrator are not pre-segmented and the learner needs to discover the boundaries of interesting behaviors without any help from the demonstrator. We propose a fluid imitation engine that augments a traditional LfD system. The proposed engine casts the problem as a well-defined constrained motif discovery problem subject to constraints that are driven from object and behavior saliency, as well as behavior relevance to the learner’s goals and abilities. Relation between perceived behaviors of the demonstrator and changes in objects in the environment is quantified using a change-causality test that is shown to provide better results compared to traditional g-causality tests. The main advantage of the proposed system is that it can naturally combine information from all available sources including low-level saliency measures and high-level goal-driven relevance constraints. The paper also reports a series of experiments to evaluate the utility of the proposed engine in learning navigation tasks with increasing complexity using both simulated and real world robots.
References
Abbeel P, Coates A, Ng aY (2010) Autonomous helicopter aerobatics through apprenticeship learning. Int J Robot Res 29(13):1608–1639
Aleotti J, Caselli S (2008) Grasp programming by demonstration: a task-based quality measure. In: The 17th IEEE international symposium on robot and human interactive communication, 2008. RO-MAN 2008. IEEE Press, New York, pp 383–388
Alippi C, Roveri M (2006) An adaptive CUSUM-based test for signal change detection. In: 2006 IEEE international symposium on circuits and systems, p 4
Antonelo E, Schrauwen B, Stroobandt D (2008) Imitation learning of an intelligent navigation system for mobile robots using reservoir computing. In: 10th Brazilian symposium on neural networks, 2008, SBRN’08, pp 93–98
Argall BD, Chernova S, Veloso M, Browning B (2009) A survey of robot learning from demonstration. Robot Auton Syst 57(5):1–15
Barakova EI, Vanderelst D (2011) From spreading of behavior to dyadic interaction—a robot learns what to imitate. Int J Intell Syst 26(3):228–245
Breazeal C, Buchsbaum D, Gray J, Gatenby D, Blumberg B (2005) Learning from and about others: towards using imitation to bootstrap the social understanding of others by robots. Artif Life 11(1–2):31–62
Catalano J, Armstrong T, Oates T (2006) Discovering patterns in real-valued time series. In: Knowledge discovery in databases: PKDD 2006, pp 462–469
Dautenhahn K, Ai-lab VUB (1995) Getting to know each other artificial social intelligence for autonomous robots. Robot Auton Syst 50(95):333–356
EPFL: http://www.e-puck.org
Minnen D, Starner T, Essa I, Isbell C (2007) Improving activity discovery with automatic neighborhood estimation. In: Int. joint conf. on artificial intelligence
Mohammad Y, Nishida T (2009) Constrained motif discovery in time series. New Gener Comput 27(4):319–346
Mohammad Y, Nishida T (2009) Robust singular spectrum transform. In: The twenty second international conference on industrial, engineering and other applications of applied intelligent systems IEA-AIE 2009, pp 123–132
Mohammad Y, Nishida T (2010) Incremental gesture discovery for interactive robots. In: 2010 IEEE international conference on robotics and biomimetics (ROBIO), pp 185–189
Mohammad Y, Nishida T (2010) Mining causal relationships in multidimensional time series. In: Szczerbicki E, Nguyen NT (eds) Smart information and knowledge management: advances, challenges, and critical issues, studies in computational intelligence. Springer, Berlin, pp 309–338
Mohammad Y, Nishida T (2011) Discovering causal change relationships between processes in complex systems. In: 2011 IEEE/SICE international symposium on system integration, pp 12–17
Mohammad Y, Nishida T (2011) On comparing SSA-based change point discovery algorithms. In: 2011 IEEE/SICE international symposium on system integration, pp 938–945
Mohammad Y, Nishida T, Okada S (2009) Unsupervised simultaneous learning of gestures, actions and their associations for human-robot interaction. In: Proceedings of the 2009 IEEE/RSJ international conference on intelligent robots and systems, IROS’09. IEEE Press, Piscataway, pp 2537–2544
Moskvina V, Zhigljavsky A (2003) An algorithm based on singular spectrum analysis for change-point detection. Commun Stat, Simul Comput 32(4):319–352
Nagai Y, Rohlfing KJ (2007) Can motionese tell infants and robots. What to imitate? In: 4th international symposium on imitation in animals and artifacts, pp 299–306
Page ES (1954) Continuous inspection schemes. Biometrika 44:100–115
Scassellati BB (1999) Knowing what to imitate and knowing when you succeed. In: AISB99 symposium on imitation in animals and artifacts, pp 105–113
V-REP: http://www.v-rep.eu/
Floyd MW, Esfandiari B, Lam K (2008) A case-based reasoning approach to imitating RoboCup players. In: AAAI
Acknowledgements
This work was partially supported by Kyoto University’s GCOE Project.
Author information
Authors and Affiliations
Corresponding author
Additional information
This work was partially supported by Kyoto University’s GCOE.
Rights and permissions
About this article
Cite this article
Mohammad, Y., Nishida, T. Fluid Imitation. Int J of Soc Robotics 4, 369–382 (2012). https://doi.org/10.1007/s12369-012-0153-z
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12369-012-0153-z