Simultaneous learning of hierarchy and primitives for complex robot tasks

Mohseni-Kabir, Anahita; Li, Changshuo; Wu, Victoria; Miller, Daniel; Hylak, Benjamin; Chernova, Sonia; Berenson, Dmitry; Sidner, Candace; Rich, Charles

doi:10.1007/s10514-018-9749-y

Simultaneous learning of hierarchy and primitives for complex robot tasks

Published: 30 April 2018

Volume 43, pages 859–874, (2019)
Cite this article

Autonomous Robots Aims and scope Submit manuscript

Anahita Mohseni-Kabir¹,
Changshuo Li²,
Victoria Wu²,
Daniel Miller²,
Benjamin Hylak²,
Sonia Chernova³,
Dmitry Berenson⁴,
Candace Sidner² &
…
Charles Rich ORCID: orcid.org/0000-0001-8892-3333²

1547 Accesses
12 Citations
1 Altmetric
Explore all metrics

Abstract

We present a new interaction paradigm for robot learning from demonstration, called simultaneous learning of hierarchy and primitives (SLHAP), in which information about hierarchy and primitives is naturally interleaved in a single, coherent demonstration session. A key innovation in the new paradigm is the human demonstrator’s narration of primitives as he executes them, which allows the system to identify the boundaries between primitives. Hierarchy is represented using hierarchical task networks; motion planning constraints on the primitives are represented using task space regions. We implemented SLHAP on an autonomous robot and produced an interaction video illustrating its effectiveness learning a complex task with five levels of hierarchy and eight types of primitives. The underlying algorithms which make SLHAP possible are described and evaluated.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Notes

Niekum et al. (2015) learn both low-level motion trajectories and high-level tasks (as state machines) from demonstration, but the high-level tasks are not hierarchical.
The human’s head pose is only used to control where the robot “looks”; it is not part of the task learning process.
Our speech recognition and understanding is not general-purpose; we use a push-to-talk button operated offscreen and a predefined grammar for the human utterances. Solutions to these limitations are beyond the scope of this work.
The inputs of a task are the target and reference objects. The output of a task is any object whose properties, such as location, are changed by the task.
Reusable by the human; retargeting the primitive for the robot is addressed by the TSR constraint learning subcomponent.
It is clear that this solution will not work for all possible manipulation primitives, and therefore needs further investigation. In learning theory, this relates to the issue of automatic feature selection. Our algorithm for identifying the primitives only targets tasks with one target and one reference object. In future work, we plan to extend our work to learn tasks with multiple target and reference objects.
We also recorded motion data for a cup retrieval task to specifically evaluate the pose constraint learning—see Li and Berenson (2016).

References

Akgun, B., Cakmak, M., Jiang, K., & Thomaz, A. L. (2012). Keyframe-based learning from demonstration. International Journal of Social Robotics, 4(4), 343–355.
Article Google Scholar
Argall, B. D., Chernova, S., Veloso, M., & Browning, B. (2009). A survey of robot learning from demonstration. Robotics and Autonomous Systems, 57(5), 469–483.
Article Google Scholar
Baisero, A., Mollard, Y., Lopes, M., Toussaint, M., & Lutkebohle, I. (2015). Temporal segmentation of pair-wise interaction phases in sequential manipulation demonstrations. In IROS.
Berenson, D., Srinivasa, S. S., & Kuffner, J. (2011). Task space regions: A framework for pose-constrained manipulation planning. The International Journal of Robotics Research, 30, 1435–1460.
Article Google Scholar
Cakmak, M., Chao, C., & Thomaz, A. L. (2010). Designing interactions for robot active learners. IEEE Transactions on Autonomous Mental Development, 2(2), 108–118.
Article Google Scholar
Cakmak, M., & Thomaz, A. L. (2012). Designing robot learners that ask good questions. In ACM/IEEE international conference on human–robot interaction (pp. 17–24). ACM.
Calinon, S., Guenter, F., & Billard, A. (2007). On learning, representing, and generalizing a task in a humanoid robot. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 37(2), 286–298.
Article Google Scholar
Chernova, S., & Thomaz, A. L. (2014). Robot learning from human teachers. Synthesis Lectures on Artificial Intelligence and Machine Learning, 8(3), 1–121.
Article Google Scholar
Chiappa, S., & Peters, J. R. (2010). Movement extraction by detecting dynamics switches and repetitions. In Advances in neural information processing systems (pp. 388–396).
Erol, K., Hendler, J., & Nau, D. S. (1994). HTN planning: Complexity and expressivity. AAAI, 94, 1123–1128.
Google Scholar
Garland, A., Ryall, K., & Rich, C. (2001). Learning hierarchical task models by defining and refining examples. In International conference on knowledge capture (pp. 44–51).
Hayes, B., & Scassellati, B. (2014). Discovering task constraints through observation and active learning. In IEEE/RSJ international conference on intelligent robots and systems.
Hsu, D., Jiang, T., Reif, J., & Sun, Z. (2003). The bridge test for sampling narrow passages with probabilistic roadmap planners. In ICRA.
Huffman, S. B., & Laird, J. E. (1995). Flexibly instructable agents. Journal of Artificial Intelligence Research, 3, 271–324.
Article MATH Google Scholar
Konidaris, G. (2016). Constructing abstraction hierarchies using a skill-symbol loop. In IJCAI: Proceedings of the conference (p. 1648), NIH Public Access.
Kulic, D., Lee, D., Ott, C., & Nakamura, Y. (2008). Incremental learning of full body motion primitives for humanoid robots. In 8th IEEE-RAS international conference on humanoid robots, 2008. Humanoids 2008 (pp. 326–332). IEEE.
Levy-leduc, C., & Harchaoui, Z. (2008). Catching change-points with lasso. In Advances in neural information processing systems (pp. 617–624).
Li, C., & Berenson, D. (2016). Learning object orientation constraints and guiding constraints for narrow passages from one demonstration. In International symposium on experimental robotics.
Minnen, D., Starner, T., Essa, I. A., & Isbell, C. L, Jr. (2007). Improving activity discovery with automatic neighborhood estimation. IJCAI, 7, 2814–2819.
Google Scholar
Mohammad, Y., & Nishida, T. (2015). Exact multi-length scale and mean invariant motif discovery. Applied Intelligence, 44, 322–339.
Article Google Scholar
Mohan, S., & Laird, J. E. (2011). Towards situated, interactive, instructable agents in a cognitive architecture. In AAAI Fall symposium series.
Mohseni-Kabir, A., Chernova, S., & Rich, C. (2014). Collaborative learning of hierarchical task networks from demonstration and instruction. In Workshop on human–robot collaboration for industrial manufacturing, robotics science and systems, Berkeley, CA.
Mohseni-Kabir, A., Rich, C., Chernova, S., Sidner, C. L., & Miller, D. (2015). Interactive hierarchical task learning from a single demonstration. In Proceedings of the tenth annual ACM/IEEE international conference on human–robot interaction (pp. 205–212). ACM.
Mohseni-Kabir, A., Wu, V., Chernova, S., & Rich, C. (2016). What’s in a primitive? identifying reusable motion trajectories in narrated demonstrations. In IEEE international symposium on robot and human interactive communication (ROMAN).
Mollard, Y., Munzer, T., Baisero, A., Toussaint, M., & Lopes, M. (2015). Robot programming from demonstration, feedback and transfer. In IROS.
Niekum, S., Osentoski, S., Konidaris, G., Chitta, S., Marthi, B., & Barto, A. G. (2015). Learning grounded finite-state representations from unstructured demonstrations. The International Journal of Robotics Research, 34(2), 131–157.
Article Google Scholar
Oates, T. (2002). Peruse: An unsupervised algorithm for finding recurring patterns in time series. In 2002 IEEE international conference on data mining, 2002. ICDM 2003. Proceedings (pp. 330–337). IEEE.
Pardowitz, M., Knoop, S., Dillmann, R., & Zollner, R. (2007). Incremental learning of tasks from user demonstrations, past experiences, and vocal comments. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 37(2), 322–332.
Article Google Scholar
Phillips, M., Hwang, V., Chitta, S., & Likhachev, M. (2016). Learning to plan for constrained manipulation from demonstrations. Autonomous Robots, 40(1), 109–124.
Article Google Scholar
Rich, C. (2009). Building task-based user interfaces with ANSI/CEA-2018. Computer, 42(8), 20–27.
Article Google Scholar
Rich, C., & Sidner, C. (2012). Using collaborative discourse theory to partially automate dialogue tree authoring. In Intelligent virtual agents (pp. 327–340). Springer.
Rudin, L. I., Osher, S., & Fatemi, E. (1992). Nonlinear total variation based noise removal algorithms. Physica D: Nonlinear Phenomena, 60(1), 259–268.
Article MathSciNet MATH Google Scholar
Rybski, P. E., Yoon, K., Stolarz, J., & Veloso, M. M. (2007). Interactive robot task training through dialog and demonstration. In ACM/IEEE international conference on human–robot interaction (pp. 49–56).
Senin, P., Lin, J., Wang, X., Oates, T., Gandhi, S., Boedihardjo, A. P., et al. (2014). Grammarviz 2.0: A tool for grammar-based pattern discovery in time series. In Machine learning and knowledge discovery in databases (pp. 468–472). Springer.
Ye, G., & Alterovitz, R. (2011). Demonstration-guided motion planning. In ISRR.

Download references

Author information

Authors and Affiliations

Carnegie Mellon University, Pittsburgh, PA, USA
Anahita Mohseni-Kabir
Worcester Polytechnic Institute, 100 Institute Rd, Worcester, MA, 01609, USA
Changshuo Li, Victoria Wu, Daniel Miller, Benjamin Hylak, Candace Sidner & Charles Rich
Georgia Institute of Technology, Atlanta, GA, USA
Sonia Chernova
University of Michigan, Ann Arbor, MI, USA
Dmitry Berenson

Authors

Anahita Mohseni-Kabir
View author publications
You can also search for this author in PubMed Google Scholar
Changshuo Li
View author publications
You can also search for this author in PubMed Google Scholar
Victoria Wu
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Miller
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin Hylak
View author publications
You can also search for this author in PubMed Google Scholar
Sonia Chernova
View author publications
You can also search for this author in PubMed Google Scholar
Dmitry Berenson
View author publications
You can also search for this author in PubMed Google Scholar
Candace Sidner
View author publications
You can also search for this author in PubMed Google Scholar
Charles Rich
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sonia Chernova.

Additional information

This work is supported in part by the Office of Naval Research under Grant N00014-13-1-0735.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (mp4 5217 KB)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Mohseni-Kabir, A., Li, C., Wu, V. et al. Simultaneous learning of hierarchy and primitives for complex robot tasks. Auton Robot 43, 859–874 (2019). https://doi.org/10.1007/s10514-018-9749-y

Download citation

Received: 13 February 2017
Accepted: 03 April 2018
Published: 30 April 2018
Issue Date: 01 April 2019
DOI: https://doi.org/10.1007/s10514-018-9749-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Simultaneous learning of hierarchy and primitives for complex robot tasks

Abstract

Access this article

Similar content being viewed by others

A review of motion planning algorithms for intelligent robots

Recent advances in human–robot interaction: robophobia or synergy

A Survey on Learning-Based Robotic Grasping

Notes

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Electronic supplementary material

Supplementary material 1 (mp4 5217 KB)

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Simultaneous learning of hierarchy and primitives for complex robot tasks

Abstract

Access this article

Similar content being viewed by others

A review of motion planning algorithms for intelligent robots

Recent advances in human–robot interaction: robophobia or synergy

A Survey on Learning-Based Robotic Grasping

Notes

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Electronic supplementary material

Supplementary material 1 (mp4 5217 KB)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation