Grounding Verbs of Motion in Natural Language Commands to Robots

Kollar, Thomas; Tellex, Stefanie; Roy, Deb; Roy, Nicholas

doi:10.1007/978-3-642-28572-1_3



Part of the book series: Springer Tracts in Advanced Robotics (STAR, volume 79)

Abstract

To be useful teammates to human partners, robots must be able to follow spoken instructions given in natural language. An important class of instructions involves interacting with people, such as “Follow the person to the kitchen” or “Meet the person at the elevators.” These instructions require that the robot fluidly react to changes in the environment, not simply follow a pre-computed plan. We present an algorithm for understanding natural language commands with three components. First, we create a cost function that scores the language according to how well it matches a candidate plan in the environment, defined as the log-likelihood of the plan given the command. Components of the cost function include novel models for the meanings of motion verbs such as “follow,” “meet,” and “avoid,” as well as spatial relations such as “to” and landmark phrases such as “the kitchen.” Second, an inference method uses this cost function to perform forward search, finding a plan that matches the natural language command. Third, a high-level controller calls the inference method at each timestep to compute a new plan in response to changes in the environment such as the movement of the human partner or other people in the scene. When a command consists of more than a single task, the controller switches to the next task when an earlier one is satisfied. We evaluate our approach on a set of example tasks that require the ability to follow both simple and complex natural language commands.
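
The three components described in the abstract (a plan-scoring function, forward search over candidate plans, and a controller that replans at every timestep) can be illustrated with a toy sketch. This is not the authors' implementation: the grid world, the Gaussian-style score for "follow", the exhaustive fixed-horizon search, and the names follow_score, forward_search, and controller are all assumptions made for illustration. The sketch also assumes the person stays put over the planning horizon and does not model multi-task commands.

```python
import math
from itertools import product

# Hypothetical action set: stay, or move one cell along a grid axis.
MOVES = [(0, 0), (1, 0), (-1, 0), (0, 1), (0, -1)]


def follow_score(plan, person, sigma=1.0):
    """Toy log-likelihood-style score for "follow" (assumed form).

    Sums, over the plan's positions, a Gaussian log-density (up to a
    constant) of the robot-person distance. Higher is better.
    """
    score = 0.0
    for rx, ry in plan:
        d2 = (rx - person[0]) ** 2 + (ry - person[1]) ** 2
        score -= d2 / (2.0 * sigma ** 2)
    return score


def forward_search(robot, person, horizon=3):
    """Enumerate every move sequence of length `horizon`; keep the best plan."""
    best_plan, best_score = None, -math.inf
    for moves in product(MOVES, repeat=horizon):
        pos, plan = robot, []
        for dx, dy in moves:
            pos = (pos[0] + dx, pos[1] + dy)
            plan.append(pos)
        score = follow_score(plan, person)
        if score > best_score:
            best_plan, best_score = plan, score
    return best_plan, best_score


def controller(robot, person_track, horizon=3):
    """Replan at each timestep from the latest observed person position.

    Only the first step of each plan is executed, so the robot reacts
    to the person's movement instead of committing to a stale plan.
    """
    trajectory = [robot]
    for person in person_track[1:]:
        plan, _ = forward_search(robot, person, horizon)
        robot = plan[0]
        trajectory.append(robot)
    return trajectory
```

Calling controller((0, 0), track) with a list of observed person positions returns the robot's position at each timestep. Executing only the first step of every plan is what makes the loop reactive: a plan computed for an outdated person position is discarded as soon as a new observation arrives.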



Author information

Authors and Affiliations

Computer Science and Artificial Intelligence Lab., Massachusetts Institute of Technology, 77 Massachusetts Ave., Cambridge, MA, 02139, USA
Thomas Kollar & Nicholas Roy
MIT Media Lab., 75 Amherst St., Cambridge, MA, 02139, USA
Stefanie Tellex & Deb Roy

Corresponding author

Correspondence to Thomas Kollar.

Editor information

Editors and Affiliations

Artificial Intelligence Laboratory, Stanford University Dept. Computer Science, Stanford, California, USA
Oussama Khatib
Department of Mechanical Engineering, University of Pennsylvania GRASP Laboratory, Philadelphia, Pennsylvania, USA
Vijay Kumar
Department of Computer Science, University of Southern California MC 290, California, USA
Gaurav Sukhatme


About this chapter

Cite this chapter

Kollar, T., Tellex, S., Roy, D., Roy, N. (2014). Grounding Verbs of Motion in Natural Language Commands to Robots. In: Khatib, O., Kumar, V., Sukhatme, G. (eds) Experimental Robotics. Springer Tracts in Advanced Robotics, vol 79. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28572-1_3


DOI: https://doi.org/10.1007/978-3-642-28572-1_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28571-4
Online ISBN: 978-3-642-28572-1
eBook Packages: Engineering, Engineering (R0)
