Abstract
A methodology and programming environment for the specification and interpretation of dialogue models for grounded multimodal interaction is presented. This conceptual framework permits the declarative specification of complex interactive systems with multimodal input and output, including speech, computer vision and motor behavior. We first introduce the present notion of dialogue model with its motivation on the structure of conversation. Then, the specification and interpretation of dialogue models is presented and discussed. We also present a cognitive architecture for the construction of intelligent Human-Computer Interaction (HCI) applications within this conceptual framework. The paper concludes with references to working systems, demos and work in progress built within the present framework.
We acknowledge the support of the members of the DIME and Golem group at IIMAS, UNAM. We also gratefully thank the support of grants CONACyT 81965 and PAPPIT-UNAM IN-121206, IN-104408 and IN-115710.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Aguilar, W., Pineda, L.A.: Integrating Graph-Based Vision Perception to Spoken Conversation in Human-Robot Interaction. In: Cabestany, J., et al. (eds.) IWANN 2009. LNCS, vol. 5517, pp. 789–796. Springer, Heidelberg (2009)
Allen, J., Core, M.: Draft of DAMSL: Dialog Act Markup in Several Layers Annotation Scheme. Department of Computer Science, Rochester University (1997)
Avilés, H., Meza, I., Aguilar, W., Pineda, L.A.: Integrating Pointing Gestures into Spanish-Spoken Dialog System for Conversational Service Robots. In: Proceedings of ICAART 2010, Valencia, España, pp. 585–588 (DVD) (2010)
Avilés, H., Alvarado-González, M., Venegas, E., Rascón, C., Meza, I., Pineda, L.A.: Development of a Tour-Guide Robot Using Dialogue Models and a Cognitive Architecture. In: Kuri-Morales, A., Simari, G. (eds.) IBERAMIA 2010. LNCS (LNAI), vol. 6433, pp. 512–521. Springer, Heidelberg (2010)
Clark, H., Schaefer, E.: Contributing to Discourse. Cognitive Science 13, 259–294
Lowe, D.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60(2), 91–110 (2004) (1989)
Mann, W.C., Thompson, S.: Rhetorical Structure Theory: Towards a functional theory of text organization. Text 8(3), 243–281 (1988)
Meza, I., Pérez-Pavón, P., Salinas, L., Avilés, H., Pineda, L.A.: A Multimodal Dialogue System for Playing the Game “Guess the Card”. Procesamiento del Lenguaje Natural (44), 131–138 (2010)
Meza, I., Salinas, L., Venegas, E., Castellanos, H., Chavarria, A., Pineda, L.A.: Specification and Evaluation of a Spanish Conversational System Using Dialogue Models. In: Kuri-Morales, A., Simari, G. (eds.) IBERAMIA 2010. LNCS (LNAI), vol. 6433, pp. 346–355. Springer, Heidelberg (2010)
Nakauchi, Y., Naphattalung, P., Takahashi, T., Matsubara, T., Kashiwagi, E.: Proposal and Evaluation of Nat. Lang. Human-Robot Interface System based on Conversation Theory. In: Proceedings of the 2003 IEEE Int. Conf. on Robotics and Automation, Taipei, Taiwan, September 14-19, pp. 380–385 (2003)
Pineda, L.A.: Conservation principles and action schemes in the synthesis of geometric concepts. Artificial Intelligence 171(4), 197–238 (2007)
Pineda, L.A., Estrada, V., Coria, S., Allen, J.: The Obligations and Common Ground Structure of Practical Dialogues, Inteligencia Artificial. Revista Iberoamericana de Inteligencia Artificial 11(36), 9–17 (2007)
Pineda, L.A.: Specification and Interpretation of Multimodal Dialogue Models for Human-Robot Interaction. In: Sidorov, G. (ed.) Artificial Intelligence for Humans: Service Robots and Social Modeling, SMIA, México, pp. 33–50 (2008)
Rascón, C., Avilés, H., Pineda, L.A.: Robotic Orientation towards Speaker in Human-Robot Interaction. In: Kuri-Morales, A., Simari, G. (eds.) IBERAMIA 2010. LNCS (LNAI), vol. 6433, pp. 10–19. Springer, Heidelberg (2010)
Tulving, E.: Memory systems: episodic and semantic memory. In: Tulving, E., Donaldson, W. (eds.) Organization of Memory, pp. 381–403. Academic Press, New York (1972)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pineda, L.A., Meza, I.V., Salinas, L. (2010). Dialogue Model Specification and Interpretation for Intelligent Multimodal HCI. In: Kuri-Morales, A., Simari, G.R. (eds) Advances in Artificial Intelligence – IBERAMIA 2010. IBERAMIA 2010. Lecture Notes in Computer Science(), vol 6433. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16952-6_3
Download citation
DOI: https://doi.org/10.1007/978-3-642-16952-6_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16951-9
Online ISBN: 978-3-642-16952-6
eBook Packages: Computer ScienceComputer Science (R0)