Abstract
In this paper we present the integration of graph-based visual perception to spoken conversation in human-robot interaction. The proposed architecture has a dialogue manager as the central component for the multimodal interaction, which directs the robot’s behavior in terms of the intentions and actions associated to the conversational situations. We tested this ideas on a mobile robot programmed to act as a visitor’s guide to our department of computer science.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Allen, J., Byron, D., Dzikovska, M., Ferguson, G., Galescu, L., Stent, A.: An architecture for a generic dialogue shell. Natural Language Engineering 6(34), 213–228 (2000)
Pineda, L.A.: Specification and interpretation of multimodal dialogue models. In: Sidorov, G. (ed.) Memorias del Workshop de robots de servicio, MICAI (2008)
Wachsmuth, S., Fink, G.A., Kummert, F., Sagerer, G.: Using speech in visual object recognition. In: Mustererkennung 2000, 22. DAGM-Symposium Kiel, Informatik Aktuell, pp. 428–435. Springer, Heidelberg (2000)
Saenko, K., Darrell, T.: Towards adaptive object recognition for situated human-computer interaction. In: Proceedings of the 2007 Workshop on Multimodal Interfaces in Semantic Interaction, pp. 43–46 (2007)
Rahmadi, K., Altab, H.M., Akio, N., Yoshinori, K.: Object recognition through human-robot interaction by speech. In: 13th IEEE International Workshop on Robot and Human Interactive Communication, RO-MAN, pp. 619–624 (2004)
Pineda, L.A., Villasenor, L., Cuétara, J., Castellanos, H., López, I.: Dimex100: A new phonetic and speech corpus for mexican spanish. In: Lemaître, C., Reyes, C.A., González, J.A. (eds.) IBERAMIA 2004. LNCS, vol. 3315, pp. 974–983. Springer, Heidelberg (2004)
Obdrzálek, J.M.: Object recognition methods based on transformation covariant features. In: XII European Signal Processing Conference EUSIPCO 2004, pp. 1333–1336 (2004)
Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. IEEE Transactions on Pattern Analysis and Machine Intelligence 27(10), 1615–1630 (2005)
Lowe, D.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60(2), 91–110 (2004)
Beis, J., Lowe, D.: Shape indexing using approximate nearest-neighbour search in highdimensional spaces. In: Proc. IEEE Conference on Computer Vision and Pattern Recognition, Puerto Rico, pp. 1000–1006 (1997)
Aguilar, W., Frauel, Y., Escolano, F., Pérez, M.M., Espinosa-Romero, A., Lozano, M.: A robust graph transformation matching for non-rigid registration. Image and Vision Computing (2008), doi:10.1016/j.imavis.2008.05.004
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Aguilar, W., Pineda, L.A. (2009). Integrating Graph-Based Vision Perception to Spoken Conversation in Human-Robot Interaction. In: Cabestany, J., Sandoval, F., Prieto, A., Corchado, J.M. (eds) Bio-Inspired Systems: Computational and Ambient Intelligence. IWANN 2009. Lecture Notes in Computer Science, vol 5517. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02478-8_99
Download citation
DOI: https://doi.org/10.1007/978-3-642-02478-8_99
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02477-1
Online ISBN: 978-3-642-02478-8
eBook Packages: Computer ScienceComputer Science (R0)