Abstract
In multiparty human–agent interaction, the agent should be able to respond appropriately to a user by determining whether an utterance is addressed to the agent or to another participant. This study proposes a model that predicts the addressee from acoustic information in speech and from head orientation as nonverbal information. First, we conducted a Wizard-of-Oz (WOZ) experiment to collect human–agent triadic conversations. We then analyzed whether acoustic features and head orientation correlated with addressee-hood. Based on this analysis, we propose an addressee prediction model that integrates acoustic features and head orientation using a support vector machine (SVM).
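The abstract describes an SVM that fuses acoustic and head-orientation features to decide whether an utterance is agent-directed. The following is a minimal sketch of that idea, not the authors' actual model: the feature set (mean F0, mean intensity, head yaw), the synthetic data, and the convention that yaw near zero means facing the agent are all illustrative assumptions.

```python
# Hedged sketch of an SVM addressee classifier combining acoustic features
# with head orientation. All features and data here are assumed for
# illustration; the paper's real corpus and feature set are not reproduced.
import numpy as np
from sklearn.svm import SVC
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)

# Synthetic utterances: [mean_f0_hz, mean_intensity_db, head_yaw_deg].
# Assumed convention: head_yaw near 0 deg means the speaker faces the agent.
n = 200
to_agent = np.column_stack([
    rng.normal(180, 15, n),   # somewhat raised pitch toward the agent
    rng.normal(65, 3, n),     # louder, more articulated speech
    rng.normal(0, 8, n),      # facing the agent
])
to_human = np.column_stack([
    rng.normal(150, 15, n),
    rng.normal(58, 3, n),
    rng.normal(45, 8, n),     # turned toward the other participant
])
X = np.vstack([to_agent, to_human])
y = np.array([1] * n + [0] * n)  # 1 = addressed to the agent

# Standardize features (F0, dB, and degrees live on different scales),
# then fit an RBF-kernel SVM.
clf = make_pipeline(StandardScaler(), SVC(kernel="rbf"))
clf.fit(X, y)

# Classify a new utterance: raised pitch, higher intensity, facing the agent.
print(clf.predict([[185.0, 66.0, 2.0]])[0])
```

In a real system the acoustic features would come from a pitch/intensity extractor and the head yaw from a tracker; the SVM simply treats the fused vector as one multimodal observation.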
© 2011 Springer-Verlag Berlin Heidelberg
Cite this paper
Baba, N., Huang, H.-H., Nakano, Y.I. (2011). Identifying Utterances Addressed to an Agent in Multiparty Human–Agent Conversations. In: Vilhjálmsson, H.H., Kopp, S., Marsella, S., Thórisson, K.R. (eds) Intelligent Virtual Agents. IVA 2011. Lecture Notes in Computer Science, vol 6895. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23974-8_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23973-1
Online ISBN: 978-3-642-23974-8