Abstract
Much talk between humans is casual and multiparty. It facilitates social bonding and mutual co-presence rather than strictly being used to exchange information in order to complete well-defined practical tasks. Artificial partners that are capable of participating as a speaker or listener in such talk would be useful for companionship, educational, and social contexts. However, such applications require dialogue structure beyond simple question/answer routines. While there is body of theory on multiparty casual talk, there is a lack of work quantifying such phenomena. This is critical if we are to manage and generate human machine multiparty casual talk. We outline the current knowledge on the structure of casual talk, describe our investigations in this domain, summarise our findings on timing, laughter, and disfluency in this domain, and discuss how they can inform the design and implementation of truly social machine dialogue partners.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Allen, J., Byron, D., Dzikovska, M., Ferguson, G., Galescu, L., Stent, A.: An architecture for a generic dialogue shell. Nat. Lang. Eng. 6(3&4), 213–228 (2000)
Allwood, J., Björnberg, M., Grönqvist, L., Ahlsén, E., Ottesjö, C.: The spoken language corpus at the department of linguistics, Göteborg University. In: FQS-Forum Qualitative Social Research, vol. 1 (2000). http://www.ling.gu.se/~jens/publications/bfiles/B45.pdf
Anderson, A., Bader, M., Bard, E., Boyle, E., Doherty, G., Garrod, S., Isard, S., Kowtko, J., McAllister, J., Miller, J., et al.: The HCRC map task corpus. Lang. Speech 34(4), 351–366 (1991)
Aubrey, A.J., Marshall, D., Rosin, P.L., Vandeventer, J., Cunningham, D.W., Wallraven, C.: Cardiff conversation database (CCDb): a database of natural dyadic conversations. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 277–282, June 2013
Baron, D., Shriberg, E., Stolcke, A.: Automatic punctuation and disfluency detection in multi-party meetings using prosodic and lexical cues. Channels 20(61), 41 (2002)
Beattie, G.: Talk: an Analysis of Speech and Non-verbal Behaviour in Conversation. Open University Press, Milton Keynes (1983)
Bickmore, T., Cassell, J.: Social dialogue with embodied conversational agents. In: van Kuppevelt, J., Dybkjaer, L., Bernsen, N. (eds.) Advances in Natural Multimodal Dialogue Systems, pp. 23–54. Kluwer, New York (2005)
BNC-Consortium: British national corpus (2000). URL http://www.hcu.ox.ac.uk/BNC
Boersma, P., Weenink, D.: Praat: doing phonetics by computer [Computer program], Version 5.1. 44 (2010)
Bonin, F., Campbell, N., Vogel, C.: Laughter and topic changes: temporal distribution and information flow. In: 2012 IEEE 3rd International Conference on Cognitive Infocommunications (CogInfoCom), pp. 53–58 (2012)
Bonin, F., Campbell, N., Vogel, C.: Temporal distribution of laughter in conversation. In: Proceedings of the Third Interdisciplinary Workshop on Laughter and other Non-Verbal Vocalization in Speech, pp. 25–26 (2012)
Campbell, N.: Multimodal processing of discourse information, the effect of synchrony. In: Second International Symposium on Universal Communication, ISUC 2008, pp. 12–15 (2008)
Campbell, N.: Approaches to conversational speech rhythm: speech activity in two-person telephone dialogues. In: Proceedings of XVIth International Congress of the Phonetic Sciences, Saarbrucken, Germany, pp. 343–348 (2007). http://www.icphs2007.de/conference/Papers/1775/1775.pdf
Cheepen, C.: The Predictability of Informal Conversation. Pinter, London (1988)
DuBois, J.W., Chafe, W.L., Meyer, C., Thompson, S.A.: Santa Barbara corpus of spoken american english. Linguistic Data Consortium, CD-ROM, Philadelphia (2000)
Edlund, J., Beskow, J., Elenius, K., Hellmer, K., Strömbergsson, S., House, D.: Spontal: a swedish spontaneous dialogue corpus of audio, video and motion capture. In: LREC (2010)
Eggins, S., Slade, D.: Analysing Casual Conversation. Equinox Publishing Ltd., Sheffield (2004)
Gilmartin, E., Bonin, F., Cerrato, L., Vogel, C., Campbell, N.: What’s the game and who’s got the ball? genre in spoken interaction. In: 2015 AAAI Spring Symposium Series (2015)
Gilmartin, E., Bonin, F., Vogel, C., Campbell, N.: Laugher and topic transition in multiparty conversation. In: Proceedings of the SIGDIAL 2013 Conference, pp. 304–308. Association for Computational Linguistics, Metz, August 2013
Gilmartin, E., Campbell, N.: Capturing chat: annotation and tools for multiparty casual conversation. In: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016) (2016)
Gilmartin, E., Vogel, C., Campbell, N.: Disfluency in multiparty social talk. In: Proceedings of DISS 2015, Edinburgh (2015)
Godfrey, J.J., Holliman, E.C., McDaniel, J.: SWITCHBOARD: telephone speech corpus for research and development. In: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP-92, vol. 1, pp. 517–520 (1992)
Greenbaum, S.: ICE: the international corpus of english. Engl. Today 28(7.4), 3–7 (1991)
Hennig, S., Chellali, R., Campbell, N.: The D-ANS corpus: the Dublin-Autonomous Nervous System corpus of biosignal and multimodal recordings of conversational speech. Reykjavik, Iceland (2014)
Janin, A., Baron, D., Edwards, J., Ellis, D., Gelbart, D., Morgan, N., Peskin, B., Pfau, T., Shriberg, E., Stolcke, A.: The ICSI meeting corpus. In: 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, Proceedings, ICASSP 2003, vol. 1, pp. I-364 (2003)
Laskowski, K.: Predicting, detecting and explaining the occurrence of vocal activity in multi-party conversation. Ph.D. thesis, Carnegie Mellon University (2011)
Laskowski, K., Burger, S.: Analysis of the occurrence of laughter in meetings. In: INTERSPEECH, pp. 1258–1261 (2007)
Laver, J.: Communicative functions of phatic communion. In: Organization of Behavior in Face-to-Face Interaction, pp. 215–238 (1975)
McCowan, I., Carletta, J., Kraaij, W., Ashby, S., Bourban, S., Flynn, M., Guillemot, M., Hain, T., Kadlec, J., Karaiskos, V.: The AMI meeting corpus. In: Proceedings of the 5th International Conference on Methods and Techniques in Behavioral Research. vol. 88 (2005)
Oertel, C., Cummins, F., Edlund, J., Wagner, P., Campbell, N.: D64: a corpus of richly recorded conversational interaction. J. Multimodal User Interfaces 7, 1–10 (2010)
Paggio, P., Allwood, J., Ahlsén, E., Jokinen, K.: The NOMCO multimodal Nordic resource-goals and characteristics (2010). http://bada.hb.se/handle/2320/7400
Schneider, K.P.: Small Talk: Analysing Phatic Discourse, vol. 1. Hitzeroth, Marburg (1988)
Ventola, E.: The structure of casual conversation in English. J. Pragmatics 3(3), 267–298 (1979)
Wilson, J.: On the Boundaries of Conversation, vol. 10. Pergamon Press, Pergamon (1989)
Wittenburg, P., Brugman, H., Russel, A., Klassmann, A., Sloetjes, H.: Elan: a professional framework for multimodality research. In: Proceedings of LREC, vol. 2006 (2006)
Yu, Z., Xu, Z., Black, A.W., Rudnicky, A.: Strategy and policy learning for non-task-oriented conversational systems. In: Proceedings of the 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue (2016)
Acknowledgements
This work is supported by the European Coordinated Research on Long-term Challenges in Information and Communication Sciences and Technologies ERA-NET (CHISTERA) JOKER project, JOKe and Empathy of a Robot/ECA: Towards social and affective relations with a robot, and by the Speech Communication Lab, Trinity College Dublin, and by Science Foundation Ireland funding for ADAPT (13/RC/2106) at Trinity College Dublin.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Gilmartin, E., Cowan, B.R., Vogel, C., Campbell, N. (2017). Exploring Multiparty Casual Talk for Social Human-Machine Dialogue. In: Karpov, A., Potapova, R., Mporas, I. (eds) Speech and Computer. SPECOM 2017. Lecture Notes in Computer Science(), vol 10458. Springer, Cham. https://doi.org/10.1007/978-3-319-66429-3_36
Download citation
DOI: https://doi.org/10.1007/978-3-319-66429-3_36
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-66428-6
Online ISBN: 978-3-319-66429-3
eBook Packages: Computer ScienceComputer Science (R0)