ABSTRACT
Pauses play a critical role in adding, shifting, or contradicting meaning in a conversation. To enable the study and incorporation of this important modality in computational discourse-analytic and processing systems, we require extensible, open-source pause coding systems and associated software libraries. We designed and implemented a coding and visualisation system for pause and overlap detection and analysis, extending existing voicing and silence detection algorithms. Demonstrating the system on the TalkBank CallFriend and CallHome corpora, we show how the approach can be used to code many different kinds of pauses and overlaps within and between interlocutors, and to calculate the temporal distribution of these different types of pause and overlap. The coding schema is intended to be combined with other speech modalities to provide novel approaches to predicting social cues and markers, which are useful for designing more naturalistic conversational agents and for building new tools that measure the turn-taking structure of conversation in greater depth and with greater accuracy.
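As a rough illustration of the kind of coding the abstract describes, the sketch below labels silences and overlaps from two speakers' voiced intervals and sums the time spent in each category. It is not the authors' implementation and does not use any published library API: the function names (`code_timeline`, `total_durations`), the three labels, and the assumption that voicing detection has already produced positive-length `(start, end)` intervals in seconds are all hypothetical choices made for this example.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

Interval = Tuple[float, float]   # (start_sec, end_sec), assumed end > start
Code = Tuple[str, float, float]  # (label, start_sec, end_sec)


def _is_speaking(intervals: List[Interval], t: float) -> bool:
    """True if time t falls inside any voiced interval."""
    return any(start <= t < end for start, end in intervals)


def code_timeline(a: List[Interval], b: List[Interval]) -> List[Code]:
    """Label silences and overlaps between speakers A and B.

    The labels are hypothetical and chosen for this sketch only:
      overlap              both speakers voiced
      within_speaker_pause silence bounded by the same single speaker
      between_speaker_gap  silence bounded by different single speakers
      other_silence        silence adjacent to an overlap
    """
    # Cut the timeline at every interval boundary.
    bounds = sorted({t for start, end in a + b for t in (start, end)})

    # Record who is voiced in each segment, sampled at its midpoint.
    segments = []
    for t0, t1 in zip(bounds, bounds[1:]):
        mid = (t0 + t1) / 2
        state = ("A" if _is_speaking(a, mid) else "") + \
                ("B" if _is_speaking(b, mid) else "")
        segments.append((t0, t1, state))

    codes: List[Code] = []
    for i, (t0, t1, state) in enumerate(segments):
        if state == "AB":
            codes.append(("overlap", t0, t1))
        elif state == "":
            # With positive-length intervals, a silent segment always
            # sits between two voiced segments.
            before = segments[i - 1][2]
            after = segments[i + 1][2]
            if before == after and before in ("A", "B"):
                label = "within_speaker_pause"
            elif {before, after} == {"A", "B"}:
                label = "between_speaker_gap"
            else:
                label = "other_silence"
            codes.append((label, t0, t1))
    return codes


def total_durations(codes: List[Code]) -> Dict[str, float]:
    """Sum the duration of each label, a crude temporal distribution."""
    totals: Dict[str, float] = defaultdict(float)
    for label, t0, t1 in codes:
        totals[label] += t1 - t0
    return dict(totals)


if __name__ == "__main__":
    speaker_a = [(0.0, 1.0), (1.5, 3.0), (5.0, 6.0)]
    speaker_b = [(2.5, 4.0)]
    for code in code_timeline(speaker_a, speaker_b):
        print(code)
    print(total_durations(code_timeline(speaker_a, speaker_b)))
```

A real system would need a richer schema than these labels and would have to handle noisy voicing decisions, for example by ignoring silences below a minimum duration; the sketch leaves such thresholds out.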
Index Terms
- PauseCode: Computational Conversation Timing Analysis