skip to main content
10.1145/3279972.3279975acmconferencesArticle/Chapter ViewAbstractPublication Pagesicmi-mlmiConference Proceedingsconference-collections
research-article

PauseCode: Computational Conversation Timing Analysis

Authors Info & Claims
Published:16 October 2018Publication History

ABSTRACT

Pauses play a critical role in adding, shifting or contradicting meaning in a conversation. To enable the study and incorporation of this important modality in computational discourse analytic and processing systems, we require extensible open source pause coding systems and associated software libraries. We designed and implemented a coding and visualisation system for pause and overlap detection and analysis, extending existing voicing and silence detection algorithms. Demonstrating the system using the TalkBank CallFriend and CallHome corpora we show how the approach can be used to code many different kinds of pauses and overlaps within and between interlocutors, and calculate the temporal distribution of these different types of pause and overlap. The coding schema is intended to be combined with other speech modalities to provide novel approaches to predicting social cues and markers, useful for designing more naturalistic conversational agents, and in new tools for measuring turn-taking structure of conversation in greater depth and accuracy.

References

  1. {n. d.}. The Talk Bank system. https://talkbank.org/. Accessed: 2018--05--14.Google ScholarGoogle Scholar
  2. Xavier Anguera, Simon Bozonnet, Nicholas Evans, Corinne Fredouille, Gerald Friedland, and Oriol Vinyals. 2012. Speaker diarization: A review of recent research. IEEE Transactions on Audio, Speech, and Language Processing 20, 2 (2012), 356--370. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Cindy Gallois and Howard Giles. 2015. Communication accommodation theory (1st ed.). Wiley, Hoboken, NJ, 159--176.Google ScholarGoogle Scholar
  4. Cynthia Gallois and Norman Markel. 1975. Turn Taking: Social Personality and Conversational Style. Journal of Personality and Social Psychology 31, 6 (1975), 1134--1140.Google ScholarGoogle ScholarCross RefCross Ref
  5. Cindy Gallois, Ann Weatherall, and Howard Giles. 2016. CAT and talk in action. Cambridge University Press, Cambridge, England, 105--122.Google ScholarGoogle Scholar
  6. Yaniv Leviathan and Yossi Matias. 2018. Google Duplex: An AI System for Accomplishing Real-World Tasks Over the Phone. https://ai.googleblog.com/2018/05/duplex-ai-system-for-natural-conversation.html.Google ScholarGoogle Scholar
  7. Harvey Sacks, Emanuel A Schegloff, and Gail Jefferson. 1978. A simplest systematics for the organization of turn taking for conversation. In Studies in the organization of conversational interaction. Elsevier, 7--55.Google ScholarGoogle Scholar
  8. Kirill Sakhnov, Ekaterina Verteletskaya, and Boris Simak. 2009. Dynamical energy-based speech/silence detector for speech enhancement applications. In Proceedings of the World Congress on Engineering, Vol. 1. Citeseer, 2.Google ScholarGoogle Scholar
  9. Yvonne Yu, Paul Vrbik, and Daniel Angus. 2018. Communications Analytics Laboratory Toolbox. https://github.com/YvonneYYu/calpy.Google ScholarGoogle Scholar

Index Terms

  1. PauseCode: Computational Conversation Timing Analysis

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            MA3HMI'18: Proceedings of the 4th International Workshop on Multimodal Analyses Enabling Artificial Agents in Human-Machine Interaction
            October 2018
            50 pages
            ISBN:9781450360760
            DOI:10.1145/3279972

            Copyright © 2018 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 16 October 2018

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article
            • Research
            • Refereed limited

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader