skip to main content
10.1145/3308532.3329445acmconferencesArticle/Chapter ViewAbstractPublication PagesivaConference Proceedingsconference-collections
extended-abstract

Text-driven Visual Prosody Generation for Embodied Conversational Agents

Published: 01 July 2019 Publication History

Abstract

In face-to-face conversations, head motions play a crucial role in encoding information, and humans are very skilled at decoding multiple messages from interlocutors' head motions. It is of great importance to endow embodied conversational agents (ECAs) with the capability of conveying communicative intention through head movements. Our work is aimed at automatically synthesizing head motions for an ECA speaking Chinese. We propose to take only transcripts as input to compute head movements, based on a statistical framework. Subjective experiments are conducted to validate the proposed statistical framework. The results show that the generated head animation is able to improve human perception in terms of naturalness and demonstrate that the head animation is synchronized with the input of synthetic speech.

References

[1]
Y. Ding, Y. Zhang, M. Xiao, and Z. Deng. 2017. A Multifaceted Study on Eye Contact Based Speaker Identification in Three-party Conversations. In Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems (CHI '17). 3011--3021.
[2]
L. Tickle-Degnen and R. Rosenthal. 1990. The Nature of Rapport and Its Nonverbal Correlates. Psychological Inquiry, Vol. 1, 4 (1990), 285--293.
[3]
A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is All you Need. In Advances in Neural Information Processing Systems 30. 5998--6008.

Cited By

View all
  • (2024)Enhancing Transparency: AI Applications for Detecting Cheating and Predicting Player Attrition in Online Gaming2024 International Conference on Signal Processing, Computation, Electronics, Power and Telecommunication (IConSCEPT)10.1109/IConSCEPT61884.2024.10627922(1-6)Online publication date: 4-Jul-2024
  • (2023)A Music-Driven Deep Generative Adversarial Model for Guzheng Playing AnimationIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2021.311590229:2(1400-1414)Online publication date: 1-Feb-2023
  • (2023)Explainable AI for Cheating Detection and Churn Prediction in Online GamesIEEE Transactions on Games10.1109/TG.2022.317339915:2(242-251)Online publication date: Jun-2023

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
IVA '19: Proceedings of the 19th ACM International Conference on Intelligent Virtual Agents
July 2019
282 pages
ISBN:9781450366724
DOI:10.1145/3308532
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 July 2019

Check for updates

Author Tags

  1. animation
  2. data-driven
  3. datasets
  4. deep learning
  5. head
  6. neural networks
  7. statistical framework
  8. text
  9. transformer

Qualifiers

  • Extended-abstract

Conference

IVA '19
Sponsor:

Acceptance Rates

IVA '19 Paper Acceptance Rate 15 of 63 submissions, 24%;
Overall Acceptance Rate 53 of 196 submissions, 27%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)13
  • Downloads (Last 6 weeks)0
Reflects downloads up to 28 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Enhancing Transparency: AI Applications for Detecting Cheating and Predicting Player Attrition in Online Gaming2024 International Conference on Signal Processing, Computation, Electronics, Power and Telecommunication (IConSCEPT)10.1109/IConSCEPT61884.2024.10627922(1-6)Online publication date: 4-Jul-2024
  • (2023)A Music-Driven Deep Generative Adversarial Model for Guzheng Playing AnimationIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2021.311590229:2(1400-1414)Online publication date: 1-Feb-2023
  • (2023)Explainable AI for Cheating Detection and Churn Prediction in Online GamesIEEE Transactions on Games10.1109/TG.2022.317339915:2(242-251)Online publication date: Jun-2023

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media