skip to main content
10.1145/3406499.3418754acmconferencesArticle/Chapter ViewAbstractPublication PageshaiConference Proceedingsconference-collections
poster
Public Access

Study on Text-based and Voice-based Dialogue Interfaces for Human-Computer Interactions in a Blocks World

Published: 10 November 2020 Publication History

Abstract

We conducted a small scale user study to understand user experiences with two different forms of dialogue interfaces - text-based and voice-based - to interact with a virtual agent in a Blocks World environment while perform tower building tasks. The participants also had the option of using deictic gestures in addition to the speech modality. We identify common types of errors/issues that led to communication failures and share our observations about how users reacted to these issues. We also present survey data that reflects users' evaluations of the dialogue interfaces and their interactions with the virtual agent.

Supplementary Material

MP4 File (3406499.3418754.mp4)
We conducted a small scale user study to understand user experiences with two different forms of dialogue interfaces - text-based and voice-based - to interact with a virtual agent in a Blocks World environment while perform tower building tasks. The participants also had the option of using deictic gestures in addition to the speech modality. We identify common types of errors/issues that led to communication failures and share our observations about how users reacted to these issues. We also present survey data that reflects users? evaluations of the dialogue interfaces and their interactions with the virtual agent.

References

[1]
Dan Bohus and Alexander I. Rudnicky. 2005. Sorry, I didn't catch that! -- An investigation of non-understanding errors and recovery strategies. In In Proceedings of the 6th SIGdial Workshop on Discourse and Dialogue. 128--143.
[2]
Rolf Carlson, Jens Edlund, Mattias Heldner, Anna Hjalmarsson, David House, and Gabriel Skantze. 2006. Towards human-like behaviour in spoken dialog systems. Proceedings of Swedish Language Technology Conference (SLTC 2006) (2006).
[3]
SRI International. 2019. Play with SMILEE. https://sites.google.com/view/playwithsmilee/home
[4]
Sujeong Kim, David Salter, Luke DeLuccia, Kilho Son, Mohamed R. Amer, and Amir Tamrakar. 2018. SMILEE: Symmetric Multi-modal Interactions with Language-gesture Enabled (AI) Embodiment. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations. Association for Computational Linguistics, New Orleans, Louisiana, 86--90. https://doi.org/10.18653/v1/N18-5018
[5]
Robyn Kozierok, Lynette Hirschman, John Aberdeen, Cheryl Clar, Christopher Garay, Bradley Goodman, Tonia Korves, and Matthew Peterson. 2018. DARPA Communicating With Computers: Program Goals and Hallmarks. (03 2018).
[6]
Nikhil Krishnaswamy, Pradyumna Narayana, Isaac Wang, Kyeongmin Rim, Rahul Bangar, Dhruva Patil, Gururaj Mulay, J. Ross Beveridge, Jaime Ruiz, Bruce A. Draper, and James Pustejovsky. 2018. Learning Interpretable Spatial Operations in a Rich 3D Blocks World. Association for the Advancement of Artificial Intelligence (AAAI).
[7]
André Natal, Glen Shires, Marcos Cáceres, and Philip J"agenstedt. 2020. Web Speech API Draft Community Group Report. https://wicg.github.io/speech-api/
[8]
Gabriel Skantze. 2003. Exploring human error handling strategies: Implications for spoken dialogue systems. In ISCA Tutorial and Research Workshop on Error Handling in Spoken Dialogue Systems.
[9]
Rhea Sukthanker, Soujanya Poria, Erik Cambria, and Ramkumar Thirunavukarasu. 2020. Anaphora and coreference resolution: A review. Information Fusion, Vol. 59 (2020), 139--162. https://doi.org/10.1016/j.inffus.2020.01.010

Cited By

View all
  • (2024)System and User Strategies to Repair Conversational Breakdowns of Spoken Dialogue Systems: A Scoping ReviewProceedings of the 6th ACM Conference on Conversational User Interfaces10.1145/3640794.3665558(1-13)Online publication date: 8-Jul-2024
  • (2024)Assessing Creativity and User Experience in Immersive Virtual Reality with Cultural Heritage LearningInternational Journal of Human–Computer Interaction10.1080/10447318.2024.2405784(1-17)Online publication date: 30-Sep-2024
  • (2022)Conversational AI over Military Scenarios Using Intent Detection and Response GenerationApplied Sciences10.3390/app1205249412:5(2494)Online publication date: 27-Feb-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
HAI '20: Proceedings of the 8th International Conference on Human-Agent Interaction
November 2020
304 pages
ISBN:9781450380546
DOI:10.1145/3406499
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 November 2020

Check for updates

Author Tags

  1. blocks world
  2. dialogue interface
  3. human-agent interactions
  4. natural language interface

Qualifiers

  • Poster

Funding Sources

  • DARPA

Conference

HAI '20
Sponsor:

Acceptance Rates

Overall Acceptance Rate 121 of 404 submissions, 30%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)70
  • Downloads (Last 6 weeks)6
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)System and User Strategies to Repair Conversational Breakdowns of Spoken Dialogue Systems: A Scoping ReviewProceedings of the 6th ACM Conference on Conversational User Interfaces10.1145/3640794.3665558(1-13)Online publication date: 8-Jul-2024
  • (2024)Assessing Creativity and User Experience in Immersive Virtual Reality with Cultural Heritage LearningInternational Journal of Human–Computer Interaction10.1080/10447318.2024.2405784(1-17)Online publication date: 30-Sep-2024
  • (2022)Conversational AI over Military Scenarios Using Intent Detection and Response GenerationApplied Sciences10.3390/app1205249412:5(2494)Online publication date: 27-Feb-2022
  • (2021)Towards Understanding Confusion and Affective States Under Communication Failures in Voice-Based Human-Machine Interaction2021 9th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos (ACIIW)10.1109/ACIIW52867.2021.9666238(1-5)Online publication date: 28-Sep-2021

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media