skip to main content
10.1145/3334480.3382792acmconferencesArticle/Chapter ViewAbstractPublication PageschiConference Proceedingsconference-collections
abstract

Understanding User Perceptions of Robot's Delay, Voice Quality-Speed Trade-off and GUI during Conversation

Published: 25 April 2020 Publication History

Abstract

Conversational robots face the practical challenge of providing timely responses to ensure smooth interactions with users. Thus, those who design and implement robots will need to understand how different levels of delay in response may affect users' satisfaction with the conversation, how to balance the trade-off between a robot's quality of voice and response time, and how to design strategies to mitigate possible negative effects of a long delay. Via an online video-prototype study on a service robot with 94 Chinese participants, we find that users could tolerate up to 4s delay but their satisfaction drops at the 8s delay during both information-retrieval conversations and chitchats. We gain an in-depth understanding of users' preference for the trade-off between the voice quality and the response speed, as well as their opinions on possible robot graphic user interface (GUI) design to alleviate negative user experience with response latency.

References

[1]
Sean Andrist, Xiang Zhi Tan, Michael Gleicher, and Bilge Mutlu. 2014. Conversational Gaze Aversion for Humanlike Robots. In Proceedings of the 2014 ACM/IEEE International Conference on Human-robot Interaction (HRI '14). ACM, New York, NY, USA, 25--32.
[2]
Sean Andrist, Micheline Ziadee, Halim Boukaram, Bilge Mutlu, and Majd Sakr. 2015. Effects of Culture on the Credibility of Robot Speech: A Comparison between English and Arabic. In Proceedings of the Tenth Annual ACM/IEEE International Conference on Human-Robot Interaction (HRI '15). ACM, NY, NY, USA, 157--164.
[3]
K Dautenhahn, M Walters, S Woods, K L Koay, C L Nehaniv, hertsacuk E A Sisbot, and R Alami. 2006. How May I Serve You? A Robot Companion Approaching a Seated Person in a Helping Context. (2006). http://delivery.acm.org/10.1145/1130000/1121272/p172-dautenhahn.pdf?ip=175.159.124.16
[4]
iFLYTEK. 2019. Offline Speech Synthesis - iFLYTEK Open Platform. (2019). https://www.xfyun.cn/services/offline_tts.
[5]
Alisa Kalegina, Grace Schroeder, Aidan Allchin, Keara Berlin, and Maya Cakmak. 2018. Characterizing the Design Space of Rendered Robot Faces. In Proceedings of the 2018 ACM/IEEE International Conference on Human-Robot Interaction (HRI '18). ACM, New York, NY, USA, 96--104.
[6]
Min Kyung Lee, Sara Kielser, Jodi Forlizzi, Siddhartha Srinivasa, and Paul Rybski. 2010. Gracefully Mitigating Breakdowns in Robotic Services. In Proceedings of the 5th ACM/IEEE International Conference on Human-robot Interaction (HRI '10). IEEE Press, Piscataway, NJ, USA, 203--210. http: //dl.acm.org/citation.cfm?id=1734454.1734544
[7]
Min Kyung Lee, Sara Kiesler, Jodi Forlizzi, and Paul Rybski. 2012. Ripple Effects of an Embedded Social Agent: A Field Study of a Social Robot in the Workplace. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '12). ACM, New York, NY, USA, 695--704.
[8]
FIONA PARKER FOR THE DAILY MAIL. 2018. Shop hires robot assistant... then fires it after just a week: Fabio the ShopBot irritates and confuses customers with vague replies and bad directions. (2018). http://www.dailymail.co.uk/news/article-5295837/Shop-hires-robot-assistant-fires-just-week.html.
[9]
Richard Nisbett, Craig Caputo, Patricia Legant, and Jeanne Marecek. 1973. Behavior as Seen by the Actor and as Seen by the Observer. Journal of Personality and Social Psychology 27 (08 1973), 154--164.
[10]
Zhenhui Peng, Yunhwan Kwon, Jiaan Lu, Ziming Wu, and Xiaojuan Ma. 2019. Design and Evaluation of Service Robot's Proactivity in Decision-Making Support Process. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems (CHI '19). ACM, NY, NY, USA, Article Paper 98, 13 pages.
[11]
Steven C. Seow. 2008. Designing and Engineering Time: The Psychology of Time Perception in Software (1 ed.). Addison-Wesley Professional.
[12]
Yang Shi, Xin Yan, Xiaojuan Ma, Yongqi Lou, and Nan Cao. 2018. Designing Emotional Expressions of Conversational States for Voice Assistants: Modality and Engagement. In Extended Abstracts of the 2018 CHI Conference on Human Factors in Computing Systems (CHI EA '18). ACM, New York, NY, USA, Article LBW557, 6 pages.
[13]
Toshiyuki Shiwa, Takayuki Kanda, Michita Imai, Hiroshi Ishiguro, and Norihiro Hagita. 2008. How Quickly Should Communication Robots Respond?. In Proceedings of the 3rd ACM/IEEE International Conference on Human Robot Interaction (HRI '08). ACM, New York, NY, USA, 153--160.
[14]
Michael Walters, Dag Sverre Syrdal, Kheng Koay, Kerstin Dautenhahn, and Rene Boekhorst. 2008. Human approach distances to a mechanical-looking robot with different robot voice styles. Proceedings of the 17th IEEE International Symposium on Robot and Human Interactive Communication, RO-MAN (09 2008), 707 -- 712.

Cited By

View all
  • (2024)Beyond Text and Speech in Conversational Agents: Mapping the Design Space of AvatarsProceedings of the 2024 ACM Designing Interactive Systems Conference10.1145/3643834.3661563(1875-1894)Online publication date: 1-Jul-2024
  • (2024)Speech-to-SQL: toward speech-driven SQL query generation from natural language questionThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778-024-00837-033:4(1179-1201)Online publication date: 16-Feb-2024
  • (2023)Managing Delays in Human-Robot InteractionACM Transactions on Computer-Human Interaction10.1145/356989030:4(1-42)Online publication date: 12-Sep-2023
  • Show More Cited By

Index Terms

  1. Understanding User Perceptions of Robot's Delay, Voice Quality-Speed Trade-off and GUI during Conversation

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CHI EA '20: Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing Systems
    April 2020
    4474 pages
    ISBN:9781450368193
    DOI:10.1145/3334480
    Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 25 April 2020

    Check for updates

    Author Tags

    1. delay
    2. graphic user interface
    3. human-robot interaction
    4. speech synthesis
    5. voice

    Qualifiers

    • Abstract

    Conference

    CHI '20
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 6,164 of 23,696 submissions, 26%

    Upcoming Conference

    CHI 2025
    ACM CHI Conference on Human Factors in Computing Systems
    April 26 - May 1, 2025
    Yokohama , Japan

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)75
    • Downloads (Last 6 weeks)7
    Reflects downloads up to 15 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Beyond Text and Speech in Conversational Agents: Mapping the Design Space of AvatarsProceedings of the 2024 ACM Designing Interactive Systems Conference10.1145/3643834.3661563(1875-1894)Online publication date: 1-Jul-2024
    • (2024)Speech-to-SQL: toward speech-driven SQL query generation from natural language questionThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778-024-00837-033:4(1179-1201)Online publication date: 16-Feb-2024
    • (2023)Managing Delays in Human-Robot InteractionACM Transactions on Computer-Human Interaction10.1145/356989030:4(1-42)Online publication date: 12-Sep-2023
    • (2023)Sustainable cloud services for verbal interaction with embodied agentsIntelligent Service Robotics10.1007/s11370-023-00485-316:5(599-618)Online publication date: 22-Sep-2023
    • (2022)Understanding User Perceptions of Response Delays in Crowd-Powered Conversational SystemsProceedings of the ACM on Human-Computer Interaction10.1145/35557656:CSCW2(1-42)Online publication date: 11-Nov-2022
    • (2022)A Platform for Deploying the TFE Ecosystem of Automatic Speech RecognitionProceedings of the 30th ACM International Conference on Multimedia10.1145/3503161.3547731(6952-6954)Online publication date: 10-Oct-2022
    • (2021)FPGA-Based Voice Encryption Equipment under the Analog Voice Communication ChannelInformation10.3390/info1211045612:11(456)Online publication date: 4-Nov-2021

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media