Abstract
This paper proposes a user interface model named GVUI, which provides a design guidance for the voice assistant running on smart display. The theory of GVUI is divided into three parts. First, a multi-modal structure is built, which allows each modality to play to its strength and cooperate with each other. Second, three principles of conversational user interface are proposed, including intuitive, inclusive and stress-free, and 15 tips are provided to explain how to achieve these goals. Third, the paper describes a card-based visual reply design pattern and introduces three types of arrangement for multiple results. In the conclusion based on the practice of weather query function, it is proved that an intuitive and efficient user experience of voice can be gained via GVUI.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Porcheron, M., Fischer, J.E., Reeves, S., Sharples, S.: Voice interfaces in everyday life. In: 2018 CHI Conference on Human Factors in Computing Systems, pp. 1–12. Association for Computing Machinery, New York (2018)
Lewis, J.R.: Practical Speech User Interface Design. CRC Press, United States (2016)
Schnelle, D., Lyardet, F.: Voice user interface design patterns. In: EuroPLoP, pp. 287–316 (2006)
Hyerle, D.: Thinking maps: visual tools for activating habits of mind. In: Arthur, L.C., Bena, K. (eds.) Learning and leading with habits of mind, pp. 149–174. ASCD, Alexandria (2008)
Cohen, M.H., Michael, H.C., James, P.G., Jennifer, B.: Voice User Interface Design. Addison-Wesley Professional, Boston (2004)
Pearl, C.: Designing Voice User Interfaces: Principles of Conversational Experiences. O’Reilly Media, Sebastopol (2016)
Bentley, F., Luvogt, C., Silverman, M., Wirasinghe, R., White, B., Lottridge, D.: Understanding the long-term use of smart speaker assistants. In: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, p. 124. Association for Computing Machinery, New York (2018)
Apple Siri for developer. https://developer.apple.com/siri/
Alexa Presentation Language. https://developer.amazon.com/zh/blogs/alexa/post/1dee3fa0-8c5f-4179-ab7a-74545ead24ce/introducing-the-alexa-presentation-language-preview
Google Assistant Dialogflow and legacy Actions SDK. https://developers.google.com/assistant/conversational/df-asdk/rich-responses
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Zhou, B., Li, L. (2021). GVUI: Graphic-Assisted Voice User Interface Based on Multi-modal Human-Machine Conversation. In: Ahram, T.Z., Falcão, C.S. (eds) Advances in Usability, User Experience, Wearable and Assistive Technology. AHFE 2021. Lecture Notes in Networks and Systems, vol 275. Springer, Cham. https://doi.org/10.1007/978-3-030-80091-8_99
Download citation
DOI: https://doi.org/10.1007/978-3-030-80091-8_99
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-80090-1
Online ISBN: 978-3-030-80091-8
eBook Packages: EngineeringEngineering (R0)