
Integration model of eye-gaze, voice and manual response in multimodal user interface

Published in Journal of Computer Science and Technology

Abstract

This paper reports on the utility of eye-gaze, voice and manual response in the design of a multimodal user interface. A device- and application-independent user interface model (VisualMan) for 3D object selection and manipulation was developed and validated in a prototype interface based on a 3D cube manipulation task. The multimodal inputs are integrated in the prototype interface according to the priority of modalities and the interaction context. The implications of the model for virtual reality interfaces are discussed, and a virtual environment using the multimodal user interface model is proposed.
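To make the priority-and-context integration idea concrete, the sketch below is a hypothetical illustration, not the paper's actual VisualMan implementation: the names (InputEvent, CONTEXT_PRIORITY, integrate), the specific priority orderings, and the fusion time window are all assumptions added for illustration. It shows one plausible reading of the scheme, in which each modality produces timestamped events and conflicts among near-simultaneous events are resolved by a per-context priority ordering.

```python
# Hypothetical sketch of priority-based multimodal integration.
# NOT the paper's VisualMan code; all names, priority orderings,
# and the 0.5 s fusion window are illustrative assumptions.
from dataclasses import dataclass
from typing import Optional

@dataclass
class InputEvent:
    modality: str      # "gaze", "voice", or "manual"
    target: str        # object the event refers to, e.g. "cube_3"
    timestamp: float   # arrival time in seconds

# Assumed per-context priority orderings: lower index = higher priority.
CONTEXT_PRIORITY = {
    "selection":    ["gaze", "manual", "voice"],   # gaze picks out the object
    "manipulation": ["manual", "voice", "gaze"],   # hands dominate while moving it
}

def integrate(events: list[InputEvent], context: str,
              window: float = 0.5) -> Optional[InputEvent]:
    """Fuse events arriving within `window` seconds of the newest one,
    keeping the event from the highest-priority modality for this context."""
    if not events:
        return None
    priority = CONTEXT_PRIORITY[context]
    latest = max(e.timestamp for e in events)
    recent = [e for e in events if latest - e.timestamp <= window]
    return min(recent, key=lambda e: priority.index(e.modality))

# Usage: during selection, a gaze fixation outranks a near-simultaneous click.
events = [
    InputEvent("manual", "cube_1", 10.02),
    InputEvent("gaze",   "cube_3", 10.05),
]
print(integrate(events, "selection").target)  # -> cube_3
```

Under this reading, switching the interaction context (say, from selection to manipulation) reorders the priorities rather than changing the fusion logic, which is one way a single model can remain device- and application-independent.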



Author information


Corresponding author

Correspondence to Wang Jian.

Additional information

This research was supported by grants from the National Natural Science Foundation of China and the 863 High-Tech Programme. Part of the paper was first presented at the IEEE International Conference on Systems, Man and Cybernetics in Vancouver, British Columbia, Canada, in 1995, and at the International Workshop on Virtual Reality and Scientific Visualization in Hangzhou, China, in April 1995.

Wang Jian received his Ph.D. degree in Engineering Psychology from Hangzhou University in 1990. He is currently a professor of psychology and head of the Human Factors Laboratory at Hangzhou University. His research interests include user interface design in VR environments, the integration of eye tracking in user interface design, and cognitive processes in human-computer interaction.



Cite this article

Wang, J. Integration model of eye-gaze, voice and manual response in multimodal user interface. J. of Comput. Sci. & Technol. 11, 512–518 (1996). https://doi.org/10.1007/BF02947219
