Integrating Semantics into Multimodal Interaction Patterns

Taib, Ronnie; Ruiz, Natalie

doi:10.1007/978-3-540-78155-4_9

Ronnie Taib^1,2 &
Natalie Ruiz^1,2

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4892))

Included in the following conference series:

International Workshop on Machine Learning for Multimodal Interaction

1013 Accesses
2 Citations

Abstract

A user experiment on multimodal interaction (speech, hand position and hand shapes) to study two major relationships: between the level of cognitive load experienced by users and the resulting multimodal interaction patterns; and how the semantics of the information being conveyed affected those patterns. We found that as cognitive load increases, users’ multimodal productions tend to become semantically more complementary and less redundant across modalities. This validates cognitive load theory as a theoretical background for understanding the occurrence of particular kinds of multimodal productions. Moreover, results indicate a significant relationship between the temporal multimodal integration pattern (7 patterns in this experiment) and the semantics of the command being issued by the user (4 types of commands), shedding new light on previous research findings that assign a unique temporal integration pattern to any given subject regardless of the communication taking place.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Baddeley, A.D.: Working Memory. Science 255, 5044, 556–559 (1992)
Article Google Scholar
Bolt, R.A.: Put-That-There: Voice and Gesture at the Graphics Interface. In: 7th annual conference on Computer Graphics and Interactive Techniques, Seattle, Washington, United States, pp. 262–270. ACM Press, New York (1980)
Chapter Google Scholar
Gupta, A.K., Anastasakos, T.: Dynamic Time Windows for Multimodal Input Fusion. In: Proc. 8th International Conference on Spoken Language Processing (INTERSPEECH 2004 - ICSLP), Jeju, Korea, October 4-8, 2004, pp. 1009–1012 (2004)
Google Scholar
Hauptmann, A.G.: Speech and Gestures for Graphic Image Manipulation. In: CHI 1989, SIGCHI Conference on Human Factors in Computing Systems: Wings for the Mind, pp. 241–245. ACM Press, New York (1989)
Chapter Google Scholar
Huang, X., Oviatt, S., Lunsford, R.: Combining user modeling and machine learning to predict users’ multimodal integration patterns. In: Renals, S., Bengio, S., Fiscus, J.G. (eds.) MLMI 2006. LNCS, vol. 4299, pp. 50–62. Springer, Heidelberg (2006)
Chapter Google Scholar
Kettebekov, S.: Exploiting Prosodic Structuring of Coverbal Gesticulation. In: ICMI 2004: 6th international conference on Multimodal interfaces, State College, PA, USA, October 13-15, 2004, pp. 105–112. ACM Press, New York (2004)
Chapter Google Scholar
Lisowska, A., Armstrong, S.: Multimodal input for meeting browsing and retrieval interfaces: preliminary findings. In: Renals, S., Bengio, S., Fiscus, J.G. (eds.) MLMI 2006. LNCS, vol. 4299, pp. 142–153. Springer, Heidelberg (2006)
Chapter Google Scholar
Oviatt, S., Coulston, R., Lunsford, R.: When Do We Interact Multimodally? Cognitive Load and Multimodal Communication Pattern. In: ICMI 2004: 6th international conference on Multimodal interfaces, State College, PA, USA, October 13-15, 2004, pp. 129–136. ACM Press, New York (2004)
Chapter Google Scholar
Oviatt, S., Coulston, R., Tomko, S., Xiao, B., Lunsford, R., Wesson, M., Carmichael, L.: Toward a Theory of Organized Multimodal Integration Patterns during Human-Computer Interaction. In: ICMI 2003, 5th international conference on Multimodal interfaces, Vancouver, British Columbia, Canada, November 05-07, 2003, pp. 44–51. ACM Press, New York (2003)
Chapter Google Scholar
Oviatt, S., DeAngeli, A., Kuhn, K.: Integration and Synchronization of Input Modes During Multimodal Human-Computer Interaction. In: SIGCHI conference on Human factors in computing systems, Atlanta, Georgia, United States, March 22-27, 1997, pp. 415–422 (1997)
Google Scholar
Paas, F., Tuovinen, J.E., Tabbers, H., Van Gerven, P.W.M.: Cognitive Load Measurement as a Means to Advance Cognitive Load Theory. Educational psychologist 38(1), 63–71 (2003)
Article Google Scholar
Quek, F., McNeill, D., Bryll, R., Kirbas, C., Arlsan, H., McCullough, K.E., Furuyama, N., Gesture, A.R.: Speech, and gaze cues for discourse segmentation. In: IEEE conference on computer vision and pattern recognition (CVPR 2000), Hilton head island, South Carolina, USA, June 13-15, 2000, pp. 247–254 (2000)
Google Scholar
Ruiz, N., Taib, R., Chen, F.: Examining the redundancy of multimodal input. In: Proc. 20th annual conference of the Australian computer-human interaction special interest group (OzCHI 2006), Sydney, Australia, November 20-24, 2006, pp. 389–392 (2006)
Google Scholar
Salber, D., Coutaz, J.: Applying the Wizard of Oz technique to the Study of Multimodal Systems. In: Bass, L.J., Unger, C., Gornostaev, J. (eds.) EWHCI 1993. LNCS, vol. 753, pp. 219–230. Springer, Heidelberg (1993)
Google Scholar

Download references

Author information

Authors and Affiliations

ATP Research Laboratory, National ICT Australia, Locked Bag 9013, NSW 1435, Sydney, Australia
Ronnie Taib & Natalie Ruiz
School of Computer Science and Engineering, The University of New South Wales, NSW 2052, Sydney, Australia
Ronnie Taib & Natalie Ruiz

Authors

Ronnie Taib
View author publications
You can also search for this author in PubMed Google Scholar
Natalie Ruiz
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Andrei Popescu-Belis Steve Renals Hervé Bourlard

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Taib, R., Ruiz, N. (2008). Integrating Semantics into Multimodal Interaction Patterns. In: Popescu-Belis, A., Renals, S., Bourlard, H. (eds) Machine Learning for Multimodal Interaction. MLMI 2007. Lecture Notes in Computer Science, vol 4892. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78155-4_9

Download citation

DOI: https://doi.org/10.1007/978-3-540-78155-4_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-78154-7
Online ISBN: 978-3-540-78155-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics