ABSTRACT
Several models and design spaces have been defined and are regularly used to describe how modalities can be fused together in an interactive multimodal system. However, models such as CASE, the CARE properties or TYCOON were all defined more than two decades ago. In this paper, we start with a critical review of these models, which notably highlights a confusion between the way the user side and the system side of a multimodal system are described. Based on this critical review, we define MMMM v1, an improved model for the description of multimodal fusion in interactive systems that targets completeness. A first user evaluation comparing the models revealed that MMMM v1 was indeed complete, but at the cost of user friendliness. Based on the results of this first evaluation, an improved version of MMMM, called MMMM v2, was defined. A second user evaluation showed that this model achieves a good balance between complexity, consistency and completeness compared to the state of the art.
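As a concrete illustration of the kind of fusion relationships these models describe, the sketch below encodes the four CARE properties (Complementarity, Assignment, Redundancy, Equivalence) of Coutaz et al. as a small Python vocabulary and uses it to annotate Bolt's classic "put-that-there" command, where speech and pointing complement each other. The class and field names are illustrative assumptions for this sketch, not the actual MMMM notation.

```python
from dataclasses import dataclass
from enum import Enum, auto
from typing import List


class CARE(Enum):
    """The four CARE properties of Coutaz et al. (1995)."""
    COMPLEMENTARITY = auto()  # modalities must be combined to form the command
    ASSIGNMENT = auto()       # a single modality is the only way to issue the command
    REDUNDANCY = auto()       # modalities convey the same information and are used together
    EQUIVALENCE = auto()      # any one of several modalities is sufficient on its own


@dataclass
class ModalityEvent:
    """One recognised input event on a given modality (hypothetical structure)."""
    modality: str     # e.g. "speech" or "pointing"
    content: str      # recognised token or pointed-at target
    timestamp: float  # seconds, used for time-window fusion


@dataclass
class FusionRule:
    """Describes how a set of modalities relates to one user command."""
    command: str
    modalities: List[str]
    relation: CARE


# Bolt's "put-that-there": speech supplies the verb, pointing supplies the
# source and destination, so the two modalities are complementary here.
put_that_there = FusionRule(
    command="move_object",
    modalities=["speech", "pointing"],
    relation=CARE.COMPLEMENTARITY,
)
```

A fusion engine, such as those surveyed in the references below, would typically match incoming streams of such modality events against rules of this kind within a time window; the sketch only captures the descriptive vocabulary, not the matching process.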
REFERENCES
- Richard A. Bolt. 1980. “Put-that-there”: Voice and Gesture at the Graphics Interface. In Proceedings of the 7th Annual Conference on Computer Graphics and Interactive Techniques (SIGGRAPH 1980). Seattle, USA, 262–270.
- Andrea Cherubini, Robin Passama, Philippe Fraisse, and André Crosnier. 2015. A Unified Multimodal Control Framework for Human-Robot Interaction. Robotics and Autonomous Systems 70 (2015), 106–115.
- Joëlle Coutaz, Laurence Nigay, Daniel Salber, Ann Blandford, Jon May, and Richard M. Young. 1995. Four Easy Pieces for Assessing the Usability of Multimodal Interaction: The CARE Properties. In Proceedings of the 5th International Conference on Human-Computer Interaction (Interact 1995). Lillehammer, Norway, 115–120.
- Fredy Cuenca, Jan Van den Bergh, Kris Luyten, and Karin Coninx. 2015. Hasselt UIMS: A Tool for Describing Multimodal Interactions with Composite Events. In Proceedings of the 7th ACM SIGCHI Symposium on Engineering Interactive Computing Systems. Duisburg, Germany, 226–229.
- Bruno Dumas, Rolf Ingold, and Denis Lalanne. 2009. Benchmarking Fusion Engines of Multimodal Interactive Systems. In Proceedings of the 2009 International Conference on Multimodal Interfaces. Cambridge, Massachusetts, USA, 169–176.
- Bruno Dumas, Denis Lalanne, and Rolf Ingold. 2010. Description Languages for Multimodal Interaction: A Set of Guidelines and its Illustration with SMUIML. Journal on Multimodal User Interfaces: “Special Issue on the Challenges of Engineering Multimodal Interaction” 3, 3 (February 2010), 237–247.
- Bruno Dumas, Denis Lalanne, and Sharon Oviatt. 2009. Multimodal Interfaces: A Survey of Principles, Models and Frameworks. In Human Machine Interaction: Research Results of the MMI Program. Springer-Verlag, Berlin, Heidelberg, 3–26.
- Lode Hoste, Bruno Dumas, and Beat Signer. 2011. Mudra: A Unified Multimodal Interaction Framework. In Proceedings of the 13th International Conference on Multimodal Interfaces (ICMI 2011). Alicante, Spain, 97–104.
- Marc Erich Latoschik. 2005. A User Interface Framework for Multimodal VR Interactions. In Proceedings of the 7th International Conference on Multimodal Interfaces. ACM, Trento, Italy, 76–83.
- Jean-Claude Martin. 1998. TYCOON: Theoretical Framework and Software Tools for Multimodal Interfaces. Intelligence and Multimodality in Multimedia Interfaces (1998), 1–25.
- Jean-Claude Martin and Dominique Béroule. 1999. TYCOON: Six Primitive Types of Cooperation for Observing, Evaluating and Specifying Cooperations. In Proceedings of the AAAI Fall 1999 Symposium on Psychological Models of Communication in Collaborative Systems, Vol. 16.
- David R. McGee, Philip R. Cohen, and Lizhong Wu. 2000. Something from Nothing: Augmenting a Paper-based Work Practice via Multimodal Interaction. In Proceedings of DARE 2000 on Designing Augmented Reality Environments. Elsinore, Denmark, 71–80.
- Laurence Nigay. 1994. Conception et modélisation logicielles des systèmes interactifs: application aux interfaces multimodales. Ph.D. Dissertation. Université Joseph-Fourier-Grenoble I.
- Laurence Nigay. 2004. Design Space for Multimodal Interaction. In Building the Information Society. Springer, 403–408.
- Laurence Nigay and Joëlle Coutaz. 1993. A Design Space for Multimodal Systems: Concurrent Processing and Data Fusion. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI 1993). 172–178.
- Donald A. Norman. 2013. The Design of Everyday Things: Revised and Expanded Edition. Basic Books.
- Sharon Oviatt. 1999. Ten Myths of Multimodal Interaction. Communications of the ACM 42, 11 (1999), 74–81.
- M. Serrano, L. Nigay, J.Y. Lawson, A. Ramsay, R. Murray-Smith, and S. Denef. 2008. The OpenInterface Framework: A Tool for Multimodal Interaction. In Proceedings of the 26th International Conference on Human Factors in Computing Systems (CHI 2008). Florence, Italy, 3501–3506.