ABSTRACT
This research proposes a multimodal fusion framework for high-level data integration across two or more modalities. The framework takes as input low-level features extracted from different system devices and analyzes them through dedicated processes running in parallel to identify intrinsic meanings in the data. The extracted meanings are then mutually compared to identify complementarities, ambiguities, and inconsistencies, in order to better understand the user's intention when interacting with the system. The whole fusion lifecycle is described and evaluated in an ambient intelligence scenario in which two co-workers interact by voice and movement, expressing their intentions, and the system offers advice according to the identified needs.
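The comparison step can be pictured as a pairwise classification of cross-modal meanings. The sketch below is an illustrative reconstruction in Python, not the paper's implementation; the Meaning structure, the analyzer functions, and the way the complementary/ambiguous/inconsistent labels are assigned are assumptions made for the example.

```python
# Illustrative sketch (not the paper's API): meanings are extracted from each
# modality by dedicated processes running in parallel, then compared pairwise
# to flag complementarities, ambiguities and inconsistencies before the user's
# intention is interpreted.
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from itertools import product


@dataclass
class Meaning:
    modality: str   # e.g. "speech" or "movement"
    referent: str   # entity the meaning is about ("?" if underspecified)
    predicate: str  # what is asserted about that entity


def analyze_speech(features):
    # stand-in for a speech-understanding process (e.g. ASR + parsing);
    # here the user says "print that", leaving the referent unresolved
    return [Meaning("speech", "?", "print")]


def analyze_movement(features):
    # stand-in for a vision/movement-understanding process;
    # here the user points at a report
    return [Meaning("movement", "report", "points_at")]


def compare(a: Meaning, b: Meaning) -> str:
    """Classify a cross-modal pair of extracted meanings."""
    if a.referent == "?" or b.referent == "?":
        return "ambiguous"      # underspecified; the other modality may resolve it
    if a.referent == b.referent and a.predicate != b.predicate:
        return "inconsistent"   # the modalities disagree about the same entity
    return "complementary"      # the meanings can be merged into one intention


def fuse(speech_features, movement_features):
    # run the modality analyzers in parallel, then compare their outputs pairwise
    with ThreadPoolExecutor() as pool:
        speech = pool.submit(analyze_speech, speech_features)
        movement = pool.submit(analyze_movement, movement_features)
        speech, movement = speech.result(), movement.result()
    return [(a, b, compare(a, b)) for a, b in product(speech, movement)]


if __name__ == "__main__":
    for a, b, relation in fuse(speech_features=[], movement_features=[]):
        print(f"{a.modality}:{a.predicate}({a.referent}) vs "
              f"{b.modality}:{b.predicate}({b.referent}) -> {relation}")
```

In this toy run the speech meaning is flagged as ambiguous, and a subsequent resolution step could substitute the pointed-at referent from the movement modality, turning the pair into a single complementary intention.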