ABSTRACT
An important criterion for many intelligent user interfaces is that the interface be multimodal, that is, that it allow the user to interact with the system through a variety of input channels. In addition to user interface interactions per se, the system may need to process input from multiple channels in order to make decisions in response to interface interactions or for other related purposes.

The multimodal event parsing system described in our paper has been implemented in a working system called CERA, the Complex Event Recognition Architecture. CERA, developed under contract with NASA, has been used to identify complex events across multiple sensor channels in an advanced life support system demonstration project.

We will demonstrate:
The CERA event recognition language,
The CERA event recognition engine at work,
A custom development environment for writing and debugging CERA event recognizers,
Visualization tools for complex event display,
Integration of CERA with various toolkits and projects.
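To give a flavor of the kind of recognition the demonstration covers, the following is a minimal sketch of a complex-event recognizer over a merged, time-stamped stream from several sensor channels. It is not CERA's actual language or API; every name here (Event, SequenceRecognizer, the "airlock-cycle" pattern) is hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

@dataclass
class Event:
    """A time-stamped primitive event from one sensor channel."""
    channel: str
    label: str
    time: float

class SequenceRecognizer:
    """Toy complex-event recognizer (hypothetical; not CERA's API).

    Fires when the given (channel, label) pairs are observed in order,
    all within `window` seconds of the first matched event.
    """
    def __init__(self, name: str, pattern: List[Tuple[str, str]], window: float):
        self.name = name
        self.pattern = pattern
        self.window = window
        self.matched: List[Event] = []

    def feed(self, event: Event) -> Optional[str]:
        # Abandon a partial match that has grown older than the window.
        if self.matched and event.time - self.matched[0].time > self.window:
            self.matched = []
        want_channel, want_label = self.pattern[len(self.matched)]
        if event.channel == want_channel and event.label == want_label:
            self.matched.append(event)
            if len(self.matched) == len(self.pattern):
                self.matched = []
                return self.name  # complex event recognized
        return None

# Usage: recognize a hypothetical "airlock-cycle" event spanning two channels.
recognizer = SequenceRecognizer(
    name="airlock-cycle",
    pattern=[("door", "open"), ("pressure", "drop"), ("door", "closed")],
    window=30.0,
)
stream = [
    Event("door", "open", 0.0),
    Event("pressure", "drop", 4.2),
    Event("door", "closed", 9.5),
]
for ev in stream:
    if (hit := recognizer.feed(ev)) is not None:
        print("recognized complex event:", hit)
```

Note one design choice in the sketch: unrelated events arriving between pattern steps are simply ignored rather than resetting the partial match, since in a multi-channel setting the events of interest are naturally interleaved with traffic from other channels.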