skip to main content
10.1145/1040830.1040880acmconferencesArticle/Chapter ViewAbstractPublication PagesiuiConference Proceedingsconference-collections
Article

CASIS: a context-aware speech interface system

Published: 10 January 2005 Publication History

Abstract

In this paper, we propose a robust natural language interface called CASIS for controlling devices in an intelligent environment. CASIS is novel in a sense that it integrates physical context acquired from the sensors embedded in the environment with traditionally used context to reduce the system error rate and disambiguate deictic references and elliptical inputs. The n-best result of the speech recognizer is re-ranked by a score calculated using a Bayesian network consisting of information from the input utterance and context. In our prototype system that uses device states, brightness, speaker location, chair occupancy, speech direction and action history as context, the system error rate has been reduced by 41% compared to a baseline system that does not leverage on context information.

References

[1]
K. Cheverst, et al, Developing a Context-aware Electronic Tourist Guide: Some Issues and Experiences, In Proceedings of CHI 2000, pp 17--24, 2000.
[2]
M. Coen, L. Weisman, K. Thomas, and M. Groh, A Context Sensitive Natural Language Modality for an Intelligent Room, In 1st International Workshop on Managing Interactions in Smart Environments (MANSE'99), pp.68--79. Dublin, Ireland, December 1999.
[3]
L. Deng and X. Huang, Challenges in Adopting Speech Recognition, Communications of the ACM January 2004, pp 69--75, 2004.
[4]
A. K. Dey, Understanding and Using Context, Personal and Ubiquitous Computing Journal, Volume 5(1), pp 4--7, 2001.
[5]
I. Gurevych, R. Malaka, R. Porzel, and H. Zorn, Semantic Coherence Scoring Using an Ontology, In Proceedings of the HLT-NAACL Conference, 2003.
[6]
K. Nagao and J. Rekimoto, Ubiquitous Talker: Spoken Language Interaction with Real World Objects, In Proceedings of the International Joint Conference on Artificial Intelligence, 1995.
[7]
S. Oviatt, Breaking the Robustness Barrier: Recent Progress on the Design of Robust Multimodal Systems, Advances in Computers, Vol. 56 pp 305--341, 2002.
[8]
R. Pieraccini, et al., A Multimodal Conversational Interface for a Concept Vehicle, In Proceedings of Eurospeech 2003, September 2003.
[9]
R. Porzel and I. Gurevych, Contextual Coherence in Natural Language Processing, CONTEXT 2003, LNAI 2680, Springer-Verlag, pp 272--285, 2003.
[10]
S. S. Pradhan and W. H. Ward, Estimating Semantic Confidence for Spoken Dialogue Systems, In Proceedings of ICASSP2002, pp 233--236, 2002.
[11]
A. Rudnicky, et al, Creating Natural Dialogs in the Carnegie Mellon Communicator System, In Proceedings of Eurospeech 1999, pp 1531--1534, 1999.
[12]
D. Siewiorek, et al, SenSay: A Context-Aware Mobile Phone, In Proceedings of 7th IEE Symposium on Wearable Computers, 2003.
[13]
S. Seneff, Response Planning and Generation in the MERCURY Flight Reservation System, Computer Speech and Language 16, pp 283--312, 2002.
[14]
A. Stent, J. Dowding, J. M. Gawron, E. O. Bratt, R. Moore, The CommandTalk Spoken Dialogue System, Proceedings of the 37th Annual Meeting of the ACL, pp 183--190, 1999.
[15]
T-Engine Forum. http://www.t-engine.org/
[16]
E. M. Tapia, S. S. Intille, and K. Larson, Activity Recognition in the Home Using Simple and Ubiquitous Sensors, PERVASIVE 2004, LNCS 3001, pp 158--175, 2004.
[17]
C. Wai, R. Pieraccini, and H. M. Meng, A Dynamic Semantic Model for Re-scoring Recognition Hypotheses, In Proceedings of ICASSP2001, pp 589--592, 2001.
[18]
Y. Wang, A. Acero, C. Chelba, B. Frey, and L. Wong, Combination of Statistical and Rule-based Approaches for Spoken Language Understanding, In Proc. Int. Conf. on Spoken Language Processing. Denver, Colorado, Sep, 2002.
[19]
M. Weiser, Some Computer Sciences Issues in Ubiquitous Computing, Communications of the ACM Vol. 36 No. 2, pp 75--84, 1993.
[20]
A. Wilson, S. Shafer, XWand: UI for Intelligent Spaces, In Proceedings of SIGCHI 2003, pp 545--552, 2003.
[21]
V. Zue, et al, JUPITER: A telephone-based conversational interface for weather information, IEEE Trans. on Speech and Audio Processing, Vol. 8, No.1 pp 100--112, 2000.

Cited By

View all
  • (2024)Cooking With Agents: Designing Context-aware Voice InteractionProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642183(1-13)Online publication date: 11-May-2024
  • (2023)Conversational Interfaces in IoT Ecosystems: Where We Are, What Is Still MissingProceedings of the 22nd International Conference on Mobile and Ubiquitous Multimedia10.1145/3626705.3627775(279-293)Online publication date: 3-Dec-2023
  • (2018)RuleSelectorProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/31917672:1(1-34)Online publication date: 26-Mar-2018
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
IUI '05: Proceedings of the 10th international conference on Intelligent user interfaces
January 2005
344 pages
ISBN:1581138946
DOI:10.1145/1040830
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 January 2005

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Bayesian network
  2. context-aware computing
  3. natural language processing
  4. speech user interface

Qualifiers

  • Article

Conference

IUI05
IUI05: Tenth International Conference on Intelligent User Interfaces
January 10 - 13, 2005
California, San Diego, USA

Acceptance Rates

Overall Acceptance Rate 746 of 2,811 submissions, 27%

Upcoming Conference

IUI '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)15
  • Downloads (Last 6 weeks)3
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Cooking With Agents: Designing Context-aware Voice InteractionProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642183(1-13)Online publication date: 11-May-2024
  • (2023)Conversational Interfaces in IoT Ecosystems: Where We Are, What Is Still MissingProceedings of the 22nd International Conference on Mobile and Ubiquitous Multimedia10.1145/3626705.3627775(279-293)Online publication date: 3-Dec-2023
  • (2018)RuleSelectorProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/31917672:1(1-34)Online publication date: 26-Mar-2018
  • (2014)Context formalization and its use on dynamic adaptation of language model in ASR systemsProceedings of the 7th Euro American Conference on Telematics and Information Systems10.1145/2590651.2590655(1-6)Online publication date: 2-Apr-2014
  • (2013)Contextual partitioning for speech recognitionACM Transactions on Embedded Computing Systems10.1145/2501626.250163913:1(1-20)Online publication date: 5-Sep-2013
  • (2011)Augmenting Context Awareness by Combining Body Sensor Networks and Social NetworksIEEE Transactions on Instrumentation and Measurement10.1109/TIM.2010.208419060:2(345-353)Online publication date: Feb-2011
  • (2009)A context-aware autonomic packet marking mechanism2009 2nd IEEE International Conference on Broadband Network & Multimedia Technology10.1109/ICBNMT.2009.5347819(38-43)Online publication date: Oct-2009
  • (2007)Disambiguating speech commands using physical contextProceedings of the 9th international conference on Multimodal interfaces10.1145/1322192.1322235(247-254)Online publication date: 12-Nov-2007
  • (2006)Language-Derived Information and Context ModelsProceedings of the 4th annual IEEE international conference on Pervasive Computing and Communications Workshops10.1109/PERCOMW.2006.72Online publication date: 13-Mar-2006
  • (2006)Beyond traditional interaction in a mobile environment: New approach to 3D scene renderingComputers & Graphics10.1016/j.cag.2006.07.02230:5(714-726)Online publication date: Oct-2006
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media