Designing and Evaluating an Adaptive Spoken Dialogue System

Litman, Diane J.; Pan, Shimei

doi:10.1023/A:1015036910358

Designing and Evaluating an Adaptive Spoken Dialogue System

Published: June 2002

Volume 12, pages 111–137, (2002)
Cite this article

User Modeling and User-Adapted Interaction Aims and scope Submit manuscript

Diane J. Litman¹ &
Shimei Pan²

664 Accesses
81 Citations
3 Altmetric
Explore all metrics

Abstract

Spoken dialogue system performance can vary widely for different users, as well for the same user during different dialogues. This paper presents the design and evaluation of an adaptive version of TOOT, a spoken dialogue system for retrieving online train schedules. Based on rules learned from a set of training dialogues, adaptive TOOT constructs a user model representing whether the user is having speech recognition problems as a particular dialogue progresses. Adaptive TOOT then automatically adapts its dialogue strategies based on this dynamically changing user model. An empirical evaluation of the system demonstrates the utility of the approach.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Bell, L. and Gustafson, J.: 2000, Positive and Negative User Feedback in a Spoken Dialogue Corpus. In: Proc. 6th International Conference of Spoken Language Processing (ICSLP). Beijing, China, pp. 589–592.
Bouwman, A. G., Sturm, J. and Boves, L.: 1999, Incorporating confidence measures in the Dutch train timetable information system developed in the ARISE project. In: Proc. International Conference on Acoustics, Speech and Signal Processing, Vol. 1. Phoenix, pp. 493–496.
Google Scholar
Chin, D. N.: 2001, Empirical Evaluation of User Models and User-Adapted Systems. User Modeling and User-Adapted Interaction, pp. 181–194.
Chu-Carroll, J.: 2000, MIMIC: An Adaptive Mixed Initiative Spoken Dialogue System for Information Queries. In: Proc. Applied Natural Language Processing (ANLP), pp. 97–104.
Chu-Carroll, J. and Nickerson, J. S.: 2000, Evaluating Automatic Dialogue Strategy Adaptation for a Stoken Dialogue System. In: Proc. 1st Conference of the North American Chapter of the Association for Coputational Linguistics (NAACL), pp. 202–209.
Cohen, P.: 1995, Empirical Methods for Artificial Intelligence. MIT Press, Boston.
Google Scholar
Cohen, W.: 1996, Learning trees and rules with set-valued features. In: Proc. 13th National Conference on Artificial Intelligence (AAAI), pp. 709–716.
Danieli, M. and Gerbino, E.: 1996, Metrics for Evaluating Dialogue Strategies in a Spoken Language System. In: Proc. AAAI Spring Symposium on Empirical Methods in Discourse Interpretation and Generation, pp. 34–39.
Hirasawa, J., Miyazaki, N., Nakano, M. and Aikawa, K.: 2000, New Feature Parameters For Detecting Misunderstandings in a Spoken Dialogue System. In: Proc. 6th International Conference of Spoken Language Processing (ICSLP), Vol. 2, Beijing, China, pp. 154–157.
Google Scholar
Kamm, C., Litman, D. and Walker, M.: 1998, From Novice to Expert: The Effect of Tutorials on User Expertise with Spoken Dialogue Systems. In: Proc ICSLP, pp. 1211–1214.
Kamm, C., Narayanan, S., Dutton, D. and Ritenour, R.: 1997, Evaluating Spoken Dialog Systems for Telecommunication Services. In: Proc. European Conference on Speech Communication and Technology (EUROSPEECH), pp. 22–25.
Krahmer, E., Swerts, M., Theune, M. and Weegels, M.: 1999, Error Spotting in Human–Machine Interactions. In: Proc. European Conference on Speech Communication and Technology (EUROSPEECH), pp. 1423–1426.
Levin, E. and Pieraccini, R.: 1997, A Stochastic Model of Computer–Human Interaction for Learning Dialogue Strategies. In: Proc. European Conference on Speech Communication and Technology (EUROSPEECH), pp. 1883–1886.
Levow, G.-A: 1998, Characterizing and Recognizing Spoken Corrections in Human–Computer Dialogue. In: Proc. 36th Annual Meeting of the Association for Computational Linguistics and the 17th International Conference on Computational Linguistics (COLING-ACL), pp. 736–742.
Litman, D., Hirschberg, J. and Swerts, M.: 2000a, Predicting Automatic Speech Recognition Performance Using Prosodic Cues. In: Proc. 1st Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), pp. 218–225.
Litman, D., Pan, S. and Walker, M.: 1998, Evaluating Response Strategies in a Web-Based Spoken Dialogue Agent. In: Proc. ACL/COLING, pp. 780–786.
Litman, D. J.: 1998, Predicting Speech Recognition Performance from Dialogue Phenomena. Presented at the American Association for Artificial Intelligence Spring Symposium Series on Applying Machine Learning to Discourse Processing.
Litman, D. J., Kearns, M. S., Singh, S. and Walker, M. A.: 2000b, Automatic Optimization of Dialogue Management. In: Proc. of COLING 2000.
Litman, D. J. and Pan, S.: 1999, Empirically Evaluating an Adaptable Spoken Dialogue System. In: Proc. 7th International Conference on User Modeling (UM), pp. 55–64.
Litman, D. J., Walker, M. A. and Kearns, M. J.: 1999: Automatic Detection of Poor Speech Recognition at the Dialogue Level. In: Proc. 37th Annual Meeting of the Association for Computational Linguistics (ACL), pp. 309–316.
Monge, P. and Cappella, J. (eds): 1980, Multivariate Techniques in Human Communication Research. Academic Press, New York.
Google Scholar
Polifroni, J., Hirschman, L., Seneff, S. and Zue, V.: 1992, Experiments in Evaluating Interactive Spoken Language Systems. In: Proc. DARPA Speech and NL Workshop, pp. 28–33.
Shriberg, E., Wade, E. and Price, P.: 1992, Human–Machine Problem Solving Using Spoken Language Systems (SLS): Factors Affecting Performance and User Satisfaction. In: Proc. DARPA Speech and NL workshop, pp. 419–424.
Smith, R.W.: 1998, An Evaluation of Strategies for Selectively Verifying Utterance Meanings in Spoken Natural Lanauage Dialog. International Journal of Human–Computer Studies 48, 627–647.
Google Scholar
Strachan, L., Anderson, J., Sneesby, M. and Evans, M.: 1997, Pragmatic UserModelling in a Commercial Software System. In: Proc. UM97, pp. 189–200.
van Zanten, G. V.: 1999, User Modelling in Adaptive Dialogue Management. In: Proc. European Conference on Speech Communication and Technology (EUROSPEECH), pp. 1183–1186.
Walker, M., Fromer, J. and Narayanan, S.: 1998a, Learning Optimal Dialogue Strategies. A Case Study of a Spoken Dialogue Agent for E-mail. In: Proc. 36th Annual Meeting of the Association for Computational Linguistics and the 17th International Conference on Computational Linguistics (COLING-ACL), pp. 1345–1352.
Walker, M., Hindle, D., Fromer, J., Fabbrizio,G. D. and Mestel, C.: 1997a, Evaluating Competing Agent Strategies for a Voice E-mail Agent. In: Proc. European Conference on Speech Communication and Technology (EUROSPEECH), pp. 22–25.
Walker, M., Langkilde, I., Wright, J., Gorin, A. and Litman, D.: 2000a. Learning to Predict Problematic Situations in a Spoken Dialogue System: Experiments with HowMay I Help You?. In: Proceedings of the North American Meeting of the Association for Computational Linguistics, pp. 210–217.
Walker, M., Litman, D., Kamm, C. and Abella, A.: 1997b, PARADISE: A General Framework for Evaluating Spoken Dialogue Agents. In: Proc. ACL/EACL, pp. 271–280.
Walker, M., Litman, D., Kamm, C. and Abella, A.:1998b, Evaluating Spoken Dialogue Agents with PARADISE Two Case Studies. Computer Speech and Language, 12(3), pp. 317–347.
Google Scholar
Walker, M. A., Kamm, C. A. and Litman, D. J. 2000b, Towards Developing General Models of Usability with PARADISE. Natural Language Engineering: Special Issue on Best Practice in Spoken Dialogue Systems.
Weiss, S. M. and Kulikowski, C.: 1991, Computer Systems That Learn: Classification and Prediction Methods from Statistics, Neural Nets, Machine Learning, and Expert Systems. San Mateo, CA: Morgan Kaufmann.
Google Scholar
Zeljkovic, I.: 1996, Decoding Optimal State Sequences with Smooth State Likelihoods. In: Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 129–132.

Download references

Author information

Authors and Affiliations

Computer Science Department and LRDC, University of Pittsburgh, Pittsburgh, PA, 15260, USA
Diane J. Litman
IBM T.J. Watson Research Center, 30 Saw Mill River Road, Hawthorne, NY, 10532, USA
Shimei Pan

Authors

Diane J. Litman
View author publications
You can also search for this author in PubMed Google Scholar
Shimei Pan
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Litman, D.J., Pan, S. Designing and Evaluating an Adaptive Spoken Dialogue System. User Modeling and User-Adapted Interaction 12, 111–137 (2002). https://doi.org/10.1023/A:1015036910358

Download citation

Issue Date: June 2002
DOI: https://doi.org/10.1023/A:1015036910358

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Designing and Evaluating an Adaptive Spoken Dialogue System

Abstract

Access this article

Similar content being viewed by others

User-Centred Spoken Dialogue Management

Conclusion and Future Research Directions

Optimisation for POMDP-Based Spoken Dialogue Systems

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Navigation

Designing and Evaluating an Adaptive Spoken Dialogue System

Abstract

Access this article

Similar content being viewed by others

User-Centred Spoken Dialogue Management

Conclusion and Future Research Directions

Optimisation for POMDP-Based Spoken Dialogue Systems

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation