Skip to main content

Comparison and Combination of Confidence Measures

  • Conference paper
  • First Online:
Text, Speech and Dialogue (TSD 2002)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2448))

Included in the following conference series:

  • 583 Accesses

Abstract

A set of features for word-level confidence estimation is developed. The features should be easy to implement and should require no additional knowledge beyond the information which is available from the speech recognizer and the training data. We compare a number of features based on a common scoring method, the normalized cross entropy. We also study different ways to combine the features. An artificial neural network leads to the best performance, and a recognition rate of 76% is achieved. The approach is extended not only to detect recognition errors but also to distinguish between insertion and substitution errors.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. L. Chase: Error-Responsive Feedback Mechanisms for Speech Recognition. Ph.D. Thesis, Carnegie Mellon University (1997).

    Google Scholar 

  2. B. Mison and R. Gopinath: Robust Confidence Annotation and Rejection for Continuous Speech Recognition Proc. IEEE ICASSP (2001) Vol. 1.

    Google Scholar 

  3. E. Eide and H. Gish and P. Jeanrenaud and A. Mielke: Understanding and Improving Speech Recognition Performance through the Use of Diagnostic Tools. Proc. IEEE ICASSP (1995) Vol. 1, 221–224.

    Google Scholar 

  4. S. Cox and R. C. Rose: Confidence Measures for the SWITCHBOARD Database. Proc. IEEE ICASSP (1996) Vol. 1, 511–514.

    Google Scholar 

  5. F. Wessel and K. Macherey and R. Schlüter: UsingWord Probabilities as Confidence Measures. Proc. IEEE ICASSP (1998) Vol. 1, 225–228.

    Google Scholar 

  6. W. Wahlster: Verbmobil: Foundations of Speech-to-Speech Translation. Springer (2000).

    Google Scholar 

  7. S. Steidl: Konfidenzbewertung von Worthypothesen. Student Thesis (in German), Chair for Pattern Recognition, University of Erlangen-Nürnberg (2001).

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Stemmer, G., Steidl, S., Nöth, E., Niemann, H., Batliner, A. (2002). Comparison and Combination of Confidence Measures. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2002. Lecture Notes in Computer Science(), vol 2448. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46154-X_25

Download citation

  • DOI: https://doi.org/10.1007/3-540-46154-X_25

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44129-8

  • Online ISBN: 978-3-540-46154-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics