Skip to main content

The Rich Transcription 2007 Meeting Recognition Evaluation

  • Conference paper
Multimodal Technologies for Perception of Humans (RT 2007, CLEAR 2007)

Abstract

We present the design and results of the Spring 2007 (RT-07) Rich Transcription Meeting Recognition Evaluation; the fifth in a series of community-wide evaluations of language technologies in the meeting domain. For 2007, we supported three evaluation tasks: Speech-To-Text (STT) transcription, “Who Spoke When” Diarization (SPKR), and Speaker Attributed Speech-To-Text (SASTT). The SASTT task, which combines STT and SPKR tasks, was a new evaluation task. The test data consisted of three test sets: Conference Meetings, Lecture Meetings, and Coffee Breaks from lecture meetings. The Coffee Break data was included as a new test set this year. Twenty-one research sites materially contributed to the evaluation by providing data or building systems. The lowest STT word error rates with up to four simultaneous speakers in the multiple distant microphone condition were 40.6 %, 49.8 %, and 48.4 % for the conference, lecture, and coffee break test sets respectively. For the SPKR task, the lowest diarization error rates for all speech in the multiple distant microphone condition were 8.5 %, 25.8 %, and 25.5 % for the conference, lecture, and coffee break test sets respectively. For the SASTT task, the lowest speaker attributed word error rates for segments with up to three simultaneous speakers in the multiple distant microphone condition were 40.3 %, 59.3 %, and 68.4 % for the conference, lecture, and coffee break test sets respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Fiscus, et al.: Results of the Fall 2004 STT and MDE Evaluation. In: RT-2004F Evaluation Workshop Proceedings, November 7-10 (2004)

    Google Scholar 

  2. Garofolo, et al.: The Rich Transcription 2004 Spring Meeting Recognition Evaluation. In: ICASSP 2004 Meeting Recognition Workshop, May 17 (2004)

    Google Scholar 

  3. The (RT-07) Rich Transcription Meeting Recognition Evaluation Plan (2007), http://www.nist.gov/speech/tests/rt/rt2007

  4. LDC Meeting Recording Transcription, http://www.ldc.upenn.edu/Projects/Transcription/NISTMeet

  5. SCTK toolkit, http://www.nist.gov/speech/tools/index.htm

  6. Garofolo, J.S., Fiscus, J.G., Radde, N., Le, A., Ajot, J., Laprun, C.: The Rich Transcription 2005 Spring Meeting Recognition Evaluation. In: Renals, S., Bengio, S. (eds.) MLMI 2005. LNCS, vol. 3869, pp. 369–389. Springer, Heidelberg (2006)

    Google Scholar 

  7. http://www.clear-evaluation.org/

  8. Fiscus, et al.: Multiple Dimension Levenshtein Distance Calculations for Evaluating Automatic Speech Recognition Systems During Simultaneous Speech. In: LREC 2006: Sixth International Conference on Language Resources and Evaluation (2006)

    Google Scholar 

  9. http://isl.ira.uka.de/clear06/downloads/ClearEval_Protocol_v5.pdf

  10. Fiscus, J., Ajot, J., Michel, M., Garofolo, J.: The Rich Transcription 2006 Spring Meeting Recognition Evaluation. In: Renals, S., Bengio, S., Fiscus, J.G. (eds.) MLMI 2006. LNCS, vol. 4299, Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  11. Burger, S.: The CHIL RT07 Evaluation Data. In: The Joint Proceedings of the 2006 CLEAR and RT Evaluations (May 2007)

    Google Scholar 

  12. Lammie Glenn, M., Strassel, S.: Shared Linguistic Resources for the Meeting Domain. In: The Joint Proceedings of the 2006 CLEAR and RT Evaluations (May 2007)

    Google Scholar 

  13. Wooters, C., Fung, J., Peskin, B., Anguera, X.: Towards Robust Speaker Segmentation: The ICSI-SRI Fall 2004 Diarization System. In: RT-2004F Workshop (November 2004)

    Google Scholar 

  14. Stiefelhagen, R., Bernardin, K., Bowers, R., Garofolo, J., Mostefa, D., Soundararajan, P.: The CLEAR 2006 Evaluation, Proceedings of the first International CLEAR Evaluation Workshop. In: Stiefelhagen, R., Garofolo, J.S. (eds.) CLEAR 2006. LNCS, vol. 4122, pp. 1–45. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  15. StainStiefelhagen, R., Bowers, R., Rose, R.: Results of the CLEAR 2007 Evaluation. In: The Joint Proceedings of the 2006 CLEAR and RT Evaluations (May 2007)

    Google Scholar 

  16. http://www.nist.gov/dads/HTML/HungarianAlgorithm.html

  17. http://www.nist.gov/speech/tests/sigtests/mapsswe.htm

  18. Stanford, V.: The NIST Mark-III microphone array - infrastructure, reference data, and metrics. In: Proceedings International Workshop on Microphone Array Systems - Theory and Practice, Pommersfelden, Germany (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Rainer Stiefelhagen Rachel Bowers Jonathan Fiscus

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Fiscus, J.G., Ajot, J., Garofolo, J.S. (2008). The Rich Transcription 2007 Meeting Recognition Evaluation. In: Stiefelhagen, R., Bowers, R., Fiscus, J. (eds) Multimodal Technologies for Perception of Humans. RT CLEAR 2007 2007. Lecture Notes in Computer Science, vol 4625. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68585-2_36

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-68585-2_36

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-68584-5

  • Online ISBN: 978-3-540-68585-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics