The Rich Transcription 2007 Meeting Recognition Evaluation

Fiscus, Jonathan G.; Ajot, Jerome; Garofolo, John S.

doi:10.1007/978-3-540-68585-2_36

Jonathan G. Fiscus¹,
Jerome Ajot^1,2 &
John S. Garofolo¹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4625))

Included in the following conference series:

1337 Accesses
31 Citations

Abstract

We present the design and results of the Spring 2007 (RT-07) Rich Transcription Meeting Recognition Evaluation; the fifth in a series of community-wide evaluations of language technologies in the meeting domain. For 2007, we supported three evaluation tasks: Speech-To-Text (STT) transcription, “Who Spoke When” Diarization (SPKR), and Speaker Attributed Speech-To-Text (SASTT). The SASTT task, which combines STT and SPKR tasks, was a new evaluation task. The test data consisted of three test sets: Conference Meetings, Lecture Meetings, and Coffee Breaks from lecture meetings. The Coffee Break data was included as a new test set this year. Twenty-one research sites materially contributed to the evaluation by providing data or building systems. The lowest STT word error rates with up to four simultaneous speakers in the multiple distant microphone condition were 40.6 %, 49.8 %, and 48.4 % for the conference, lecture, and coffee break test sets respectively. For the SPKR task, the lowest diarization error rates for all speech in the multiple distant microphone condition were 8.5 %, 25.8 %, and 25.5 % for the conference, lecture, and coffee break test sets respectively. For the SASTT task, the lowest speaker attributed word error rates for segments with up to three simultaneous speakers in the multiple distant microphone condition were 40.3 %, 59.3 %, and 68.4 % for the conference, lecture, and coffee break test sets respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Fiscus, et al.: Results of the Fall 2004 STT and MDE Evaluation. In: RT-2004F Evaluation Workshop Proceedings, November 7-10 (2004)
Google Scholar
Garofolo, et al.: The Rich Transcription 2004 Spring Meeting Recognition Evaluation. In: ICASSP 2004 Meeting Recognition Workshop, May 17 (2004)
Google Scholar
The (RT-07) Rich Transcription Meeting Recognition Evaluation Plan (2007), http://www.nist.gov/speech/tests/rt/rt2007
LDC Meeting Recording Transcription, http://www.ldc.upenn.edu/Projects/Transcription/NISTMeet
SCTK toolkit, http://www.nist.gov/speech/tools/index.htm
Garofolo, J.S., Fiscus, J.G., Radde, N., Le, A., Ajot, J., Laprun, C.: The Rich Transcription 2005 Spring Meeting Recognition Evaluation. In: Renals, S., Bengio, S. (eds.) MLMI 2005. LNCS, vol. 3869, pp. 369–389. Springer, Heidelberg (2006)
Google Scholar
http://www.clear-evaluation.org/
Fiscus, et al.: Multiple Dimension Levenshtein Distance Calculations for Evaluating Automatic Speech Recognition Systems During Simultaneous Speech. In: LREC 2006: Sixth International Conference on Language Resources and Evaluation (2006)
Google Scholar
http://isl.ira.uka.de/clear06/downloads/ClearEval_Protocol_v5.pdf
Fiscus, J., Ajot, J., Michel, M., Garofolo, J.: The Rich Transcription 2006 Spring Meeting Recognition Evaluation. In: Renals, S., Bengio, S., Fiscus, J.G. (eds.) MLMI 2006. LNCS, vol. 4299, Springer, Heidelberg (2006)
Chapter Google Scholar
Burger, S.: The CHIL RT07 Evaluation Data. In: The Joint Proceedings of the 2006 CLEAR and RT Evaluations (May 2007)
Google Scholar
Lammie Glenn, M., Strassel, S.: Shared Linguistic Resources for the Meeting Domain. In: The Joint Proceedings of the 2006 CLEAR and RT Evaluations (May 2007)
Google Scholar
Wooters, C., Fung, J., Peskin, B., Anguera, X.: Towards Robust Speaker Segmentation: The ICSI-SRI Fall 2004 Diarization System. In: RT-2004F Workshop (November 2004)
Google Scholar
Stiefelhagen, R., Bernardin, K., Bowers, R., Garofolo, J., Mostefa, D., Soundararajan, P.: The CLEAR 2006 Evaluation, Proceedings of the first International CLEAR Evaluation Workshop. In: Stiefelhagen, R., Garofolo, J.S. (eds.) CLEAR 2006. LNCS, vol. 4122, pp. 1–45. Springer, Heidelberg (2007)
Chapter Google Scholar
StainStiefelhagen, R., Bowers, R., Rose, R.: Results of the CLEAR 2007 Evaluation. In: The Joint Proceedings of the 2006 CLEAR and RT Evaluations (May 2007)
Google Scholar
http://www.nist.gov/dads/HTML/HungarianAlgorithm.html
http://www.nist.gov/speech/tests/sigtests/mapsswe.htm
Stanford, V.: The NIST Mark-III microphone array - infrastructure, reference data, and metrics. In: Proceedings International Workshop on Microphone Array Systems - Theory and Practice, Pommersfelden, Germany (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

National Institute Of Standards and Technology, 100 Bureau Drive Stop 8940, Gaithersburg, MD 20899,
Jonathan G. Fiscus, Jerome Ajot & John S. Garofolo
Systems Plus, Inc., One Research Court – Suite 360, Rockville, MD 20850,
Jerome Ajot

Authors

Jonathan G. Fiscus
View author publications
You can also search for this author in PubMed Google Scholar
Jerome Ajot
View author publications
You can also search for this author in PubMed Google Scholar
John S. Garofolo
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Rainer Stiefelhagen Rachel Bowers Jonathan Fiscus

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Fiscus, J.G., Ajot, J., Garofolo, J.S. (2008). The Rich Transcription 2007 Meeting Recognition Evaluation. In: Stiefelhagen, R., Bowers, R., Fiscus, J. (eds) Multimodal Technologies for Perception of Humans. RT CLEAR 2007 2007. Lecture Notes in Computer Science, vol 4625. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68585-2_36

Download citation

DOI: https://doi.org/10.1007/978-3-540-68585-2_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68584-5
Online ISBN: 978-3-540-68585-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics