Abstract
In this paper, we report the LIUM participation in the ETAPE [1] (Évaluations en Traitement Automatique de la Parole) evaluation campaign, on the rich transcription task for French track. After describing the ETAPE goals and guidelines, we present our ASR system, which ranked first in the ETAPE evaluation campaign. Two ASR systems were used for our participation in ETAPE 2011. In addition to the LIUM ASR system based on CMU Sphinx project, we utilized an additional open-source ASR system based on the RASR toolkit. We evaluate, in this paper, the gain obtained with various acoustics modeling and adaptation techniques for each of the two systems, as well as with various system combination techniques. The combination of two different ASR systems allows a significant WER reduction, from 23.6% for the best single ASR system to 22.6% for the combination.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Gravier, G., Adda, G., Paulsson, N., Carré, M., Giraudel, A., Galibert, O.: The ETAPE corpus for the evaluation of speech-based TV content processing in the French language. In: International Conference on Language Resources, Evaluation and Corpora, LREC (May 2012)
Deléglise, P., Estève, Y., Meignier, S., Merlin, T.: Improvements to the LIUM French ASR system based on CMU Sphinx: what helps to significantly reduce the word error rate? In: Interspeech (2009)
Galliano, S., Geoffrois, E., Mostefa, D., Choukri, K., Bonastre, J.F., Gravier, G.: The ESTER phase II evaluation campaign for the rich transcription of French broadcast news. In: Eurospeech, European Conference on Speech Communication and Technology, Lisbon, Portugal (September 2005)
Meignier, S., Merlin, T.: LIUM SpkDiarization: an open source toolkit for diarization. In: ASRU (2010)
Zhu, Q., Chen, B., Morgan, N., Stolcke, A.: On using MLP features in LVCSR. In: Proc. ICSLP, Jeju, Korea, pp. 921–924 (2004)
Béchet, F.: LIA_PHON: un système complet de phonétisation de textes. Traitement Automatique des Langues – TAL 42, 47–67 (2001)
Stolcke, A.: SRILM – An extensible language modeling toolkit (2002)
Rybach, D., Gollan, C., Heigold, G., Hoffmeister, B., Lööf, J., Schlüter, R., Hermann, N.: The RWTH Aachen university open source speech recognition system. In: Interspeech, pp. 2111–2114 (2009)
Stüker, S., Fügen, C., Burger, S., Wölfel, M.: Cross-system adaptation and combination for continuous speech recognition: The influence of phoneme set and acoustic front-end?(2006)
Giuliani, D., Brugnara, F.: Experiments on cross-system acoustic model adaptation. In: ASRU, pp. 117–122 (2007)
Li, X., Singh, R., Stern, R.M.: Lattice combination for improved speech recogniton. In: Interspeech (2002)
Schwenk, H.: CSLM - A modular Open-Source Continuous Space Language Modeling Toolkit. In: Interspeech (2013)
Fiscus, J.G.: A post-processing system to yield reduced word error rates: Recognizer Output Voting Error Reduction (ROVER). In: IEEE Workshop on Automatic Speech Recognition and Understanding, pp. 347–352 (1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bougares, F., Deléglise, P., Estève, Y., Rouvier, M. (2013). LIUM ASR System for ETAPE French Evaluation Campaign: Experiments on System Combination Using Open-Source Recognizers. In: Habernal, I., Matoušek, V. (eds) Text, Speech, and Dialogue. TSD 2013. Lecture Notes in Computer Science(), vol 8082. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40585-3_41
Download citation
DOI: https://doi.org/10.1007/978-3-642-40585-3_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40584-6
Online ISBN: 978-3-642-40585-3
eBook Packages: Computer ScienceComputer Science (R0)