Abstract
This paper presents the development of a speech recognition platform. Its main area of use would be the evaluation of different new and improved algorithms for speech recognition (noise reduction, feature extraction, language model generation, training of acoustic models, . . . ). To enable wide usage of the platform, different test configurations were added — from alphabet spelling to large vocabulary continuous speech recognition. At the moment, this speech recognition platform is implemented and evaluated using a studio (SNABI) and a fixed telephone (SpeechDat(II)) speech database.
Preview
Unable to display preview. Download preview PDF.
References
Lindberg, B., Johansen, F.T., Warakagoda, N., Lehtinen, G., Kačič, Z., Žgank, A., Elenius, K., Salvi, G.: A noise robust multilingual reference recogniser based on SpeechDat(II). ICSLP 2000, Beijing, China, 2000.
Pols, L.C.W.: Evaluating the performance of speech input/output systems. A report of the ESPRITSAM project. Proc. DAGA’91, 139–150, Bochum, Germany, 1991.
Young, S.: The HTK Book (for HTK version 3.1). Cambridge University, 2001.
Kačič, Z., Horvat, B., Zögling A.: Issues in Design and Collection of Large Telephone Speech Corpus for Slovenian Language. Proc. LREC-2000, Athens, 2000.
Kaiser, J., Kačič, Z.: Development of the Slovenian SpeechDat database. Proc. LREC-1998, Granada, Spain, 1998.
Clarkson, P.R., Rosenfeld, R.: Statistical Language Modeling Using the CMU-Cambridge Toolkit. Proc. of the EuroSpeech’ 97, Rhodes, Greece, 1997.
van den Heuvel, H., Boves, L., Moreno, A., Omologo, M., Richard, G., Sanders, E.: 2001. Annotation in the SpeechDat Projects. International Journal of Speech Technology, 4(2):127–143.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Žgank, A., Rotovnik, T., Kačič, Z., Horvat, B. (2002). Uniform Speech Recognition Platform for Evaluation of New Algorithms. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2002. Lecture Notes in Computer Science(), vol 2448. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46154-X_47
Download citation
DOI: https://doi.org/10.1007/3-540-46154-X_47
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44129-8
Online ISBN: 978-3-540-46154-8
eBook Packages: Springer Book Archive