Uniform Speech Recognition Platform for Evaluation of New Algorithms

  • Conference paper

Text, Speech and Dialogue (TSD 2002)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 2448)

Abstract

This paper presents the development of a speech recognition platform. Its main intended use is the evaluation of new and improved algorithms for speech recognition (noise reduction, feature extraction, language model generation, training of acoustic models, etc.). To enable wide usage of the platform, different test configurations were added, ranging from alphabet spelling to large vocabulary continuous speech recognition. The platform is currently implemented and evaluated using a studio speech database (SNABI) and a fixed telephone speech database (SpeechDat(II)).

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Žgank, A., Rotovnik, T., Kačič, Z., Horvat, B. (2002). Uniform Speech Recognition Platform for Evaluation of New Algorithms. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2002. Lecture Notes in Computer Science, vol 2448. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46154-X_47

  • DOI: https://doi.org/10.1007/3-540-46154-X_47

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44129-8

  • Online ISBN: 978-3-540-46154-8

  • eBook Packages: Springer Book Archive
