Uniform Speech Recognition Platform for Evaluation of New Algorithms

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 2448)

Abstract

This paper presents the development of a speech recognition platform. Its main purpose is the evaluation of new and improved speech recognition algorithms (noise reduction, feature extraction, language model generation, training of acoustic models, ...). To enable wide usage of the platform, several test configurations were added, ranging from alphabet spelling to large vocabulary continuous speech recognition. At present, the platform is implemented and evaluated using a studio speech database (SNABI) and a fixed telephone speech database (SpeechDat(II)).

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Žgank, A., Rotovnik, T., Kačič, Z., Horvat, B. (2002). Uniform Speech Recognition Platform for Evaluation of New Algorithms. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2002. Lecture Notes in Computer Science, vol 2448. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46154-X_47

  • DOI: https://doi.org/10.1007/3-540-46154-X_47

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44129-8

  • Online ISBN: 978-3-540-46154-8

  • eBook Packages: Springer Book Archive
