Skip to main content

Integration of an On-line Kaldi Speech Recogniser to the Alex Dialogue Systems Framework

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8655))

Abstract

This paper describes the integration of an on-line Kaldi speech recogniser into the Alex Dialogue Systems Framework (ADSF). As the Kaldi OnlineLatgenRecogniser is written in C++, we first developed a Python wrapper for the recogniser so that the ADSF, written in Python, could interface with it. Training scripts for acoustic and language modelling were developed and integrated into ADSF, and acoustic and language models were build. Finally, optimal recogniser parameters were determined and evaluated. The dialogue system Alex with the new speech recogniser is evaluated on Public Transport Information (PTI) domain.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Skantze, G., Schlangen, D.: Incremental dialogue processing in a micro-domain. In: Proc. ECACL, pp. 745–753 (2009)

    Google Scholar 

  2. Akinobu, L.: Open-Source Large Vocabulary CSR Engine Julius (2014), http://julius.sourceforge.jp/en_index.php

  3. Allauzen, C., Riley, M., Schalkwyk, J., Skut, W., Mohri, M.: OpenFst: A general and efficient weighted finite-state transducer library. In: Holub, J., Žďárek, J. (eds.) CIAA 2007. LNCS, vol. 4783, pp. 11–23. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  4. Huggins-Daines, D., Kumar, M., Chan, A., Black, A., Ravishankar, M., Rudnicky, A.: Pocketsphinx: A free, real-time continuous speech recognition system for hand-held devices. In: Proc. ICASSP, pp. I–I (December 2006)

    Google Scholar 

  5. D. Povey, M. Hannemann, G. Boulianne, L. Burget, A. Ghoshal, M. Janda, M. Karafiát, S. Kombrink, P. Motlicek, Y. Qian at al.: Generating exact lattices in the WFST framework. In Proc. ICASSP, pp. 4213–4216 (2012)

    Google Scholar 

  6. Rybach, D., Hahn, S., Lehnen, P., Nolden, D., Sundermeyer, M., Tüske, Z., Wiesler, S., Schlüter, R., Ney, H.: The RASR-The RWTH Aachen University open source speech recognition toolkit. In: Proc. IEEE Automatic Speech Recognition and Understanding Workshop (2011)

    Google Scholar 

  7. Povey, D., et al.: The Kaldi speech recognition toolkit. In: Proc. ASRU, Hawaii, US, pp. 1–4 (December 2011)

    Google Scholar 

  8. Public Transport Information System for Czech Republic, https://ufal.mff.cuni.cz/alex-dialogue-systems-framework/ptics

  9. Korvas, M., Plátek, O., Dušek, O., Žilka, L., Jurčćček, F.: Free English and Czech telephone speech corpus shared under the CC-BY-SA 3.0 license. In: Proceedings of International Conference on Language Resources and Evaluation (to be published, 2014)

    Google Scholar 

  10. The Kaldi ASR toolkit (2014), http://sourceforge.net/projects/kaldi

  11. The Alex Dialogue Systems Framework (2014), https://github.com/UFAL-DSG/alex

  12. The OnlineLatgenRecogniser (2014), https://github.com/UFAL-DSG/pykaldi

  13. The pyfst library: OpenFst in Python (2014), http://pyfst.github.com/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Plátek, O., Jurčíček, F. (2014). Integration of an On-line Kaldi Speech Recogniser to the Alex Dialogue Systems Framework. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2014. Lecture Notes in Computer Science(), vol 8655. Springer, Cham. https://doi.org/10.1007/978-3-319-10816-2_73

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-10816-2_73

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-10815-5

  • Online ISBN: 978-3-319-10816-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics