Skip to main content

On the Use of Phoneme Lattices in Spoken Language Understanding

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8082))

Abstract

This paper presents a novel approach to spoken language understanding in dialogue systems. Unlike prevalent methods that use only the word lattices, the presented approach works with phoneme lattices generated by a phoneme recognizer. The hierarchical discriminative model for speech understanding was used together with modifications proposed in this paper. The method was experimentally evaluated using two semantic corpora and the results are presented.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Švec, J., Jurčíček, F.: Extended Hidden Vector State Parser. In: Matoušek, V., Mautner, P. (eds.) TSD 2009. LNCS, vol. 5729, pp. 403–410. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  2. Mairesse, F., Gašić, M., Jurčíček, F., Keizer, S., Thomson, B., Yu, K., Young, S.: Spoken language understanding from unaligned data using discriminative classification models. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2009, Taipei, pp. 4749–4752. IEEE (2009)

    Google Scholar 

  3. Raymond, C., Béchet, F., De Mori, R., Damnati, G.: On the use of finite state transducers for semantic interpretation. Speech Communication 48(3-4), 288–304 (2006)

    Article  Google Scholar 

  4. Valenta, T., Švec, J., Šmídl, L.: Spoken Dialogue System Design in 3 Weeks. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds.) TSD 2012. LNCS, vol. 7499, pp. 624–631. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  5. Švec, J., Šmídl, L., Ircing, P.: Hierarchical Discriminative Model for Spoken Language Understanding. In: IEEE International Conference on Acoustics Speech and Signal Processing, pp. 8322–8326. IEEE, Vancouver (2013)

    Google Scholar 

  6. Cortes, C., Haffner, P.: Rational kernels: Theory and algorithms. The Journal of Machine Learning 5, 1035–1062 (2004)

    MathSciNet  MATH  Google Scholar 

  7. Graf, A.B., Smola, A.J., Borer, S.: Classification in a normalized feature space using support vector machines. IEEE Transactions on Neural Networks 14(3), 597–605 (2003)

    Article  Google Scholar 

  8. Švec, J., Ircing, P.: Efficient algorithm for rational kernel evaluation in large lattice sets. In: IEEE International Conference on Acoustics Speech and Signal Processing, pp. 3133–3137. IEEE, Vancouver (2013)

    Google Scholar 

  9. Jurčíček, F., Zahradil, J., Jelínek, L.: A human-human train timetable dialogue corpus. In: Proceedings of EUROSPEECH, Lisboa, pp. 1525–1528 (2005)

    Google Scholar 

  10. Jurčíček, F., Švec, J., Müller, L.: Extension of HVS semantic parser by allowing left-right branching. In: IEEE International Conference on Acoustics Speech and Signal Processing, vol. (1), pp. 4993–4996 (2008)

    Google Scholar 

  11. Psutka, J., Švec, J., Psutka, J.V., Vaněk, J., Pražák, A., Šmídl, L., Ircing, P.: System for fast lexical and phonetic spoken term detection in a Czech cultural heritage archive. EURASIP Journal on Audio, Speech, and Music Processing (1), 1–11 (2011)

    Google Scholar 

  12. Soutner, D., Loose, Z., Müller, L., Pražák, A.: Neural Network Language Model with Cache. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds.) TSD 2012. LNCS, vol. 7499, pp. 528–534. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  13. Dogan, C., Saraclar, M.: Lattice Indexing for Spoken Term Detection. IEEE Transactions on Audio, Speech and Language Processing 19(8), 2338–2347 (2011)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Švec, J., Šmídl, L. (2013). On the Use of Phoneme Lattices in Spoken Language Understanding. In: Habernal, I., Matoušek, V. (eds) Text, Speech, and Dialogue. TSD 2013. Lecture Notes in Computer Science(), vol 8082. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40585-3_47

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-40585-3_47

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-40584-6

  • Online ISBN: 978-3-642-40585-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics