On the Use of Phoneme Lattices in Spoken Language Understanding

Švec, Jan; Šmídl, Luboš

doi:10.1007/978-3-642-40585-3_47

On the Use of Phoneme Lattices in Spoken Language Understanding

Jan Švec^20,21 &
Luboš Šmídl^20,21

Conference paper

2409 Accesses
2 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8082))

Abstract

This paper presents a novel approach to spoken language understanding in dialogue systems. Unlike prevalent methods that use only the word lattices, the presented approach works with phoneme lattices generated by a phoneme recognizer. The hierarchical discriminative model for speech understanding was used together with modifications proposed in this paper. The method was experimentally evaluated using two semantic corpora and the results are presented.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Švec, J., Jurčíček, F.: Extended Hidden Vector State Parser. In: Matoušek, V., Mautner, P. (eds.) TSD 2009. LNCS, vol. 5729, pp. 403–410. Springer, Heidelberg (2009)
Chapter Google Scholar
Mairesse, F., Gašić, M., Jurčíček, F., Keizer, S., Thomson, B., Yu, K., Young, S.: Spoken language understanding from unaligned data using discriminative classification models. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2009, Taipei, pp. 4749–4752. IEEE (2009)
Google Scholar
Raymond, C., Béchet, F., De Mori, R., Damnati, G.: On the use of finite state transducers for semantic interpretation. Speech Communication 48(3-4), 288–304 (2006)
Article Google Scholar
Valenta, T., Švec, J., Šmídl, L.: Spoken Dialogue System Design in 3 Weeks. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds.) TSD 2012. LNCS, vol. 7499, pp. 624–631. Springer, Heidelberg (2012)
Chapter Google Scholar
Švec, J., Šmídl, L., Ircing, P.: Hierarchical Discriminative Model for Spoken Language Understanding. In: IEEE International Conference on Acoustics Speech and Signal Processing, pp. 8322–8326. IEEE, Vancouver (2013)
Google Scholar
Cortes, C., Haffner, P.: Rational kernels: Theory and algorithms. The Journal of Machine Learning 5, 1035–1062 (2004)
MathSciNet MATH Google Scholar
Graf, A.B., Smola, A.J., Borer, S.: Classification in a normalized feature space using support vector machines. IEEE Transactions on Neural Networks 14(3), 597–605 (2003)
Article Google Scholar
Švec, J., Ircing, P.: Efficient algorithm for rational kernel evaluation in large lattice sets. In: IEEE International Conference on Acoustics Speech and Signal Processing, pp. 3133–3137. IEEE, Vancouver (2013)
Google Scholar
Jurčíček, F., Zahradil, J., Jelínek, L.: A human-human train timetable dialogue corpus. In: Proceedings of EUROSPEECH, Lisboa, pp. 1525–1528 (2005)
Google Scholar
Jurčíček, F., Švec, J., Müller, L.: Extension of HVS semantic parser by allowing left-right branching. In: IEEE International Conference on Acoustics Speech and Signal Processing, vol. (1), pp. 4993–4996 (2008)
Google Scholar
Psutka, J., Švec, J., Psutka, J.V., Vaněk, J., Pražák, A., Šmídl, L., Ircing, P.: System for fast lexical and phonetic spoken term detection in a Czech cultural heritage archive. EURASIP Journal on Audio, Speech, and Music Processing (1), 1–11 (2011)
Google Scholar
Soutner, D., Loose, Z., Müller, L., Pražák, A.: Neural Network Language Model with Cache. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds.) TSD 2012. LNCS, vol. 7499, pp. 528–534. Springer, Heidelberg (2012)
Chapter Google Scholar
Dogan, C., Saraclar, M.: Lattice Indexing for Spoken Term Detection. IEEE Transactions on Audio, Speech and Language Processing 19(8), 2338–2347 (2011)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Cybernetics, University of West Bohemia, Czech Republic
Jan Švec & Luboš Šmídl
NTIS - New Technologies for Information Society, Faculty of Applied Sciences, University of West Bohemia, Czech Republic
Jan Švec & Luboš Šmídl

Authors

Jan Švec
View author publications
You can also search for this author in PubMed Google Scholar
Luboš Šmídl
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of West Bohemia, 306 14, Pilsen, Czech Republic
Ivan Habernal & Václav Matoušek &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Švec, J., Šmídl, L. (2013). On the Use of Phoneme Lattices in Spoken Language Understanding. In: Habernal, I., Matoušek, V. (eds) Text, Speech, and Dialogue. TSD 2013. Lecture Notes in Computer Science(), vol 8082. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40585-3_47

Download citation

DOI: https://doi.org/10.1007/978-3-642-40585-3_47
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40584-6
Online ISBN: 978-3-642-40585-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics