Abstract
This paper presents a novel approach to spoken language understanding in dialogue systems. Unlike prevalent methods that use only the word lattices, the presented approach works with phoneme lattices generated by a phoneme recognizer. The hierarchical discriminative model for speech understanding was used together with modifications proposed in this paper. The method was experimentally evaluated using two semantic corpora and the results are presented.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Švec, J., Jurčíček, F.: Extended Hidden Vector State Parser. In: Matoušek, V., Mautner, P. (eds.) TSD 2009. LNCS, vol. 5729, pp. 403–410. Springer, Heidelberg (2009)
Mairesse, F., Gašić, M., Jurčíček, F., Keizer, S., Thomson, B., Yu, K., Young, S.: Spoken language understanding from unaligned data using discriminative classification models. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2009, Taipei, pp. 4749–4752. IEEE (2009)
Raymond, C., Béchet, F., De Mori, R., Damnati, G.: On the use of finite state transducers for semantic interpretation. Speech Communication 48(3-4), 288–304 (2006)
Valenta, T., Švec, J., Šmídl, L.: Spoken Dialogue System Design in 3 Weeks. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds.) TSD 2012. LNCS, vol. 7499, pp. 624–631. Springer, Heidelberg (2012)
Švec, J., Šmídl, L., Ircing, P.: Hierarchical Discriminative Model for Spoken Language Understanding. In: IEEE International Conference on Acoustics Speech and Signal Processing, pp. 8322–8326. IEEE, Vancouver (2013)
Cortes, C., Haffner, P.: Rational kernels: Theory and algorithms. The Journal of Machine Learning 5, 1035–1062 (2004)
Graf, A.B., Smola, A.J., Borer, S.: Classification in a normalized feature space using support vector machines. IEEE Transactions on Neural Networks 14(3), 597–605 (2003)
Švec, J., Ircing, P.: Efficient algorithm for rational kernel evaluation in large lattice sets. In: IEEE International Conference on Acoustics Speech and Signal Processing, pp. 3133–3137. IEEE, Vancouver (2013)
Jurčíček, F., Zahradil, J., Jelínek, L.: A human-human train timetable dialogue corpus. In: Proceedings of EUROSPEECH, Lisboa, pp. 1525–1528 (2005)
Jurčíček, F., Švec, J., Müller, L.: Extension of HVS semantic parser by allowing left-right branching. In: IEEE International Conference on Acoustics Speech and Signal Processing, vol. (1), pp. 4993–4996 (2008)
Psutka, J., Švec, J., Psutka, J.V., Vaněk, J., Pražák, A., Šmídl, L., Ircing, P.: System for fast lexical and phonetic spoken term detection in a Czech cultural heritage archive. EURASIP Journal on Audio, Speech, and Music Processing (1), 1–11 (2011)
Soutner, D., Loose, Z., Müller, L., Pražák, A.: Neural Network Language Model with Cache. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds.) TSD 2012. LNCS, vol. 7499, pp. 528–534. Springer, Heidelberg (2012)
Dogan, C., Saraclar, M.: Lattice Indexing for Spoken Term Detection. IEEE Transactions on Audio, Speech and Language Processing 19(8), 2338–2347 (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Švec, J., Šmídl, L. (2013). On the Use of Phoneme Lattices in Spoken Language Understanding. In: Habernal, I., Matoušek, V. (eds) Text, Speech, and Dialogue. TSD 2013. Lecture Notes in Computer Science(), vol 8082. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40585-3_47
Download citation
DOI: https://doi.org/10.1007/978-3-642-40585-3_47
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40584-6
Online ISBN: 978-3-642-40585-3
eBook Packages: Computer ScienceComputer Science (R0)