Abstract
We present a method of robust domain selection against out-of-grammar (OOG) utterances in multi-domain spoken dialogue systems. These utterances cause language-understanding errors because of a limited set of grammar and vocabulary of the systems, and deteriorate the domain selection. This is critical for multi-domain spoken dialogue systems to determine a system’s response. We first define a topic as a domain from which the user wants to retrieve information, and estimate it as the user’s intention. This topic estimation is enabled by using a large amount of sentences collected from the Web and Latent Semantic Mapping (LSM). The results are reliable even for OOG utterances. We then integrated both the topic estimation results and the dialogue history to construct a robust domain classifier against OOG utterances. The idea of integration is based on the fact that the reliability of the dialogue history is often impeded by language-understanding errors caused by OOG utterances, from which using topic estimation obtains useful information. Experimental results using 2191 utterances showed that our integrated method reduced domain selection errors by 14.3%.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Komatani, K., Kanda, N., Nakano, M., Nakadai, K., Tsujino, H., Ogata, T., Okuno, H.G.: Multi-domain spoken dialogue system with extensibility and robustness against speech recognition errors. In: Proc. SIGDial, pp. 9–17 (2006)
Bellegarda, J.R.: Latent semantic mapping. IEEE Signal Processing Mag. 22(5), 70–80 (2005)
Lin, B., Wang, H., Lee, L.: A distributed agent architecture for intelligent multi-domain spoken dialogue systems. In: Proc. ASRU (1999)
O’Neill, I., Hanna, P., Liu, X., McTear, M.: Cross domain dialogue modelling: an object-based approach. In: Proc. ICSLP, pp. 205–208 (2004)
Lane, I.R., Kawahara, T., Matsui, T., Nakamura, S.: Topic classification and verification modeling for out-of-domain utterance detection. In: Proc. ICSLP, pp. 2197–2200 (2004)
Ikeda, S., Komatani, K., Ogata, T., Okuno, H.G.: Topic estimation with domain extensibility for guiding user’s out-of-grammar utterance in multi-domain spoken dialogue systems. In: Proc. Interspeech, pp. 2561–2564 (2007)
Misu, T., Kawahara, T.: A bootstrapping approach for developing language model of new spoken dialogue systems by selecting Web texts. In: Proc. ICSLP, pp. 9–12 (2006)
Komatani, K., Fukubayashi, Y., Ogata, T., Okuno, H.G.: Introducing utterance verification in spoken dialogue system to improve dynamic help generation for novice users. In: Proc. SIGDial, pp. 202–205 (2007)
Kawahara, T., Lee, A., Takeda, K., Itou, K., Shikano, K.: Recent progress of open-source LVCSR engine Julius and Japanese model repository. In: Proc. ICSLP, pp. 3069–3072 (2004)
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo (1993), http://www.rulequest.com/see5-info.html
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ikeda, S., Komatani, K., Ogata, T., Okuno, H.G. (2008). Integrating Topic Estimation and Dialogue History for Domain Selection in Multi-domain Spoken Dialogue Systems. In: Nguyen, N.T., Borzemski, L., Grzech, A., Ali, M. (eds) New Frontiers in Applied Artificial Intelligence. IEA/AIE 2008. Lecture Notes in Computer Science(), vol 5027. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69052-8_31
Download citation
DOI: https://doi.org/10.1007/978-3-540-69052-8_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69045-0
Online ISBN: 978-3-540-69052-8
eBook Packages: Computer ScienceComputer Science (R0)