Abstract
Since the design and acquisition of a new dialog corpus is a complex task, new methods to facilitate this task are necessary. In this paper, we present a methodology to make use of our previous work within the framework of dialog systems in order to acquire a dialog corpus for a new domain. The main idea is the simulation of recognition and understanding errors in the acquisition of the new dialog corpus. This simulation is based on the analysis of such errors in a previously acquired corpus and the definition of a correspondence table among the concepts and attributes of both tasks. This correspondence table is based on the similarity of semantic meaning and frequencies. Finally, the application of this methodology is illustrated in some examples.
This work has been partially supported by the Spanish Government and FEDER under contract TIN2005-08660-C04-02, and by the Vicerrectorado de Innovación y Desarrollo of the Universidad Politécnica de Valencia under contract 4681.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Potamianos, A., Narayanan, S., Riccardi, G.: Adaptive Categorical Understanding for Spoken Dialogue Systems. IEEE Transactions on Speech and Audio Processing 13(3), 321–329 (2005)
Torres, F., Hurtado, L., García, F., Sanchis, E., Segarra, E.: Error handling in a stochastic dialog system through confidence measures. Speech Communication 45, 211–229 (2005)
Hurtado, L.F., Griol, D., Segarra, E., Sanchis, E.: A stochastic approach for dialog management based on neural networks. In: Proc. of Interspeech 2006-ICSLP, Pittsburgh, pp. 49–52 (2006)
Williams, J., Young, S.: Partially Observable Markov Decision Processes for Spoken Dialog Systems. Computer Speech and Language 21(2), 393–422 (2007)
Grau, S., Segarra, E., Sanchís, E., García, F., Hurtado, L.F.: Incorporating semantic knowledge to the language model in a speech understanding system. In: IV Jornadas en Tecnologia del Habla, Zaragoza, Spain, pp. 145–148 (2006)
Benedí, J., Lleida, E., Varona, A., Castro, M., Galiano, I., Justo, R., López, I., Miguel, A.: Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA. In: Proc. of LREC 2006, Genove, Italy, pp. 1636–1639 (2006)
Lleida, E., Segarra, E., Torres, M., Macías-Guarasa, J.I.: EDECAN: sistEma de Diálogo multidominio con adaptación al contExto aCústico y de AplicacióN. In: IV Jornadas en Tecnologia del Habla, Zaragoza, Spain, pp. 291–296 (2006)
Griol, D., Torres, F., Hurtado, L., Grau, S., García, F., Sanchis, E., Segarra, E.: A dialog system for the DIHANA Project. In: Proc. of SPECOM 2006, S. Petersburgh, pp. 131–136 (2006)
Fukada, T., Koll, D., Waibel, A., Tanigaki, K.: Probabilistic dialogue extraction for concept based multilingual translation systems. In: Proc. Int. Conf. on Spoken Language Processing, pp. 2771–2774 (1998)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Garcia, F., Hurtado, L.F., Griol, D., Castro, M., Segarra, E., Sanchis, E. (2007). Recognition and Understanding Simulation for a Spoken Dialog Corpus Acquisition. In: Matoušek, V., Mautner, P. (eds) Text, Speech and Dialogue. TSD 2007. Lecture Notes in Computer Science(), vol 4629. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74628-7_74
Download citation
DOI: https://doi.org/10.1007/978-3-540-74628-7_74
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74627-0
Online ISBN: 978-3-540-74628-7
eBook Packages: Computer ScienceComputer Science (R0)