Abstract
Dialogue annotation is a necessary step for the development of dialogue systems, specially for data-based dialogue strategies. Manual annotation is hard and time-consuming, and automatic techniques can be used to obtain a draft annotation and speed up the process. The presentation of the draft annotation with confidence levels on the correctness of every part of the hypothesis can make even faster the supervision process. In this paper we propose two methods to calculate confidence measures for an automatic dialogue annotation model, and test them for the annotation of a task-oriented human-computer corpus on railway information. The results show that our proposals have a similar behaviour and that they are a good starting point for incorporating confidence measures in the dialogue annotation process.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Grammatical Inference and Alignment for Transducer Infer.
- 2.
References
Alcácer, N., Benedí, J.M., Blat, F., Granell, R., Martínez, C.D., Torres, F.: Acquisition and labelling of a spontaneous speech dialogue corpus. In: SPECOM, Greece, pp. 583–586 (2005)
Austin, J.L.: How to Do Things with Words. Oxford University Press, London (1962)
Benedí, J.M., Lleida, E., Varona, A., Castro, M.J., Galiano, I., Justo, R., López, I., Miguel, A.: Design and acquisition of a telephone spontaneous speech dialogue corpus in spanish: DIHANA. In: Fifth LREC, Genova, Italy, pp. 1636–1639 (2006)
Bunt, H.: Context and dialogue control. THINK Q. 3, 19–31 (1994)
Casacuberta, F., Vidal, E., Picó, D.: Inference of finite-state transducers from regular languages. Pattern Recogn. 38(9), 1431–1443 (2005)
Core, M.G., Allen, J.F.: Coding dialogues with the DAMSL annotation scheme. In: Traum, D. (ed.) Working Notes: AAAI Fall Symposium on Communicative Action in Humans and Machines, pp. 28–35. AAAI, Menlo Park (1997)
Devijver, P.A.: Baum’s forward-backward algorithm revisited. Pattern Recogn. Lett. 3(6), 369–373 (1985)
Egan, J.P.: Signal Detection Theory and Roc Analysis. Academic Press, New York (1975)
Fraser, M., Gilbert, G.: Simulating speech systems. Comp. Speech Lang. 5, 81–99 (1991)
Fukada, T., Koll, D., Waibel, A., Tanigaki, K.: Probabilistic dialogue act extraction for concept based multilingual translation systems. In: Proceedings of ICSLP, vol. 6, pp. 2771–2774 (1998)
Ghigi, F., Tamarit, V., Martínez-Hinarejos, C.-D., Benedí, J.-M.: Active learning for dialogue act labelling. In: Vitrià, J., Sanches, J.M., Hernández, M. (eds.) IbPRIA 2011. LNCS, vol. 6669, pp. 652–659. Springer, Heidelberg (2011)
Jiang, H.: Confidence measures for speech recognition: a survey. Speech Comm. 45(4), 455–470 (2005)
Lee, G.G., Mariani, J., Minker, W., Nakamura, S. (eds.): IWSDS 2010. LNCS, vol. 6392. Springer, Heidelberg (2010)
Martínez-Hinarejos, C.-D., Benedí, J.M., Granell, R.: Statistical framework for a spanish spoken dialogue corpus. Speech Commun. 50, 992–1008 (2008)
Martínez-Hinarejos, C.-D., Tamarit, V., Benedí, J.M.: Improving unsegmented dialogue turns annotation with N-gram transducers. In: Proceedings of PACLIC23, vol. 1, pp. 345–354. City University of Hong Kong Press, Hong Kong (2009)
San-Segundo, R., Pellom, B., Hacioglu, K., Ward, W., Pardo, J.M.: Confidence measures for spoken dialogue systems. In: ICASSP, vol. 1, pp. 393–396. IEEE Computer Society, Los Alamitos (2001)
Sánchez-Sáez, R., Sánchez, J.A., Benedí, J.M.: Statistical confidence measures for probabilistic parsing. In: Proceedings of RANLP’09, Borovets, Bulgaria, pp. 388–392, September 2009
Seneff, S., Polifroni, J.: Dialogue management in Mercury flight reservation system. In: ANLP-NAACL, pp. 1–6 (2000)
Stolcke, A., Coccaro, N., Bates, R., Taylor, P., van Ess-Dykema, C., Ries, K., Shriberg, E., Jurafsky, D., Martin, R., Meteer, M.: Dialogue act modelling for automatic tagging and recognition of conversational speech. Comput. Linguist. 26(3), 1–34 (2000)
Tamarit, V., Martínez-Hinarejos, C.-D., Benedí, J.M.: On the use of N-gram transducers for dialogue annotation. In: Spoken Dialogue Systems Technology and Design, pp. 255–276. Springer, New York (2011)
Ueffing, N., Macherey, K., Ney, H.: Confidence measures for statistical machine translation. In: Proceedings of the MT Summit IX, pp. 394–401. Springer (2003)
Walker, M., Passonneau, R.: DATE: a dialogue act tagging scheme for evaluation of spoken dialogue systems. In: HLT’01: Proceedings of the 1st International Conference on Human Language Technology, San Diego, pp. 1–8 (2001)
Wessel, F., Schlüter, R., Macherey, K., Ney, H.: Confidence measures for large vocabulary continuous speech recognition. IEEE Trans. Speech Audio Process. 9, 288–298 (2001)
Williams, J.D., Young, S.: Partially observable markov decision processes for spoken dialog systems. Comput. Speech Lang. 21(2), 393–422 (2007)
Acknowledgments
Work supported by EC under FP7 project CasMaCat (FP7-28757), and by Spanish MINECO under projects STraDA (TIN2012-37475-C02-01) and Active2Trans (TIN2012-31723), and by GVA under project AMIIS (ISIC/2012/004).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Martínez-Hinarejos, CD., Tamarit, V., Benedí, JM. (2014). Direct and Wordgraph-Based Confidence Measures in Dialogue Annotation with N-Gram Transducers. In: Vetulani, Z., Mariani, J. (eds) Human Language Technology Challenges for Computer Science and Linguistics. LTC 2011. Lecture Notes in Computer Science(), vol 8387. Springer, Cham. https://doi.org/10.1007/978-3-319-08958-4_22
Download citation
DOI: https://doi.org/10.1007/978-3-319-08958-4_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08957-7
Online ISBN: 978-3-319-08958-4
eBook Packages: Computer ScienceComputer Science (R0)