Abstract
Spoken dialogue is notoriously hard to process with standard language processing technologies. Dialogue systems must meet two major challenges. First, natural spoken dialogue is replete with disfluent, partial, elided or ungrammatical utterances. Second, speech recognition remains a highly error-prone task, especially for complex, open-ended domains. We present an integrated approach for addressing these two issues, based on a robust incremental parser. The parser takes word lattices as input and handles ill-formed and misrecognised utterances by selectively relaxing its set of grammatical rules. The most relevant interpretation is then selected by a discriminative model augmented with contextual information. The approach is fully implemented in a dialogue system for autonomous robots. Evaluation results on a Wizard of Oz test suite demonstrate very significant improvements in accuracy and robustness compared to the baseline.
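The selection step described in the abstract can be illustrated with a minimal sketch of a linear discriminative scorer that ranks candidate interpretations. This is not the authors' implementation. The `Candidate` structure, the three features, the hand-set `WEIGHTS` and the `salient_entities` context are all hypothetical, chosen only to show how recognition confidence, a penalty for relaxed grammar rules, and contextual information could be combined in one score. In practice such weights would be learned from data rather than set by hand.

```python
from dataclasses import dataclass
from typing import Dict, List, Set


@dataclass
class Candidate:
    """One candidate interpretation of an utterance (hypothetical structure)."""
    words: List[str]
    relaxed_rules: int      # number of grammar-relaxation rules the parse needed
    asr_confidence: float   # recogniser confidence for this lattice path, in [0, 1]


def features(cand: Candidate, salient_entities: Set[str]) -> Dict[str, float]:
    """Map a candidate to a sparse feature vector (illustrative features only)."""
    mentions = sum(1 for w in cand.words if w in salient_entities)
    return {
        "asr_confidence": cand.asr_confidence,
        "relaxed_rules": float(cand.relaxed_rules),
        "salient_mentions": float(mentions),  # contextual feature
    }


def score(weights: Dict[str, float], feats: Dict[str, float]) -> float:
    """Linear model: dot product of the weight vector and the feature vector."""
    return sum(weights.get(name, 0.0) * value for name, value in feats.items())


def select(cands: List[Candidate], weights: Dict[str, float],
           salient_entities: Set[str]) -> Candidate:
    """Return the highest-scoring candidate interpretation."""
    return max(cands, key=lambda c: score(weights, features(c, salient_entities)))


# Assumed weights, set by hand for the example (not taken from the paper).
WEIGHTS = {"asr_confidence": 1.0, "relaxed_rules": -0.5, "salient_mentions": 0.75}

candidates = [
    Candidate(["take", "the", "bowl"], relaxed_rules=0, asr_confidence=0.60),
    Candidate(["take", "the", "ball"], relaxed_rules=0, asr_confidence=0.55),
    Candidate(["take", "ball"], relaxed_rules=1, asr_confidence=0.70),
]

# A ball is assumed to be salient in the current situation.
best = select(candidates, WEIGHTS, salient_entities={"ball"})
print(" ".join(best.words))
```

With these assumed weights, the contextual feature lets a fully grammatical reading that mentions the salient object outrank both a reading with higher recognition confidence and a reading that required rule relaxation.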
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lison, P., Kruijff, G.-J.M. (2009). Robust Processing of Situated Spoken Dialogue. In: Mertsching, B., Hund, M., Aziz, Z. (eds) KI 2009: Advances in Artificial Intelligence. KI 2009. Lecture Notes in Computer Science, vol 5803. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04617-9_31
DOI: https://doi.org/10.1007/978-3-642-04617-9_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04616-2
Online ISBN: 978-3-642-04617-9
eBook Packages: Computer Science (R0)