Abstract
Data annotation is a common way to improve the reliability of advanced dialog applications. Unfortunately, since those annotations are highly language-dependent, the universalization can become a very lenghty process. Even though some projection methods exist, most of them require a deeper level of annotation than the one used for advanced dialogs. In this paper, we present a consensus approach that exploits the specificities of a sparse annotation in order to do the data projection.
Keywords
- Machine Translation
- Statistical Machine Translation
- Computational Linguistics
- Consensus Approach
- Annotation Projection
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bangalore, S., Murdock, V., Riccardi, G.: Bootstrapping Bilingual Data Using Consensus Translation for a Multilingual Instant Messaging System. In: Proceedings of the 19th International Conference on Computational Linguistics, pp. 1–7 (2002)
Bouillon, P., Rayner, M., Novellas, B., Nakao, Y., Santaholma, M., Starlander, M., Chatzichrisafis, N.: Une grammaire multilingue partagée pour la traduction automatique de la parole. In: Proceedings of Traitement Automatique des Langues Naturelles, pp. 155–173 (2006)
Chiang, D.: A Hierarchical Phrase-Based Model for Statistical Machine Translation. In: ACL 2005 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, pp. 263–270 (2005)
Rebecca, H., Resnik, P., Weinberg, A., Cabezas, C., Kolak, O.: Bootstrapping Parsers via Syntactic Projection across Parallel Texts. In: Natural Language Engineering, vol. 11(3), pp. 311–325 (2004)
Kim, R., Dalrymple, M., Kaplan, R., King, T.H., Masuichi, H., Ohkuma, T.: Multilingual Grammar Development via Grammar Porting. In: Proceedings of the ESSLLI 2003 Workshop on Ideas and Strategies for Multilingual Grammar Development, pp. 49–56 (2003)
Kim, R., Jeong, M., Lee, J., Lee, G.G.: A Cross-lingual Annotation Projection Approach for Relation Detection. In: Proceedings of the 23rd International Conference on Computational Linguistics, pp. 564–571 (2010)
Koehn, P., Och, F.J., Marcu, D.: Statistical Phrase-Based Translation. In: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, vol. 1, pp. 48–54 (2003)
Koehn, P., Schroeder, J.: Experiments in Domain Adaptation for Statistical Machine Translation. In: Proceedings of the Second Workshop on Statistical Machine Translation, StatMT 2007, pp. 224–227 (2007)
Koehn, P., Hoang, H., Birch, A., Callison-Burch, C., Federico, M., Bertoldi, N., Cowan, B., Shen, W., Moran, C., Zens, R., Dyer, C., Bojar, O., Constantin, A., Herbst, E.: Moses: Open Source Toolkit for Statistical Machine Translation. In: Annual Meeting of the Association for Computational Linguistics, pp. 177–180 (2007)
Santaholma, M.: Linguistic Representation of Finnish in a Limited Domain Speech-to-Speech Translation System. In: Proceedings of the 10th Conference on European Association of Machine Translation, pp. 226–234 (2005)
Santaholma, M.: Multilingual Grammar Resources in Multilingual Application Development. In: Proceedings of Grammar Engineering Across Frameworks Workshop, pp. 25–32 (2008)
Tiedemann, J.: News from OPUS – A Collection of Multilingual Parallel Corpora with Tools and Interface. In: Recent Advances in Natural Language Processing, vol. V, pp. 237–248 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Julien, S., Langlais, P., Tremblay, R. (2014). A Consensus Approach for Annotation Projection in an Advanced Dialog Context. In: Sokolova, M., van Beek, P. (eds) Advances in Artificial Intelligence. Canadian AI 2014. Lecture Notes in Computer Science(), vol 8436. Springer, Cham. https://doi.org/10.1007/978-3-319-06483-3_14
Download citation
DOI: https://doi.org/10.1007/978-3-319-06483-3_14
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-06482-6
Online ISBN: 978-3-319-06483-3
eBook Packages: Computer ScienceComputer Science (R0)