Abstract
Knowledge acquisition is a critical problem for machine translation and translation selection. In this paper, I propose a tranlsation selection method that combines variable features from multiple language resources using machine learning. I introduce multiple measures for sense disambiguation and word selection that are based on language resources, and apply machine learning to combine those measures for translation selection. In evaluation, precision of translation selection improves even though a small-sized bilingual corpus is used as training data.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Brown, P.F., Cocke, J., Pietra, V.D., Pietra, S.D., Jelinek, F., Lafferty, J.D., Mercer, R.L., Roossin, P.S.: A Statistical Approach to Machine Translation. Computational Linguistics 16(2) (1990)
Munteanu, D., Marcu, D.: Improving Machine Translation Performance by Exploiting Comparable Corpora. Computational Linguistics 31(4) (2005)
Dagan, I., Itai, A.: Word Sense Disambiguation Using a Second Language Monolingual Corpus. Computational Linguistics 20(4) (1994)
Prescher, D., Riezler, S., Rooth, M.: Using a Probabilistic Class-Based Lexicon for Lexical Ambiguity Resolution. In: Proceedings of the 18th International Conference on Computational Linguistics (2000)
Koehn, P., Knight, K.: Estimating Word Translation Probabilities from Unrelated Monolingual Corpora Using the EM Algorithm. In: Proceedings of National Conference on Artificial Intelligence (2000)
Gaussier, E., Renders, J.-M., Matveeva, I., Goutte, C., Déjean, H.: A Geometric view on bilingual lexicon extraction from comparable corpora. In: Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics (2004)
Lee, H.A., Yoon, J., Kim, G.C.: Translation Selection by Combining Multiple Measures for Sense Disambiguation and Word Selection. International Journal of Computer Processing of Oriental Languages 16(3) (2003)
Daelemans, W., Zavrel, J., Van der Sloot, K., Van den Bosch, A.: TiMBL: Tilburg Memory Based Learner, version 5.1, Reference Guide. ILK Technical Report Series 04-02 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lee, H.A. (2006). Translation Selection Through Machine Learning with Language Resources. In: Matsumoto, Y., Sproat, R.W., Wong, KF., Zhang, M. (eds) Computer Processing of Oriental Languages. Beyond the Orient: The Research Challenges Ahead. ICCPOL 2006. Lecture Notes in Computer Science(), vol 4285. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11940098_38
Download citation
DOI: https://doi.org/10.1007/11940098_38
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49667-0
Online ISBN: 978-3-540-49668-7
eBook Packages: Computer ScienceComputer Science (R0)