Abstract
We present a classification approach to building an English-Korean machine translation (MT) system. We attempt to build a word-based MT system from scratch using a set of parallel documents, online dictionary queries, and monolingual documents on the web. In our approach, the MT problem is decomposed into two sub-problems: selecting the target words, and ordering the selected words. In this paper, we focus on the word selection problem and discuss some preliminary results.
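To make the word selection sub-problem concrete, the following is a minimal sketch of treating it as classification: each candidate target word is a class, and the neighbouring source words are the features. The abstract does not specify the classifier or the feature set, so everything here is an assumption for illustration only: the co-occurrence scorer, the window size, the names `context_features` and `WordSelector`, and the English-Korean training pairs are hypothetical and are not the paper's method or data.

```python
from collections import Counter, defaultdict


def context_features(tokens, i, window=2):
    """Return the source words within `window` positions of tokens[i].

    Hypothetical feature set; the abstract does not say which features are used.
    """
    lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
    return [tokens[j] for j in range(lo, hi) if j != i]


class WordSelector:
    """Toy selector: scores each candidate target word by how often the
    current context words co-occurred with it in training."""

    def __init__(self):
        # counts[source_word][target_word][context_word] -> frequency
        self.counts = defaultdict(lambda: defaultdict(Counter))

    def train(self, examples):
        # examples: iterable of (source_tokens, position, target_word)
        for tokens, i, target in examples:
            for feat in context_features(tokens, i):
                self.counts[tokens[i]][target][feat] += 1

    def select(self, tokens, i):
        candidates = self.counts.get(tokens[i])
        if not candidates:
            return None  # source word never seen in training
        feats = context_features(tokens, i)
        # Sorting makes tie-breaking deterministic.
        return max(sorted(candidates),
                   key=lambda t: sum(candidates[t][f] for f in feats))


# Illustrative (made-up) word-aligned examples for the ambiguous word "bank".
EXAMPLES = [
    ("i deposited money at the bank".split(), 5, "은행"),
    ("the bank approved the loan".split(), 1, "은행"),
    ("we walked along the river bank".split(), 5, "강둑"),
    ("the river bank was muddy".split(), 2, "강둑"),
]

selector = WordSelector()
selector.train(EXAMPLES)
print(selector.select("she sat on the river bank".split(), 5))
print(selector.select("the bank approved my loan".split(), 1))
```

A learned linear classifier could replace the count-based scorer without changing the interface; the point of the sketch is only that word selection reduces to choosing one class per source word given its context, leaving word ordering as a separate step.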
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lee, H.K. (2002). Classification Approach to Word Selection in Machine Translation. In: Richardson, S.D. (ed.) Machine Translation: From Research to Real Users. AMTA 2002. Lecture Notes in Computer Science, vol 2499. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45820-4_12
DOI: https://doi.org/10.1007/3-540-45820-4_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44282-0
Online ISBN: 978-3-540-45820-3
eBook Packages: Springer Book Archive