Classification Approach to Word Selection in Machine Translation

Lee, Hyo-Kyung

doi:10.1007/3-540-45820-4_12

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 2499)

Included in the following conference series:

Conference of the Association for Machine Translation in the Americas

Abstract

We present a classification approach to building an English-Korean machine translation (MT) system. We attempt to build a word-based MT system from scratch using a set of parallel documents, online dictionary queries, and monolingual documents on the web. In our approach, the MT problem is decomposed into two sub-problems: selecting the target words, and ordering the selected words. In this paper, we focus on the word selection problem and discuss some preliminary results.
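To make the framing concrete, word selection can be read as multi-class classification: for each source word, the classes are its candidate target words and the features come from the surrounding context. The sketch below is illustrative only and is not taken from the paper. It assumes a plain multiclass perceptron, a bag-of-words context window, and a hand-written toy data set; the names `context_features`, `WordSelector`, and the candidate translations for "bank" are all hypothetical.

```python
from collections import defaultdict


def context_features(tokens, i, window=2):
    """Bag-of-words features from the words around position i.

    The feature set is an assumption made for this sketch.
    """
    feats = set()
    for j in range(max(0, i - window), min(len(tokens), i + window + 1)):
        if j != i:
            feats.add("ctx=" + tokens[j])
    return feats


class WordSelector:
    """One multiclass perceptron per source word (a stand-in classifier).

    Classes are the candidate target words for that source word.
    """

    def __init__(self):
        # weights[source_word][target_word][feature] -> float
        self.weights = defaultdict(lambda: defaultdict(lambda: defaultdict(float)))

    def score(self, src, tgt, feats):
        w = self.weights[src][tgt]
        return sum(w.get(f, 0.0) for f in feats)

    def predict(self, tokens, i, candidates):
        src = tokens[i]
        feats = context_features(tokens, i)
        # Ties are broken in favour of the first listed candidate.
        return max(candidates[src], key=lambda t: self.score(src, t, feats))

    def train(self, examples, candidates, epochs=5):
        for _ in range(epochs):
            for tokens, i, gold in examples:
                guess = self.predict(tokens, i, candidates)
                if guess != gold:
                    # Perceptron update: promote the gold class, demote the guess.
                    src = tokens[i]
                    for f in context_features(tokens, i):
                        self.weights[src][gold][f] += 1.0
                        self.weights[src][guess][f] -= 1.0


# Toy data (invented for illustration): English "bank" with two Korean candidates.
candidates = {"bank": ["은행", "강둑"]}  # financial institution vs. riverbank
examples = [
    ("deposit money in the bank".split(), 4, "은행"),
    ("the river bank was muddy".split(), 2, "강둑"),
    ("the bank approved the loan".split(), 1, "은행"),
    ("fishing on the bank of the river".split(), 3, "강둑"),
]

selector = WordSelector()
selector.train(examples, candidates)

river_choice = selector.predict("she walked along the river bank".split(), 5, candidates)
loan_choice = selector.predict("the bank approved my mortgage".split(), 1, candidates)
```

In a setting like the one the abstract describes, the candidate lists would presumably come from dictionary lookups and the training examples from parallel documents; here both are hard-coded so the sketch stays self-contained. Ordering the selected words, the second sub-problem, is not addressed by this sketch.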

Author information

Authors and Affiliations

Department of Computer Science, 1304 W. Springfield Ave., Urbana, IL, 61801, USA
Hyo-Kyung Lee

Editor information

Editors and Affiliations

Microsoft Research, 1 Microsoft Way, Redmond, WA, 98052, USA
Stephen D. Richardson

About this paper

Cite this paper

Lee, H.K. (2002). Classification Approach to Word Selection in Machine Translation. In: Richardson, S.D. (ed.) Machine Translation: From Research to Real Users. AMTA 2002. Lecture Notes in Computer Science, vol 2499. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45820-4_12

DOI: https://doi.org/10.1007/3-540-45820-4_12
Published: 20 September 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44282-0
Online ISBN: 978-3-540-45820-3
eBook Packages: Springer Book Archive
