Skip to main content

Classification Approach to Word Selection in Machine Translation

  • Conference paper
  • First Online:
Machine Translation: From Research to Real Users (AMTA 2002)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2499))

Included in the following conference series:

  • 643 Accesses

Abstract

We present a classification approach to building a English-Korean machine translation (MT) system. We attempt to build a word-based MT system from scratch using a set of parallel documents, online dictionary queries, and monolingual documents on the web. In our approach, MT problem is decomposed into two sub-problems — word selection problem and word ordering problem of the selected words. In this paper, we will focus on the word selection problem and discuss some preliminary results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bangalore, S., Riccardi, G.: A finite-state approach to machine translation. In: NAACL. (2001)

    Google Scholar 

  2. Carlson, A., Cumby, C., Rosen, J., Roth, D.: The SNoW learning architecture. Technical Report UIUCDCS-R-99-2101, UIUC Computer Science Department (1999)

    Google Scholar 

  3. Even-Zohar, Y., Roth, D.: A sequential model for multi-class classification. In: EMNLP. (2001)

    Google Scholar 

  4. Germann, U.: Building a statistical machine translation system from scratch: How much bang can we expect for the buck. In: Proceedings of the Data-Driven MT Workshop of ACL-01. (2001)

    Google Scholar 

  5. Golding, A., Roth, D.: A winnow-based approach to spelling correction. Machine Learning 34 (1999) 107–130

    Article  MATH  Google Scholar 

  6. Koehn, P., Knight, K.: Knowledge sources for word-level translation models. In: Empirical Methods in Natural Language Processing conference. (2001)

    Google Scholar 

  7. Lee, H.: A theory of portability. In: LREC2002: Workshop on Portability Issues in HLT. (2002)

    Google Scholar 

  8. Littlestone, N.: Learning quickly when irrelevant attributes abound: A new linear-threshold algorithm. Machine Learning 2 (1988) 285–318

    Google Scholar 

  9. Marcu, Y.A.O.U.G.U.H.K.K.P.K.D., Yamada, K.: Translating with scarce resources. In: National Conference on Artificial Intelligence (AAAI). (2000)

    Google Scholar 

  10. Marcu, U.G.M.J.K.K.D., Yamada, K.: Fast decoding and optimal decoding for machine translation. In: Proc. of the Conference of the Association for Computational Linguistics (ACL). (2001)

    Google Scholar 

  11. Munoz, M., Punyakanok, V., Roth, D., Zimak, D.: A learning approach to shallow parsing. In: EMNLP-VLC’99, the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora. (1999) 168–178

    Google Scholar 

  12. Ng, H.T., Lee, H.B.: Integrating multiple knowledge sources to disambiguate word sense: An exemplar-based approach. In: Proc. of 34th Conference of the ACL. (1996)

    Google Scholar 

  13. Pederson, T.: Evaluating the effectiveness of ensembles of decision trees in dis-ambiguating senseval lexical samples. In: ACL02: Workshop on WSD: Recent Successes and Future Directions. (2002)

    Google Scholar 

  14. Roth, D., Zelenko, D.: Part of speech tagging using a network of linear separators. In: COLING-ACL 98, The 17th International Conference on Computational Linguistics. (1998) 1136–1142

    Google Scholar 

  15. Roth, D.: Learning to resolve natural language ambiguities: A unified approach. In: Proc. of the American Association of Artificial Intelligence. (1998) 806–813

    Google Scholar 

  16. Roth, D.: Learning in natural language. In: Proc. of the International Joint Conference on Artificial Intelligence. (1999) 898–904

    Google Scholar 

  17. Valiant, L.G.: A theory of the learnable. Communications of the ACM 27 (1984) 1134–1142

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lee, H.K. (2002). Classification Approach to Word Selection in Machine Translation. In: Richardson, S.D. (eds) Machine Translation: From Research to Real Users. AMTA 2002. Lecture Notes in Computer Science(), vol 2499. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45820-4_12

Download citation

  • DOI: https://doi.org/10.1007/3-540-45820-4_12

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44282-0

  • Online ISBN: 978-3-540-45820-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics