Empirical Methods for MT Lexicon Development

Dan Melamed, I.

doi:10.1007/3-540-49478-2_2

I. Dan Melamed⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1529))

Included in the following conference series:

Conference of the Association for Machine Translation in the Americas

684 Accesses
3 Citations

Abstract

This article reviews some recently invented methods for automatically extracting translation lexicons from parallel texts. The accuracy of these methods has been significantly improved by exploiting known properties of parallel texts and of particular language pairs. The state of the art has advanced to the point where non-compositional compounds can be automatically identified with high reliability, and their translations can be found. Most importantly, all of these methods can be smoothly integrated into the usual work ow of MT system developers. Semi-automatic MT lexicon construction is likely to be more efficient and more accurate than either fully automatic or fully manual methods alone.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Bibliography

I. D. Melamed. (1998) Empirical Methods for Exploiting Parallel Texts, Ph.D. dissertation. University of Pennsylvania, Philadelphia, PA.
Google Scholar
I. D. Melamed. (to appear) “Bitext Maps and Alignment via Pattern Recognition,” to appear in Computational Linguistics.
Google Scholar
I. D. Melamed. (submitted) “Word-to-Word Models of Translational Equivalence,” submitted to Computational Linguistics.
Google Scholar
D. Yarowsky. (1993) “One Sense Per Collocation,” DARPA Workshop on Human Language Technology. Princeton, NJ.
Google Scholar

Download references

Author information

Authors and Affiliations

Computer Science Research Department, West Group D1-66F, 610 Opperman Drive, Eagan, MN, 55123
I. Dan Melamed

Authors

I. Dan Melamed
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Computing Research Lab, New Mexico State University, Box 30001 / 3CRL, Las Cruces, NM, 88003, USA
David Farwell
SYSTRAN Inc., 7855 Fay Avenue, Suite 300, P.O. Box 907, La Jolla, CA, 92037, USA
Laurie Gerber
Information Sciences Institute, University of Southern California, 4676 Admiralty Way, Marina del Rey, CA, 90292-6695, USA
Eduard Hovy

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dan Melamed, I. (1998). Empirical Methods for MT Lexicon Development. In: Farwell, D., Gerber, L., Hovy, E. (eds) Machine Translation and the Information Soup. AMTA 1998. Lecture Notes in Computer Science(), vol 1529. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-49478-2_2

Download citation

DOI: https://doi.org/10.1007/3-540-49478-2_2
Published: 24 September 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65259-5
Online ISBN: 978-3-540-49478-2
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics