Abstract
Although the problem of full machine translation (MT) is unsolved yet, the computer aided translation (CAT) makes progress. In this field we created a work environment for monolingual translator 1. This package of tools generally enables a user who masters a source language to translate texts to a target language which the user does not master. The application is for Hebrew-to-Russian case, emphasizing specific problems of these languages, but it can be adapted for other pairs of languages also. After Source Text Preparation, Morphological Analysis provides all the meanings for every word. The ambiguity problem is very serious in languages with incomplete writing, like Hebrew. But the main problem is the translation itself. Words’ meanings mapping between languages is M:M, i.e., almost every source word has a number of possible translations, and almost every target word can be a translation of several words. Many methods for resolving of these ambiguities propose using large data bases, like dictionaries with semantic fields based on θ-theory. The amount of information needed to deal with general texts is prohibitively large. We propose here to solve ambiguities by a new method: Accumulation with Inversion and then Weighted Selection, plus Learning, using only two regular dictionaries: from source to target and from target to source languages. The method is built from a number of phases: (1) during Accumulation with Inversion, all the possible translations to the target language of every word are brought, and every one of them is translated back to the source language; (2) Selection of suitable suggestions is being made by user in source language, this is the only manual phase; (3) Weighting of the selection’s results is being made by software and determines the most suitable translation to the target language; (4) Learning of word’s context will provide preferable translation in the future. Target Text Generation is based on morphological records in target language, that are produced by the disambiguation phase. To complete the missing features for word’s building, we propose here a method of Features Expansion. This method is based on assumptions about feature flow through the sentence, and on dependence of grammatical phenomena in the two languages. Software of the workstation combines four tools: Source Text Preparation, Morphological Analysis, Disambiguation and Target Text Generation. The application includes an elaborated windows interface, on which the user’s work is based.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Choueka, Lusignan. Disambiguation by short contexts. Computers and the Humanities, 19(3), 1985.
Ido Dagan, Alon Itai and Ulrike Schwall. Two languages are more informative than one. In Proc. of the Annual Meeting of the ACL, 1991.
Igal Golan, Shalom Lappin and Mori Rimon. An active bilingual lexicon for machine translation. In Proc. of COLING, 1988.
Sergei Nirenburg, editor. Machine Translation: Theoretical and Methodological Issues. Cmbridge: Cambridge University Press, 1987.
Nirenburg and Raskin. The Analysis Lexicon and the Lexicon Management System. Computers and Translation 2:177–188, 1987.
Uzzi Ornan. Phonemic Script: A central Vehicle for Processing Natural Language-The Case of Hebrew, IBM Tech. Rep.88.181, 1986.
Andrew Radford. Transformational Grammar. Cambridge University Press, 1988.
Terry Winograd. Natural Language Processing. In Scientific American, 1984. bs]In Hebrew
Ilana Cohen-Zamir. Semantic Syntactical Analyzer for Sentences out of Context in Hebrew. M.Sc. thesis, Technion, 1991.
Ido Dagan. Multilingual Statistical Approaches to Disambiguation in Natural Language. Ph.D. thesis, Technion, 1992.
Lior Goldstein. Analysis and Generation of the Possessive Inflection in Ultimate-Accented Nouns. M.Sc. project, Technion, 1992.
Kravitz Technologies Inc., Hebrew-Russian-Hebrew Dictionary on ROM card, 1992
M. Kleinbart and A. Solomonik, eds. Hebrew-Russian Dictionary. Lexicon Publishers, 1987.
Moshe Levinger. Dispersal of Morphological Opacity in Hebrew. M.Sc. thesis, Technion, 1992.
Uzzi Ornan. The Simple Sentence, 1980.
Uzzi Ornan. Computer processing of Hebrew texts in the basis of single-meaning script. Mishpatim 17(1), 1987.
Uzzi Ornan. Theoretical linguistics and artificial intelligence. In Hebrew Computational Linguistics, Israel Ministry of Science and Technology. 1992
Uzzi Ornan. Phonemic transliteration of Hebrew in Latin letters. Organization of Special Libraries, 1993.
Uzzi Ornan and Vadim Kazatzki. Processes of analysis and generation in Hebrew morphology. 13 th National Conference on DataProcessing. A.Y.L.A., 1986.
Shuli Vinter. Syntactical Analysis of Sentences in Hebrew. M.Sc. project, Technion, 1991.
Michal Shani-Klein. Analysis and Generation of Inflection of Segolate Nouns in Hebrew. M.Sc. project, Technion, 1990.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bashkansky, G., Ornan, U. (1998). Monolingual Translator Workstation. In: Farwell, D., Gerber, L., Hovy, E. (eds) Machine Translation and the Information Soup. AMTA 1998. Lecture Notes in Computer Science(), vol 1529. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-49478-2_13
Download citation
DOI: https://doi.org/10.1007/3-540-49478-2_13
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65259-5
Online ISBN: 978-3-540-49478-2
eBook Packages: Springer Book Archive