Skip to main content
Log in

Unit Completion for a Computer-aided Translation Typing System

  • Published:
Machine Translation

Abstract

This work is in the context of, a system thatwatches over the users as they type a translation andrepeatedly suggests completions for the text already entered.The users may either accept, modify, or ignore these suggestions. Wedescribe the design, implementation, and performance of aprototype which suggests completions of units of texts that arelonger than one word.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  • Berger, Adam L., Stephen A. Della Pietra, and Vincent J. Della Pietra: 1996,‘A Maximum Entropy Approach to Natural Language Processing’, Computational Linguistics 22, 39–71.

    Google Scholar 

  • Blanchon, Hervé: 1991, ‘Problè mes de désambiguï sation interactive et TAO personnelle’ [Problems with Interactive Disambiguation and Personal MT], L'environnement traductionnel, journées scientifiques du réseau thématique de recherche “Lexicologie, terminologie, traduction”, Mons, pp. 31–48.

  • Brown, Peter F., Stephen A. Della Pietra, Vincent J. Della Pietra and Robert L. Mercer: 1993, ‘The Mathematics of Machine Translation: Parameter Estimation’, Computational Linguistics 19, 263–312.

    Google Scholar 

  • Brown, Ralf D. and Sergei Nirenburg: 1990, ‘Human-Computer Interaction for Semantic Disambiguation’, COLING 90: Papers presented to the 13th International Conference on Computational Linguistics, Helsinki, Vol. 3, pp. 42–47.

    Google Scholar 

  • Chen, Kuang Hua and Hsin-Hsi Chen: 1994, ‘Extracting Noun Phrases from Large-Scale Texts: A Hybrid Approach and its Automatic Evaluation’, 32nd Annual Meeting of the Association for Computational Linguistics, Las Cruces, New Mexico, pp. 234–241.

  • Derouault, A.-M. and B. Merialdo: 1986, ‘Natural Language Modeling for Phoneme-to-Text Transcription’, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) 8, 742–749.

    Google Scholar 

  • Dunning, Ted: 1993, ‘Accurate Methods for the Statistics of Surprise and Coincidence’, Computational Linguistics 19, 61–74.

    Google Scholar 

  • Evans, David A. and Chengxiang Zhai: 1996, ‘Noun-Phrase Analysis in Unrestricted Text for Information Retrieval’, 34th Annual Meeting of the Association for Computational Linguistics, Santa Cruz, CA, pp. 17–24.

  • Foster, George, Pierre Isabelle and Pierre Plamondon: 1997, ‘Target-Text Mediated Interactive Machine Translation’, Machine Translation 12, 175–194.

    Google Scholar 

  • Fung, Pascale: 1995, ‘A Pattern Matching Method for Finding Noun and Proper Noun Translations from Noisy Parallel Corpora’, 33rd Annual Meeting of the Association for Computational Linguistics, Cambridge, MA, pp. 236–243.

  • Furuse, Osamu and Hitoshi Iida: 1996, ‘Incremental Translation Utilizing Constituent Boundary Patterns’, COLING-96: The 16th International Conference on Computational Linguistics, Copenhagen, pp. 412–417.

  • Gaussier, Éric: 1995, Modè les statistiques et patrons morphosyntaxiques pour l'extraction de lexiques bilingues [Statistical Models and Morphosyntactic Templates for the Extraction of Bilingual Lexicons], Thè se de doctorat, Université de Paris 7.

  • Haruno, Masahiko, Satoru Ikehara and Takefumi Yamazaki: 1996, ‘Learning Bilingual Collocations by Word-Level Sorting’, COLING-96: The 16th International Conference on Computational Linguistics, Copenhagen, pp. 525–530.

  • Ikehara, Satorn, Satoshi Shirai and Hajime Uchino: 1996, ‘A Statistical Method for Extracting Uninterrupted and Interrupted Collocations from Very Large Corpora’, COLING-96: The 16 th International Conference on Computational Linguistics, Copenhagen, pp. 574–579.

  • Jelinek, Frederick: 1990, ‘Self-Organized Language Modeling for Speech Recognition’, in A. Waibel and K. Lee (eds), Readings in Speech Recognition, Morgan Kaufmann, San Mateo, CA, pp. 450–506.

    Google Scholar 

  • Kaji, Hiroyuki and Toshiko Aizono: 1996, ‘Extracting Word Correspondances from Bilingual Corpora Based on Word Co-Occurence Information’, COLING-96: The 16th International Conference on Computational Linguistics, Copenhagen, pp. 23–28.

  • Kay, Martin: 1973, The MIND System', in R. Rustin (ed.) Natural Language Processing, Algorithmics Press, New York, pp. 155–188.

    Google Scholar 

  • Kuhn, Roland and Renato De Mon: 1990, ‘A Cache-Based Natural Language Model for Speech Recognition’, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) 12, 570–583.

    Google Scholar 

  • Kupiec, Julian: 1993, ‘An Algorithm for Finding Noun Phrase Correspondences in Bilingual Corpora’, 31st Annual Meeting of the Association for Computational Linguistics, Columbus, OH, pp. 17–22.

  • Langlais, Philippe and George Foster: 2000, ‘Using Context-Dependent Interpolation to Combine Statistical Language and Translation Models for Interactive Machine Translation’, Proceedings of the 6th International Conference on Computer-Assisted Information Retrieval (RIAO), Paris, pp. 507–519.

  • Langlais, Philippe, Sébastien Sauvé, George Foster, Elliott Macklovitch, and Guy Lapalme: 2000, ‘Evaluation of TransType, a Computer-aided Translation Typing System: A Comparison of a Theoretical-and a User-Oriented Evaluation Procedures’, LREC 2000 Second International Conference on Language Resources and Evaluation, Athens, pp. 641–648.

  • Lin, Dekang: 1999, ‘Automatic Identification of Non-Compositional Phrases’, 37th Annual Meeting of the Association for Computational Linguistics, College Park, MD, pp. 317–324.

  • Maruyama, Hiroshi, Hideo Watanabe and Shiho Ogino: 1990, ‘An Interactive Japanese Parser for Machine Translation’, COLING-90: Papers presented to the 13th International Conference on Computational Linguistics, Helsinki, Vol. 2, pp. 257–262.

  • Melamed, I. Dan: 1997, ‘Automatic Discovery of Non-Compositional Compounds in Parallel Data’, Second Conference on Empirical Methods in Natural Language Processing, Providence, RI, pp. 97–108.

  • Nagao, Makoto and Shinsuke Mon: 1994, ‘A New Method of n-Gram Statistics for Large Number of n and Automatic Extraction of Words and Phrases from Large Text Data of Japanese’, COLING 94: The 15th International Conference on Computational Linguistics, Kyoto, pp. 611–615.

  • Nieß en, S., S. Vogel, H. Ney and C. Tillman: 1998, ‘A DP Based Search Algorithm for Statistical Machine Translation’, COLING-ACL '98: 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Montreal, pp. 960–966.

  • Och, Franz Josef, Christoph Tillman and Herman Ney: 1999, ‘Improved Alignment Models for Statistical Machine Translation’, Proceedings of the 1999 Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, College Park, MD, pp. 20–28.

  • Och, Franz Josef and Hans Weber: 1998, ‘Improving Statistical Natural Language Translation with Categories and Rules’, COLING-ACL '98: 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Montreal, pp. 985–989.

  • Ohomori, Kumiko and Masanobu Higashida: 1999, ‘Extracting Bilingual Collocations from Non-Aligned Parallel Corpora’, Proceedings of the 8th International Conference on Theoretical and Methodological Issues in Machine Translation (TMI 99), Chester, pp. 88–97.

  • Rapp, Reinhard: 1999, ‘Automatic Identification of Word Translations from Unrelated English and German Corpora’, 37th Annual Meeting of the Association for Computational Linguistics, College Park, MD, pp. 519–526.

  • Russell, Graham: 1998, Identification of Salient Token Sequences, Internal report, RALI, University of Montreal.

  • Shimohata, Sayori, Toshiyuki Sugio and Junji Nagata: 1997, ‘Retrieving Collocations by Cooccurrences and Word Order Constraints’, 35th Annual Meeting of the Association for Computational Linguistics and 8th Conference of the European Chapter of the Association for Computational Linguistics, Madrid, pp. 476–481.

  • Su, Keb-Yih, Ming-Wen Wu and Jing-Shin Chang: 1994, ‘A Corpus-Based Approach to Automatic Compound Extraction’, 32nd Annual Meeting of the Association for Computational Linguistics, Las Cruces, New Mexico, pp. 242–247.

  • Tanaka, Kumiko and Hideya Iwasaki: 1996, ‘Extraction of Lexical Translations from Non-Aligned Corpora’, COLING-96: The 16th International Conference on Computational Linguistics, Copenhagen, pp. 580–585.

  • Tanaka, Takaaki and Yoshihiro Matsuo: 1999, ‘Extraction of Translation Equivalents from Non-Parallel Corpora’, Proceedings of the 8th International Conference on Theoretical and Methodological Issues in Machine Translation (TMI 99), Chester, pp. 109–119.

  • Tillman, C., S. Vogel, H. Ney and A. Zubiaga: 1997, ‘A DP Based Search Using Monotone Alignments in Statistical Translation’, 35th Annual Meeting of the Association for Computational Linguistics and 8th Conference of the European Chapter of the Association for Computational Linguistics, Madrid, pp. 289–296.

  • Wang, Ye-Yi and Alex Waibel: 1997, ‘Decoding Algorithm in Statistical Machine Translation’, 35 th Annual Meeting of the Association for Computational Linguistics and 8th Conference of the European Chapter of the Association for Computational Linguistics, Madrid, pp. 366–372.

  • Wang, Ye-Yi and Alex Waibel: 1998a, ‘Fast Decoding for Statistical Machine Translation’, 5th International Conference on Spoken Language Processing, ICSLP'98 Proceedings, Sydney, pp. 2775–2779.

  • Wang, Ye-Yi and Alex Waibel: 1998, ‘Modeling with Structures in Statistical Machine Translation’, COLING-ACL '98: 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Montreal, pp. 1357–1363.

  • Whitelock, P. J., M. McGee Wood, B. J. Chandler, N. Holden and H. J. Horsfall: 1986, ‘Strategies for Interactive Machine Translation: The Experience and Implications of the UMIST Japanese Project’, 11th International Conference on Computational Linguistics: Proceedings of Coling '86, Bonn, pp. 329–334.

  • Wu, Dekai: 1995, ‘Stochastic Inversion Transduction Grammars, with Application to Segmentation, Bracketing, and Alignment of Parallel Corpora’, Fourteenth International Joint Conference on Artificial Intelligence, Montreal, pp. 1328–1335.

  • Wu, Dekai and Hongsing Wong: 1998, ‘Machine Translation with a Stochastic Grammatical Channel’, COLING-ACL '98: 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Montreal, pp. 1408–1414.

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Langlais, P., Foster, G. & Lapalme, G. Unit Completion for a Computer-aided Translation Typing System. Machine Translation 15, 267–294 (2000). https://doi.org/10.1023/A:1012262211784

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1012262211784

Navigation