
Integrating Query Translation and Document Translation in a Cross-Language Information Retrieval System

  • Conference paper
Machine Translation and the Information Soup (AMTA 1998)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 1529)


Abstract

The explosive growth of the WWW has produced very large multilingual textual resources, which have motivated research in Cross-Language Information Retrieval and online Web Machine Translation. This paper proposes integrating language translation with text processing systems to build a multilingual information system. A distributed English-Chinese system on the WWW is introduced to illustrate how to integrate query translation, search engines, and a web translation system. Since July 1997, more than 46,000 users have accessed our system, and about 250,000 English web pages have been translated into Chinese or bilingual English-Chinese versions. The average user satisfaction at the document level is 67.47%.




Copyright information

© 1998 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bian, GW., Chen, HH. (1998). Integrating Query Translation and Document Translation in a Cross-Language Information Retrieval System. In: Farwell, D., Gerber, L., Hovy, E. (eds) Machine Translation and the Information Soup. AMTA 1998. Lecture Notes in Computer Science, vol 1529. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-49478-2_23


  • DOI: https://doi.org/10.1007/3-540-49478-2_23


  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-65259-5

  • Online ISBN: 978-3-540-49478-2

  • eBook Packages: Springer Book Archive
