Abstract
Due to the explosive growth of the WWW, very large multilingual textual resources have motivated the researches in Cross-Language Information Retrieval and online Web Machine Translation. In this paper, the integration of language translation and text processing system is proposed to build a multilingual information system. A distributed English-Chinese system on WWW is introduced to illustrate how to integrate query translation, search engines, and web translation system. Since July 1997, more than 46,000 users have accessed our system and about 250,000 English web pages have been translated to pages in Chinese or bilingual English-Chinese versions. And the average satisfaction degree of users at document level is 67.47%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bian, G.W. and Chen, H.H.: “An MT Meta-Server for Information Retrieval on WWW.” In Working Notes of the AAAI Spring Symposium on Natural Language Processing for the World Wide Web, Palo Alto, California, USA, March, 1997, pp. 10–16.
David, M.W. and Ogden, W.C.: “QUILT: Implementing a Large-Scale Cross-Language Text Retrieval System.” In Proceedings of ACM SIGIR’97, 1997, pp. 92–98.
Gachot, D.A.; Lange, E. and Yang, J.: “The SYSTRAN NLP Browser: An Application of Machine Translation Technology in Multilingual Information Retrieval.” In Proceedings of Workshop on Cross-Linguistic Information Retrieval, 1996, pp. 44–54.
Hayashi, Y; Kikui, G. and Susaki, S.: “TITAN: A Cross-linguistic Search Engine for the WWW.” In Working Notes of the AAAI-97 Spring Symposium on Cross-Language Text and Speech Retrieval, 1997, pp. 58–65.
Ballesteros, L. and Croft, W.C.: “Phrasal Translation and Query Expansion Techniques for Cross-Language Information retrieval.” In Proceedings of ACM SIGIR’97, pp. 84–91, 1997.
David, M.W.: “New Experiments in Cross-Language Text Retrieval at New Mexico State University’s Computing Research Laboratory.” In Proceedings of the Fifth Text Retrieval Evaluation Conference (TREC-5), Gaithersburg, MD, National Institute of Standards and Technology, 1996.
Hull, D.A. and Grefenstette, G.: “Querying Across Languages: A Dictionary-Based Approach to Multilingual Information Retrieval.” In Proceedings of ACM SIGIR’96, pp. 49–57, 1996.
Kwok, K.L.: “Evaluation of an English-Chinese Cross-Lingual Retrieval Experiment.” In Working Notes of the AAAI-97 Spring Symposium on Cross-Language Text and Speech Retrieval, 1997, pp. 110–114.
David, M.W. and Dunning, T.: “A TREC Evaluation of Query Translation Methods for Multi-Lingual Text Retrieval.” In Proceedings of the Fourth Text Retrieval Evaluation Conference (TREC-4), Gaithersburg, MD, National Institute of Standards and Technology, 1995.
Landauer, T.K. and Littman, M.L.: “Fully Automatic Cross-Language Document Retrieval.” In Proceedings of the Sixth Conference on Electronic Text Research, pp. 31–38, 1990.
Radwan, K.: Vers l’Acces Multilingue en Langage Naturel aux Baess de Donnees Textuelles. PhD Thesis, Universite de Paris-Sud, Centre d’Orsay. 1994.
Church, K. and Hanks, P.: “Word Association Norms, Mutual Information and Lexicography.” Computational Linguistics, 16(1), 1990, pp. 22–29.
Fox, E., ed.: Virginia Disk One, Blacksburg: Virginia Polytechnic Institute and State University, 1990.
Salton, G. and Buckley, C: “Term Weighting Approaches in Automatic Text Retrieval.” Information Processing and Management, 5(24), 1988, pp. 513–523.
Rijsbergen, C.J. van: Information Retrieval, 2nd Edition, London, Butterworths, 1979.
Baker, K., et al.: “Coping with Ambiguity in a Large-scale Machine Translation System.” In Proceedings of COLING-94, 1994, pp. 90–94.
Bennett, W. and Slocum, J.: “The LRC Machine Translation System.” Computational Linguistics, 11(2–3), 1985, pp. 111–119.
Brown, P.; et al: “A Statistical Approach to Machine Translation.” Computational Linguistics, 16(2), 1990, pp. 79–85.
Mitamura, T.; Nyberg, E. and Carbonell, J.: “An Efficient Interlingua Translation System for Multilingual Document Production.” In Proceedings of Machine Translation Summit III, 1991.
Nagao, M.: “A Framework of Mechanical Translation between Japanese and English by Analogy Principle.” Artificial and Human Intelligence, 1984, pp. 173–180.
Nirenburg, S., et al: “Multi-purpose Development and Operations Environments for Natural Language Applications,” In Proceedings of Applied Language Processing, Trento, Italy, 1993.
Chen, K.H. and Chen, H.H.: “Machine Translation: An Integrated Approach.” In Proceedings of the 6th International Conference on Theoretical and Methodological Issues in Machine Translation, 1995, pp. 287–294.
Chen, K.H. and Chen, H.H.: “Acquisition of Verb Subcategorization Frames from Large Scale Texts.” In Proceedings of KONVENS94, 1994, pp. 407–410.
Chen, K.H. and Chen, H.H.: “A Rule-Based and MT-Oriented Approach to Prepositional Phrases Attachment.” In Proceedings of the 16 th COLING, 1996, pp. 216–221.
Chen, H.H. and Bian, G.W.: “Proper Name Extraction from Web Pages for Finding People in Internet.” In Proceedings of the 10 th Research on Computational Linguistics (ROCLING X) International Conference, 1997, pp. 143–158.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bian, GW., Chen, HH. (1998). Integrating Query Translation and Document Translation in a Cross-Language Information Retrieval System. In: Farwell, D., Gerber, L., Hovy, E. (eds) Machine Translation and the Information Soup. AMTA 1998. Lecture Notes in Computer Science(), vol 1529. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-49478-2_23
Download citation
DOI: https://doi.org/10.1007/3-540-49478-2_23
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65259-5
Online ISBN: 978-3-540-49478-2
eBook Packages: Springer Book Archive