Abstract
This paper reports experimental results of cross-lingual information retrieval from German to Italian. The authors are concerned with CLIR in cases where available language resources are very limited. Thus transitive translation of queries using English as a pivot language was used to search Italian document collections for German queries without any direct bilingual dictionary or MT system for these two languages. In order to remove irrelevant translations produced by the transitive translation, we propose a new disambiguation technique, in which two stages of refinement of query translation are executed. Basically, this refinement is based on the idea of pseudo relevance feedback. Our experimental results show that the two-stage refinement method is able to significantly improve search performance of bilingual IR using a pivot language.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ballesteros, L.A., Croft, W.B.: Phrasal translation and query expansion techniques for cross-language information retrieval. In: Proceedings of the 20th ACM SIGIR conference on Research and Development in Information Retrieval, pp. 84–91 (1977)
Ballesteros, L., Croft, W.B.: Resolving ambiguity for cross-language retrieval. In: Proceedings of the 21st ACM SIGIR conference on Research and Development in Information Retrieval, pp. 64–71 (1988)
Yamabana, K., Muraki, K., Doi, S., Kamei, S.: A language conversion front-end for cross-language information retrieval. In: Grefenstette, G. (ed.) Cross-Language Information Retrieval, pp. 93–104. Kluwer, Boston (1998)
Gao, J., Nie, J.Y., Xun, E.X., Zhang, J., Zhou, M., Huang, C.: Improving query translation for cross-language information retrieval using statistical models. In: Proceedings of 24th ACM SIGIR conference on Research and Development in Information Retrieval, pp. 96–104 (2001)
Lin, C.J., Lin, W.C., Bian, G.W., Chen, H.H.: Description of the NTU Japanese-English cross-lingual information retrieval system used for NTCIR workshop. In: Proceedings of the First NTCIR Workshop on Research in Japanese Text Retrieval and Term Recognition, National Institute of Informatics, Tokyo (1999), http://research.nii.ac.jp/ntcir/workshop/
Sadat, F., Maeda, A., Yoshikawa, M., Uemura, S.: Query expansion techniques for the CLEF Bilingual track. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.) CLEF 2001. LNCS, vol. 2406, pp. 177–184. Springer, Heidelberg (2002)
Adriani, M.: English-Dutch CLIR using query translation techniques. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.) CLEF 2001. LNCS, vol. 2406, pp. 219–225. Springer, Heidelberg (2002)
Qu, Y., Grefenstette, G., Evans, D.A.: Resolving translation ambiguity using monolingual corpora: a report on Clairvoyance CLEF 2002 experiments. In: Working Notes for the CLEF 2002 Workshop, pp. 115–126 (2002)
Seo, H.C., Kim, S.B., Kim, B.I., Rim, H.C., Lee, S.Z.: KUNLP system for NTCIR-3 English-Korean cross-language information retrieval. In: Proceedings of the Third NTCIR Workshop on Research in Information Retrieval, Automatic Text Summarization and Question Answering, National Institute of Informatics, Tokyo (2003), http://research.nii.ac.jp/ntcir/workshop/
Ballesteros, L.A.: Cross-language retrieval via transitive translation. In: Croft, W.B. (ed.) Advances in Information Retrieval: Recent Research from the Center for Intelligent Information Retrieval, pp. 203–234. Kluwer, Boston (2000)
Franz, M., McCarley, J.S., Roukos, S.: Ad hoc and multilingual information retrieval at IBM. In: Proceedings of the TREC-7, National Institute of Standards and Technology, Gaithersburg (1999), http://trec.nist.gov/pubs/
Gey, F.C., Jiang, H., Chen, A., Larson, R.R.: Manual queries and machine translation in cross-language retrieval at TREC-7. In: Proceedings of the TREC-7, National Institute of Standards and Technology, Gaithersburg (1999), http://trec.nist.gov/pubs/
Hiemstra, D., Kraaij, W.: Twenty-one at TREC-7: ad-hoc and cross-language track. In: Proceedings of the TREC-7, National Institute of Standards and Technology, Gaithersburg (1999), http://trec.nist.gov/pubs/
Chen, A., Gey, F.C.: Experiments on cross-language and patent retrieval at NTCIR-3 workshop. In: Proceedings of the Third NTCIR Workshop on Research in Information Retrieval, Automatic Text Summarization and Question Answering, National Institute of Informatics, Tokyo (2003), http://research.nii.ac.jp/ntcir/workshop/
Lin, W.C., Chen, H.H.: Description of NTU approach to Multilingual Information retrieval. In: Proceedings of the Third NTCIR Workshop on Research in Information Retrieval, Automatic Text Summarization and Question Answering, National Institute of Informatics, Tokyo (2003), http://research.nii.ac.jp/ntcir/workshop/
Gollins, T., Sanderson, M.: Improving cross language information retrieval with triangulated translation. In: Proceedings of the 24th ACM SIGIR conference on Research and Development in Information Retrieval, pp. 90–95 (2001)
Porter, M.F.: An algorithm for suffix stripping. Program 14, 130–137 (1980)
Robertson, S.E., Walker, S., Jones, S., Hancock-Beaulieu, M.M., Gatford, M.: Okapi at TREC-3. In: Proceedings of TREC-3, National Institute of Standards and Technology, Gaithersburg (1995), http://trec.nist.gov/pubs/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kishida, K., Kando, N. (2004). Two-Stage Refinement of Query Translation in a Pivot Language Approach to Cross-Lingual Information Retrieval: An Experiment at CLEF 2003. In: Peters, C., Gonzalo, J., Braschler, M., Kluck, M. (eds) Comparative Evaluation of Multilingual Information Access Systems. CLEF 2003. Lecture Notes in Computer Science, vol 3237. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30222-3_24
Download citation
DOI: https://doi.org/10.1007/978-3-540-30222-3_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24017-4
Online ISBN: 978-3-540-30222-3
eBook Packages: Springer Book Archive