Caption and Query Translation for Cross-Language Image Retrieval

  • Conference paper
Multilingual Information Access for Text, Speech and Images (CLEF 2004)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 3491)

For many cross-language retrieval tasks, the predominant approach is to translate the query into the language of the document collection (the target language). This often gives results as good as, if not better than, translating the document collection into the query language (the source language). In this paper, we evaluate query versus document translation for the ImageCLEF 2004 bilingual ad hoc retrieval task. Image retrieval is achieved by matching textual queries to associated image captions for the following languages: French, German, Spanish and Italian, using commercially and publicly available resources. On average, we find that query translation outperforms document translation (77% of English MAP compared with 65%), but this varies widely across languages and queries. Combining document and query translation, we achieve an average MAP of 85% of English.
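The figures above are reported as a percentage of English (monolingual) MAP. As a minimal sketch of how such a figure can be computed, the following assumes the standard definition of mean average precision over binary relevance judgements. The function names, query ids, and toy rankings are hypothetical and are not taken from the paper; they only illustrate the arithmetic.

```python
from statistics import mean


def average_precision(ranked_ids, relevant_ids):
    """Average precision of one ranked list against a set of relevant ids."""
    relevant = set(relevant_ids)
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc_id in enumerate(ranked_ids, start=1):
        if doc_id in relevant:
            hits += 1
            # Precision at this rank, counted only where a relevant item occurs.
            precision_sum += hits / rank
    return precision_sum / len(relevant)


def mean_average_precision(runs, qrels):
    """Mean of per-query average precision over all judged queries."""
    return mean(average_precision(runs[q], qrels[q]) for q in qrels)


def percent_of_monolingual(cross_map, mono_map):
    """Cross-language MAP expressed as a percentage of the monolingual MAP."""
    return 100.0 * cross_map / mono_map


# Hypothetical toy data: two queries, image ids as single letters.
qrels = {"q1": {"a", "b"}, "q2": {"c"}}
english_run = {"q1": ["a", "b", "x"], "q2": ["c", "y", "z"]}
translated_run = {"q1": ["a", "x", "b"], "q2": ["y", "c", "z"]}

english_map = mean_average_precision(english_run, qrels)
translated_map = mean_average_precision(translated_run, qrels)
relative = percent_of_monolingual(translated_map, english_map)
print(f"{relative:.1f}% of English MAP")
```

Normalising by the English run makes scores comparable across source languages, since each language is measured against the same monolingual ceiling.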





Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Clough, P. (2005). Caption and Query Translation for Cross-Language Image Retrieval. In: Peters, C., Clough, P., Gonzalo, J., Jones, G.J.F., Kluck, M., Magnini, B. (eds) Multilingual Information Access for Text, Speech and Images. CLEF 2004. Lecture Notes in Computer Science, vol 3491. Springer, Berlin, Heidelberg.


  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-27420-9

  • Online ISBN: 978-3-540-32051-7

  • eBook Packages: Computer Science (R0)
