Abstract
We present a translation-free technique for multilingual information retrieval. This technique is based on an ontological representation of documents and queries. For each language, we use a dictionary (set of lexical reference for concepts) to map a term to its corresponding concept. The same mapping is applied to each document and each query. Then, we use a classic vector space model based on concept for indexing and querying the document corpus. The main advantages of our approach are: no merging phase is required; no dependency on automatic translators between all pairs of languages; and adding a new language only requires a new mapping dictionary to be added into the multilingual ontology. Experimental results on the CLEF 2005 multi8 collection show that this approach is efficient, even with relatively small and low fidelity dictionaries and without word sense disambiguation.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bozsak, E., et al.: KAON - Towards a Large Scale Semantic Web. In: Bauknecht, K., Tjoa, A.M., Quirchmayr, G. (eds.) EC-Web 2002. LNCS, vol. 2455, pp. 304–313. Springer, Heidelberg (2002)
Chen, A., Gey, F.: Combining Query Translation and Document Translation in Cross-Language Retrieval. In: Peters, C., Gonzalo, J., Braschler, M., Kluck, M. (eds.) CLEF 2003. LNCS, vol. 3237, pp. 108–121. Springer, Heidelberg (2004)
Ergane: http://download.travlang.com/ , see also http://www.majstro.com/ [visited on August 2005]
Gruber, T.R.: A translation Approach to Portable Ontology Specifications. Knowledge Acquisition 5, 199–220 (1993)
Guyot, J.: yaaa: yet another alignment algorithm - Alignement ontologique bi-texte pour un corpus multilingue. Cahier du CUI – Université de Genève (2005)
Guyot, J., Falquet, G.: Construire un moteur d’indexation. Cahier du CUI - Université de Genève (2005)
Salton, G.: Automatic Text Processing: The Transformation, Analysis and Retrieval of Information by Computer. Addison-Wesley, Reading (1989)
SnowBall [visited on August 2005], http://snowball.tartarus.org/
Uchida, H., Zhu, M., Della Senta, T.G.: Universal Networking Language. UNDL Foundation (2005)
UNL: Universal Networking Language, http://cui.unige.ch/isi/unl
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Guyot, J., Radhouani, S., Falquet, G. (2006). Conceptual Indexing for Multilingual Information Retrieval. In: Peters, C., et al. Accessing Multilingual Information Repositories. CLEF 2005. Lecture Notes in Computer Science, vol 4022. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11878773_11
Download citation
DOI: https://doi.org/10.1007/11878773_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45697-1
Online ISBN: 978-3-540-45700-8
eBook Packages: Computer ScienceComputer Science (R0)