Abstract
The LIC2M has designed a cross-lingual search engine based on a deep linguistic analysis of documents and queries that works on French, English, Spanish, German, Arabic and Chinese. For our participation in the CLEF 2004 campaign, we tested the integration in our system of Russian and Finnish, based on a simplified processing. The results we obtained are not good on the new languages introduced, which shows that our system strongly depends on a correct linguistic analysis of the documents. However, integrating more processing steps in the simplified analysis of new languages so that the results of this analysis are more comparable with the results of the complete linguistic analysis seems to be a good direction for improvements.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Besançon, R., de Chalendar, G., Ferret, O., Fluhr, C., Mesnard, O., Naets, H.: The LIC2M’s CLEF 2003 System. In: Working Notes for the CLEF 2003 Workshop, Trondheim, Norway (2003)
Porter, M.: Finnish Snowball Stemmer (2002), http://snowball.tartarus.org/finnish/stemmer.html
Savoy, J.: A Stopword List for Finnish, http://www.unine.ch/info/clef/
Hämäläinen, K., Kivirinta, T.: Freelang Finnish-English Dictionary, http://www.kasvua.org/~kphamala/dict.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Besançon, R., Ferret, O., Fluhr, C. (2005). Integrating New Languages in a Multilingual Search System Based on a Deep Linguistic Analysis. In: Peters, C., Clough, P., Gonzalo, J., Jones, G.J.F., Kluck, M., Magnini, B. (eds) Multilingual Information Access for Text, Speech and Images. CLEF 2004. Lecture Notes in Computer Science, vol 3491. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11519645_8
Download citation
DOI: https://doi.org/10.1007/11519645_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27420-9
Online ISBN: 978-3-540-32051-7
eBook Packages: Computer ScienceComputer Science (R0)