Abstract
In our first participation in the CLEF retrieval tasks, our primary objective was to define a general stopword list for several European languages (namely French, Italian, German, and Spanish) and to suggest simple and efficient stemming procedures for these languages. Our second aim was to suggest a combined approach that could facilitate effective access to multilingual collections.
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Savoy, J. (2002). Report on CLEF-2001 Experiments: Effective Combined Query-Translation Approach. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds) Evaluation of Cross-Language Information Retrieval Systems. CLEF 2001. Lecture Notes in Computer Science, vol 2406. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45691-0_3
DOI: https://doi.org/10.1007/3-540-45691-0_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44042-0
Online ISBN: 978-3-540-45691-9
eBook Packages: Springer Book Archive