Abstract
Information Retrieval systems can benefit from advanced linguistic resources when carrying out tasks such as word-stemming or query translation. The main goal of our experiments has been the development of methodologies that minimize the human labor needed for creating linguistic resources for new languages. For this purpose, we have applied statistical techniques to extract information directly from the collections.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Di Nunzio, G.M., Ferro, N., Melucci, M., Orio, N.: The University of Padova at CLEF 2003: Experiments to Evaluate Probabilistic Models for Automatic Stemmer Generation and Query Word Translation. In: Peters, C., Gonzalo, J., Braschler, M., Kluck, M. (eds.) CLEF 2003. LNCS, vol. 3237, pp. 211–223. Springer, Heidelberg (2004)
Gibbons, J.D.: Nonparametric Statistical Inference, 2nd edn. Marcel Dekker, Inc., New York (1985)
Johnson, S.C.: Hierarchical Clustering Schemes. Psychometrika 32, 241–254 (1967)
Nie, J.-Y., Simard, M., Isabelle, P., Durand, R.: Cross-language Information Retrieval Based on Parallel Texts and Automatic Mining of Parallel Texts from the Web. In: Proc. of the 22nd ACM SIGIR Conference, Berkeley, CA, pp. 74–81 (1999)
Rabiner, L., Juang, B.H.: Fundamentals of speech recognition, pp. 321–389. Prentice Hall, Englewood Cliffs (1993)
Sheridan, P., Ballerini, J.P.: Experiments in Multilingual Information Retrieval Using the SPIDER System. In: Proc. of the 19th ACM SIGIR Conference, Zurich, Switzerland, pp. 58–65 (1996)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Di Nunzio, G.M., Ferro, N., Orio, N. (2005). Experiments on Statistical Approaches to Compensate for Limited Linguistic Resources. In: Peters, C., Clough, P., Gonzalo, J., Jones, G.J.F., Kluck, M., Magnini, B. (eds) Multilingual Information Access for Text, Speech and Images. CLEF 2004. Lecture Notes in Computer Science, vol 3491. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11519645_6
Download citation
DOI: https://doi.org/10.1007/11519645_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27420-9
Online ISBN: 978-3-540-32051-7
eBook Packages: Computer ScienceComputer Science (R0)