Skip to main content

SICS at CLEF 2002: Automatic Query Expansion Using Random Indexing

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2785))

Abstract

Vector space techniques can be used for extracting semantically similar words from the co-occurrence statistics of words in large text data collections. We have used a technique called Random Indexing to accumulate context vectors for Swedish, French and Italian. We have then used the context vectors to perform automatic query expansion. In this paper, we report on our CLEF 2002 experiments on Swedish, French and Italian monolingual query expansion.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. R. Bayer and K. Unterauer. Prefix B-trees. ACM Transactions on Database Systems, 2(1):11–26, March 1977. 314

    Article  Google Scholar 

  2. S. Deerwester, S. Dumais, G. Furnas, T. Landauer, and R. Harshman. Indexing by latent semantic analysis. Journal of the Society for Information Science, 41(6):391–407, 1990. 312

    Article  Google Scholar 

  3. M. J. Folk, B. Zoellick, and G. Riccardi. File Structures: An Object-Oriented Approach with C++. Addison-Wesley, 3rd edition, 1998. 314

    Google Scholar 

  4. Z. Harris. Mathematical Structures of Language. Interscience publishers, 1968. 313

    Google Scholar 

  5. P. Kanerva, J. Kristofersson, and A. Holst. Random indexing of text samples for latent semantic analysis. In Proceedings of the 22nd Annual Conference of the Cognitive Science Society, page 1036. Erlbaum, 2000. 312

    Google Scholar 

  6. J. Karlgren and M. Sahlgren. From words to understanding. In Y. Uesaka, P. Kanerva, and H. Asoh, editors, Foundations of Real World Intelligence, pages 294-308. CSLI publications, 2001. 312, 313

    Google Scholar 

  7. S. Kaski. Dimensionality reduction by random mapping: Fast similarity computation for clustering. In Proceedings of the IJCNN’98, International Joint Conference on Neural Networks, pages 413-418. IEEE Service Center, 1998. 312

    Google Scholar 

  8. T. Landauer and S. Dumais. A solution to Plato’s problem: The latent semantic analysis theory of acquisition, induction and representation of knowledge. Psychological Review, 104(2):211–240, 1997. 312

    Article  Google Scholar 

  9. K. Lund and C. Burgess. Producing high-dimensional semantic spaces from lexical co-occurrence. Behavior Research Methods, Instruments and Computers, 28(2):203–208, 1996. 312

    Article  Google Scholar 

  10. C. Monz, J. Kamps, and M. de Rijke. Combining Evidence for Cross-language Information Retrieval. This volume. 318

    Google Scholar 

  11. Y. Qiu and H.P. Frei. Concept based query expansion. In Proceedings of the 16th ACM SIGIR Conference on Research and Development in Information Retrieval, pages 160-169, 1993. 317

    Google Scholar 

  12. H. E. Williams and J. Zobel. Compressing integers for fast file access. The Computer Journal, 42(3):193–201, 1999. 315

    Article  Google Scholar 

  13. I. H. Witten, A. Moffat, and T. C. Bell. Managing Gigabytes: Compressing and Indexing Documents and Images. Morgan Kaufmann Publishing, 2nd edition, 1999. 314, 315

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Sahlgren, M., Karlgren, J., Cöster, R., Järvinen, T. (2003). SICS at CLEF 2002: Automatic Query Expansion Using Random Indexing. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds) Advances in Cross-Language Information Retrieval. CLEF 2002. Lecture Notes in Computer Science, vol 2785. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45237-9_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-45237-9_26

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-40830-7

  • Online ISBN: 978-3-540-45237-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics