Abstract
The effects of query structures and query expansion (QE) on retrieval performance were tested with a best match retrieval system (InQuery1). Query structure means the use of operators to express the relations between search keys. Six different structures were tested, representing strong structures (e.g., queries with facets or concepts identified) and weak structures (no concepts identified, a query is ‘a bag of search keys’). QE was based on concepts, which were first selected from a searching thesaurus, and then expanded by semantic relationships given in the thesaurus. The expansion levels were (a) no expansion, (b) a synonym expansion, (c) a narrower concept expansion, (d) an associative concept expansion, and (e) a cumulative expansion of all other expansions. With weak structures and Boolean structured queries, QE was not very effective. The best performance was achieved with a combination of a facet structure, where search keys within a facet were treated as instances of one search key (the SYN operator), and the largest expansion.
Article PDF
Similar content being viewed by others
References
Allan J, Callan J, Croft B, Ballesteros L, Broglio J, Xu J and Shu H (1997) INQUERY at TREC 5. In: Voorhees EM and Harman DK, Eds., Information Technology: The Fifth Text Retrieval Conference (TREC-5). National Institute of Standards and Technology, Gaithersburg, MD, pp. 119¶132.
Belkin N, Kantor P, Fox EA and Shaw JA (1995) Combining evidence of multiple query representations for information retrieval. Information Processing and Management, 31(3):431¶448.
Buckley C, Mitra M, Walz J and Cardie C (1998) Using clustering and superconcepts within SMART: TREC 6. Online, available from: <URL: http://trec.nist.gov /pubs/trec6/papers/cornell.ps>, cited 7.7.1998 (to appear in Proceedings of TREC-6).
Conover WJ (1980) Practical Nonparametric Statistics, 2nd ed. John Wiley & Sons, New York.
Efthimiadis EN (1996) Query expansion. In: Williams ME, Ed., Annual Review of Information Science and Technology, Vol. 31. Information Today, Medford, NJ, pp. 121¶187.
Fidel R and Efthimiadis EN (1995) Terminological knowledge structure for intermediary expert systems. Information Processing & Management, 31(1):15¶27.
Green R (1995) The expression of conceptual syntagmatic relationships: A comparative survey.Journal of Documentation, 51(4):315¶338.
Harman DK (1995) Overview of the Fourth Text Retrieval Conference (TREC-4). Online, available from: <URL: http://trec.nist.gov/pubs/trec4 /papers/overview.ps>, cited 5.2.1998.
Hawking D, Thistlewaite P and Bailey P (1997) ANU/ACSys TREC-5 experiments. In: Voorhees EM and Harman DK, Eds., Information technology: The Fifth Text Retrieval Conference (TREC-5). National Institute of Standards and Technology, Gaithersburg, MD, pp. 359¶375.
Hawking D, Thistlewaite P and Craswell P (1997) ANU/ACSys TREC-6 Experiments. Online, available from: <URL: http://trec.nist.gov/pubs/trec6/papers/anu.ps>, cited 26.2.1998 (to appear in Proceedings of TREC-6).
Hull D (1993) Using statistical testing in the evaluation of retrieval experiments. In: Korfhage R, Rasmussen EM and Willett P, Eds., Proceedings of the 16th International Conference on Research and Development in Information Retrieval. New York, NY, ACM, pp. 349¶338.
Hull DA (1997) Using structured queries for disambiguation in cross-language information retrieval. In: AAAI Spring Symposium on Cross-Language Text and Speech Retrieval Electronic Working Notes. Online, available from: <URL: http://www.clis.umd.edu/dlrg/filter/sss/papers/hull3.ps>, cited 13.8.1997.
Ingwersen P and Willett P (1995) An introduction to algorithmic and cognitive approaches for information retrieval. Libri, 45:160¶177.
Järvelin K, Kristensen J, Niemi T, Sormunen E, and Keskustalo H (1996) A deductive data model for query expansion. In: Frei H-P, Harman D, Schäuble P and Wilkinson R, Eds., Proceedings of the 19th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, NY, pp. 235¶249.
Keen EM (1991) The use of term position devices in ranked output experiments. Journal of Documentation, 47(1):1¶22.
Kekäläinen J and Järvelin K (1998) The impact of query structure and query expansion on retrieval performance. In: Croft WB, Moffat A, van Rijsbergen CJ, Wilkinson R and Zobel J, Eds., Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, pp. 130¶137.
Kekäläinen J (1999) The Effects of Query Complexity, Expansion and Structure on Retrieval Performance in Probabilistic Text Retrieval. Ph.D. Thesis, University of Tampere. Acta Universitatis Tamperensis, Vol. 678.
Pirkola A (1998) The Effects of Query Structure and Dictionary setups in Dictionary-Based Cross-Language Information Retrieval. In: Croft WB, Moffat A, van Rijsbergen CJ, Wilkinson R and Zobel J, Eds., Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, pp. 55¶63.
Rajashekar TB and Croft WB (1995) Combining automatic and manual index representations in probabilistic retrieval. Journal of the American Society for Information Science, 46(4):272¶283.
Shaw JA and Fox EA (1995) Combination of multiple searches. In: Harman DK, Ed., The Third Text REtrieval Conference (TREC-3). National Institute of Standards and Technology, Gaithersburg, MD, pp. 105¶108.
Sormunen E (1994) Vapaatekstihaun tehokkuus ja siihen vaikuttavat tekijät sanomalehtiaineistoa sisältävässä tekstikannassa [Free-text searching efficiency and factors affecting it in a newspaper article database]. VTT Julkaisuja 790. Espoo: Valtion Teknillinen Tutkimuskeskus. [In Finnish.]
Turtle HR (1990) Inference Networks for Document Retrieval. Ph.D. Dissertation, Computer and Information Science Department, University of Massachusetts. COINS Technical Report, pp. 90¶92.
UMLS (1994) UMLS Knowledge Sources, 5th experimental edn. National Library of Medicine, Bethesda, MD.
Wang Y-C, Vandendorpe J and Evens M (1985) Relational thesauri in information retrieval. Journal of the American Society for Information Science, 36(1):15¶27.
Voorhees E (1994) Query expansion using lexical-semantic relations. In: Bruce Croft W and van Rijsbergen CJ, Eds., Proceedings of the 17th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, NY, pp. 61¶69.
Xu J and Croft WB (1996) Query expansion using local and global document analysis. In: Frei H-P, Harman D, Schäuble P and Wilkinson R, Eds., Proceedings of the 19th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, NY, pp. 4¶11.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Kekäläinen, J., Järvelin, K. The Co-Effects of Query Structure and Expansion on Retrieval Performance in Probabilistic Text Retrieval. Information Retrieval 1, 329–344 (2000). https://doi.org/10.1023/A:1009983401464
Issue Date:
DOI: https://doi.org/10.1023/A:1009983401464