short-paper

Refining Query Expansion Terms using Query Context

Authors:
Reuben Crimp

University of Otago, Dunedin, New Zealand

University of Otago, Dunedin, New Zealand
View Profile

,
Andrew Trotman

University of Otago, Dunedin, New Zealand

University of Otago, Dunedin, New Zealand
View Profile

ADCS '18: Proceedings of the 23rd Australasian Document Computing SymposiumDecember 2018Article No.: 12Pages 1–4https://doi.org/10.1145/3291992.3292000

Published:11 December 2018Publication History

ADCS '18: Proceedings of the 23rd Australasian Document Computing Symposium

Pages 1–4

ABSTRACT

Query expansion is commonly used to combat the vocabulary mismatch problem, it bridges the disparity between the vocabulary used in the corpus and search queries. However, if expansion terms are not chosen carefully, there is a risk of including spurious expansion terms, which can broaden the potential interpretations of the modified query. Unintentionally increasing the semantic ambiguity in this way is known as query drift.

In this short paper we propose using the query context to inform the expansion term selection process. Using WordNet as an initial source of expansion terms, we refine the candidate expansions by discriminating relevancy. We found that our term selection process is more effective than the standard approach. Our technique targets terms which relate to the entire query as a whole, but predominately focuses on excluding spurious expansion terms. Both help reduce query drift and increase query performance.

References

Jing Bai, Dawei Song, Peter Bruza, Jian-Yun Nie, and Guihong Cao. 2005. Query Expansion Using Term Relationships in Language Models for Information Retrieval. In Proceedings of the 14th ACM International Conference on Information and Knowledge Management (CIKM '05). ACM, New York, NY, USA, 688--695. Google ScholarDigital Library
Claudio Carpineto, Renato de Mori, Giovanni Romano, and Brigitte Bigi. 2001. An Information-theoretic Approach to Automatic Query Expansion. ACM Trans. Inf. Syst. 19, 1 (Jan. 2001), 1--27. Google ScholarDigital Library
Claudio Carpineto and Giovanni Romano. 2012. A Survey of Automatic Query Expansion in Information Retrieval. ACM Comput. Surv. 44, 1, Article 1 (Jan. 2012), 50 pages. Google ScholarDigital Library
Reuben Crimp and Andrew Trotman. 2017. Automatic Term Reweighting for Query Expansion. In Proceedings of the 22Nd Australasian Document Computing Symposium (ADCS 2017). ACM, New York, NY, USA, Article 3, 4 pages. Google ScholarDigital Library
Tamas E. Doszkocs. 1978. AID, An Associative Interactive Dictionary for Online Searching. Online Information Review 2 (12 1978), 163--173.Google Scholar
David S. Johnson, Maria Minkoff, and Steven Phillips. 2000. The Prize Collecting Steiner Tree Problem: Theory and Practice. In Proceedings of the Eleventh Annual ACM-SIAM Symposium on Discrete Algorithms (SODA '00). Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, 760--769. Google ScholarDigital Library
Prashanti Manda and Todd Vision. 2018. An analysis and comparison of the statistical sensitivity of semantic similarity metrics. bioRxiv (2018).Google Scholar
G. A. Miller. 1995. WordNet: A Lexical Database for English. CACM 38, 11 (1995), 39--41. Google ScholarDigital Library
S.E. Robertson, S. Walker, S. Jones, M.M. Hancock-Beaulieu, and M. Gatford. 1996. Okapi at TREC-3. 109--126.Google Scholar
S.E. Robertson. 1991. On Term Selection for Query Expansion. J. Doc. 46, 4 (Jan. 1991), 359--364. Google ScholarDigital Library
S. E. Robertson and Sparck J. K. 1976. Relevance Weighting of Search Terms. Journal of the American Society for Information Science (pre-1986) 27, 3 (May 1976), 129. Copyright - Copyright Wiley Periodicals Inc. May/Jun 1976; Last updated - 2010-06-09.Google Scholar
J. J. Rocchio. 1971. Relevance feedback in information retrieval. In The Smart retrieval system - experiments in automatic document processing, G. Salton (Ed.). Englewood Cliffs, NJ: Prentice-Hall, 313--323.Google Scholar
G. Salton and M. E. Lesk. 1968. Computer Evaluation of Indexing and Text Processing. J. ACM 15, 1 (1968), 8--36. Google ScholarDigital Library
A. Trotman, C. L. A. Clarke, I. Ounis, S. Culpepper, M.-A. Cartright, and S. Geva. 2012. Open Source Information Retrieval: A Report on the SIGIR 2012 Workshop. SIGIR Forum 46, 2 (2012), 95--101. Google ScholarDigital Library
A. Trotman, A. Puurula, and B. Burgess. 2014. Improvements to BM25 and Language Models Examined. In ADCS '14. 58:58--58:65. Google ScholarDigital Library
E. M. Voorhees. 1994. Query Expansion Using Lexical-semantic Relations. In SIGIR '94. 61--69. Google ScholarDigital Library
Y.-C. Wang, J. Vandendorpe, and M. Evens. 1985. Relational thesauri in information retrieval. JASIS 36, 1 (1985), 15--27. Google ScholarDigital Library
Zhibiao Wu and Martha Palmer. 1994. Verbs Semantics and Lexical Selection. In Proceedings of the 32Nd Annual Meeting on Association for Computational Linguistics (ACL '94). Association for Computational Linguistics, Stroudsburg, PA, USA, 133--138. Google ScholarDigital Library
L. Zhao and J. Callan. 2010. Term Necessity Prediction. In CIKM 2010. 259--268. Google ScholarDigital Library

Index Terms

Refining Query Expansion Terms using Query Context
1. Information systems
  1. Information retrieval
    1. Information retrieval query processing
      1. Query reformulation

Recommendations

Automatic Term Reweighting for Query Expansion
ADCS '17: Proceedings of the 22nd Australasian Document Computing Symposium

Query expansion is used to overcome the vocabulary mismatch between the documents and queries, but it can lead to query drift. We propose an automatic term reweighting strategy for BM25 ranking functions. Using expansion terms obtained from general ...
Read More
Evaluating sources of query expansion terms
SIGIR '06: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval

This study investigates the effectiveness of retrieval systems and human users in generating terms for query expansion. We compare three sources of terms: system generated terms, terms users select from top-ranked sentences, and user generated terms. ...
Read More
Query expansion of zero-hit subject searches: using a thesaurus in conjunction with NLP techniques
TPDL'12: Proceedings of the Second international conference on Theory and Practice of Digital Libraries

The focus of our study is zero-hit queries in keyword subject searches and the effort of increasing recall in these cases by reformulating and, then, expanding the initial queries using an external source of knowledge, namely a thesaurus. To this end, ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

ADCS '18: Proceedings of the 23rd Australasian Document Computing Symposium
December 2018
78 pages
ISBN:9781450365499
DOI:10.1145/3291992

Copyright © 2018 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 11 December 2018
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Ad-hoc Retrieval
Query Drift
Query Expansion
Thesaurus
Word-Net
Wu-Palmer Similarity
Qualifiers
- short-paper
- Research
- Refereed limited
Conference

Acceptance Rates
ADCS '18 Paper Acceptance Rate13of20submissions,65%Overall Acceptance Rate30of57submissions,53%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 4
  Total Citations
  View Citations
- 112
  Total Downloads
- Downloads (Last 12 months)12
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Refining Query Expansion Terms using Query Context

ADCS '18: Proceedings of the 23rd Australasian Document Computing Symposium

ABSTRACT

References

Cited By

Index Terms

Recommendations

Automatic Term Reweighting for Query Expansion

Evaluating sources of query expansion terms

Query expansion of zero-hit subject searches: using a thesaurus in conjunction with NLP techniques

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Refining Query Expansion Terms using Query Context

ADCS '18: Proceedings of the 23rd Australasian Document Computing Symposium

ABSTRACT

References

Cited By

Index Terms

Recommendations

Automatic Term Reweighting for Query Expansion

Evaluating sources of query expansion terms

Query expansion of zero-hit subject searches: using a thesaurus in conjunction with NLP techniques

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media