skip to main content
10.1145/2124295.2124344acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article

Tapping into knowledge base for concept feedback: leveraging conceptnet to improve search results for difficult queries

Published: 08 February 2012 Publication History

Abstract

Query expansion is an important and commonly used technique for improving Web search results. Existing methods for query expansion have mostly relied on global or local analysis of document collection, click-through data, or simple ontologies such as WordNet. In this paper, we present the results of a systematic study of the methods leveraging the ConceptNet knowledge base, an emerging new Web resource, for query expansion. Specifically, we focus on the methods leveraging ConceptNet to improve the search results for poorly performing (or difficult) queries. Unlike other lexico-semantic resources, such as WordNet and Wikipedia, which have been extensively studied in the past, ConceptNet features a graph-based representation model of commonsense knowledge, in which the terms are conceptually related through rich relational ontology. Such representation structure enables complex, multi-step inferences between the concepts, which can be applied to query expansion. We first demonstrate through simulation experiments that expanding queries with the related concepts from ConceptNet has great potential for improving the search results for difficult queries. We then propose and study several supervised and unsupervised methods for selecting the concepts from ConceptNet for automatic query expansion. The experimental results on multiple data sets indicate that the proposed methods can effectively leverage ConceptNet to improve the retrieval performance of difficult queries both when used in isolation as well as in combination with pseudo-relevance feedback.

References

[1]
J. Allan. Overview of the trec 2004 robust retrieval track. In Proceedings of TREC 13, 2004.
[2]
J. Allan. Hard track overview in trec 2005 - high accuracy retrieval from documents. In Proceedings of TREC 14, 2005.
[3]
J. Bai, D. Song, P. Bruza, J.-Y. Nie, and G. Cao. Query expansion using term relationships in language models for information retrieval. In Proceedings of CIKM, pages 688--695, Bremen, Germany, 2005.
[4]
R. C. Bodner and F. Song. Knowledge-based approaches to query expansion in information retrieval. Advances in Artificial Intelligence, pages 146--158.
[5]
P. Bruza and D. Song. Inferring query models by computing information flow. In Proceedings of CIKM, pages 260--269, 2002.
[6]
C. Burgess, K. Livesay, and K. Lund. Explorations in context space: Words, sentences and discourse. Discourse Processes, 25:211--257, 1998.
[7]
G. Cao, J.-Y. Nie, and J. Bai. Integrating word relationships into language models. In Proceedings of SIGIR, pages 298--305, 2005.
[8]
Z. Cao, T. Qin, T.-Y. Liu, M.-F. Tsai, and H. Li. Learning to rank: from pairwise approach to listwise approach. In Proceedings of ICML, pages 129--136, 2007.
[9]
P. R. Cohen and R. Kjeldsen. Information retrieval by constrained spreading activation in semantic networks. Information Processing and Management, 23(4):255--268, 1987.
[10]
K. Collins-Thompson and J. Callan. Query expansion using random walk models. In Proceedings of CIKM, pages 704--711, 2005.
[11]
N. Craswell and D. Hawking. Overview of the trec 2004 web track. In Proceedings of TREC 13, 2004.
[12]
F. Crestani. Application of spreading activation techniques in information retrieval. Artificial Intelligence Review, 11(6):453--482, 1997.
[13]
F. Diaz and D. Metzler. Improving the estimation of relevance models using large external corpora. In Proceedings of SIGIR, pages 154--161, 2006.
[14]
E. Gabrilovich and S. Markovitch. Computing semantic relatedness using wikipedia-based explicit semantic analysis. In Proceedings of IJCAI, pages 1606--16011, 2007.
[15]
S. Gauch and J. Wang. A corpus analysis approach for automatic query expansion. In Proceedings of CIKM, pages 278--284, 1997.
[16]
M.-H. Hsu and H.-H. Chen. Information retrieval with commonsense knowledge. In Proceedings of SIGIR'06, pages 651--652, 2006.
[17]
M.-H. Hsu, M.-F. Tsai, and H.-H. Chen. Query expansion with conceptnet and wordnet: An intrinsic comparison. In Proceedings of AIRS, pages 1--13, 2006.
[18]
M.-H. Hsu, M.-F. Tsai, and H.-H. Chen. Combining wordnet and conceptnet for automatic query expansion: A learning approach. In Proceedings of AIRS, pages 213--224, 2008.
[19]
A. Kotov and C. Zhai. Towards natural question-guided search. In Proceedings of WWW, pages 541--550, 2010.
[20]
A. Kotov and C. Zhai. Interactive sense feedback for difficult queries. In Proceedings of CIKM, pages 163--172, 2011.
[21]
Y. Li, W. P. R. Luk, K. S. E. Ho, and F. L. K. Chung. Improving weak ad-hoc queries using wikipedia as external corpus. In Proceedings of SIGIR, pages 797--798, 2007.
[22]
H. Liu and P. Singh. Conceptnet-a practical commonsense reasoning tool-kit. BT Technology Journal, 22(4):146--158.
[23]
S. Liu, F. Liu, C. Yu, and W. Meng. An effective approach to document retrieval via utilizing wordnet and recognizing phrases. In Proceedings of ACM SIGIR'04, pages 266--272, 2004.
[24]
R. Mandala, T. Tokunaga, and H. Tanaka. Combining multiple evidence from different types of thesaurus for query expansion. In Proceedings of ACM SIGIR'99, pages 191--197, 1999.
[25]
E. Meij, D. Trieschnigg, M. de Rijke, and W. Kraaij. Conceptual language models for domain-specific retrieval. Information Processing and Management, 2:25--45, 2000.
[26]
G. Salton and C. Buckley. On the use of spreading activation methods in automatic information retrieval. In Proceedings of SIGIR, pages 147--160, 1988.
[27]
C. Shah and W. B. Croft. Evaluating high accuracy retrieval techniques. In Proceedings of SIGIR, pages 2--9, 2004.
[28]
C. van Rijsbergen. Information Retrieval. Butterworths, London, UK, 1979.
[29]
E. M. Voorhees. Query expansion using lexical-semantic relations. In Proceedings of ACM SIGIR'93, pages 61--69, 1994.
[30]
J. Xu and W. B. Croft. Query expansion using local and global document analysis. In Proceedings of ACM SIGIR'96, pages 4--11, 1996.
[31]
J. Xu and W. B. Croft. Improving the effectiveness of information retrieval with local context analysis. ACM Transactions on Information Systems, 18:79--112, 2000.
[32]
Z. Yin, M. Shokouhi, and N. Craswell. Query expansion using external evidence. In Proceedings of ECIR, pages 362--374, 2009.
[33]
J. Yufeng and W. B. Croft. An association thesaurus for information retrieval. In Proceedings of RIAO, pages 146--160, 1994.
[34]
C. Zhai and J. Lafferty. Document language models, query models, and risk minimization for information retrieval. In Proceedings of SIGIR'01, pages 111--119, 2001.
[35]
C. Zhai and J. Lafferty. Model-based feedback in the language modeling approach to information retrieval. In Proceedings of CIKM'01, pages 403--410, 2001.

Cited By

View all
  • (2023)Entity-Based Relevance Feedback for Document RetrievalProceedings of the 2023 ACM SIGIR International Conference on Theory of Information Retrieval10.1145/3578337.3605128(177-187)Online publication date: 9-Aug-2023
  • (2023)SPRF: A semantic Pseudo-relevance Feedback enhancement for information retrieval via ConceptNetKnowledge-Based Systems10.1016/j.knosys.2023.110602274(110602)Online publication date: Aug-2023
  • (2022)Maps of Medical Reason: Applying Knowledge Graphs and Artificial Intelligence in Medical Education and PracticeBioinformational Philosophy and Postdigital Knowledge Ecologies10.1007/978-3-030-95006-4_8(133-159)Online publication date: 22-Apr-2022
  • Show More Cited By

Index Terms

  1. Tapping into knowledge base for concept feedback: leveraging conceptnet to improve search results for difficult queries

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      WSDM '12: Proceedings of the fifth ACM international conference on Web search and data mining
      February 2012
      792 pages
      ISBN:9781450307475
      DOI:10.1145/2124295
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 08 February 2012

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. conceptnet
      2. knowledge bases
      3. query analysis
      4. query expansion

      Qualifiers

      • Research-article

      Conference

      Acceptance Rates

      Overall Acceptance Rate 498 of 2,863 submissions, 17%

      Upcoming Conference

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)18
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 05 Mar 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)Entity-Based Relevance Feedback for Document RetrievalProceedings of the 2023 ACM SIGIR International Conference on Theory of Information Retrieval10.1145/3578337.3605128(177-187)Online publication date: 9-Aug-2023
      • (2023)SPRF: A semantic Pseudo-relevance Feedback enhancement for information retrieval via ConceptNetKnowledge-Based Systems10.1016/j.knosys.2023.110602274(110602)Online publication date: Aug-2023
      • (2022)Maps of Medical Reason: Applying Knowledge Graphs and Artificial Intelligence in Medical Education and PracticeBioinformational Philosophy and Postdigital Knowledge Ecologies10.1007/978-3-030-95006-4_8(133-159)Online publication date: 22-Apr-2022
      • (2021)Non-linear Relationship Learning Algorithm for Related Recommendations2021 International Conference on Big Data and Intelligent Decision Making (BDIDM)10.1109/BDIDM53834.2021.00019(58-64)Online publication date: Jul-2021
      • (2021)Leveraging Closed Patterns and Formal Concept Analysis for Enhanced Microblogs RetrievalComplex Data Analytics with Formal Concept Analysis10.1007/978-3-030-93278-7_7(151-166)Online publication date: 8-Dec-2021
      • (2020)Semantic Service Search in IT Crowdsourcing Platform: A Knowledge Graph-Based ApproachInternational Journal of Software Engineering and Knowledge Engineering10.1142/S021819402040006930:06(765-783)Online publication date: 29-Jul-2020
      • (2020)A contemporary combined approach for query expansionMultimedia Tools and Applications10.1007/s11042-020-09172-281:24(35195-35221)Online publication date: 3-Jul-2020
      • (2020)Lexifield: a system for the automatic building of lexicons by semantic expansion of short word listsKnowledge and Information Systems10.1007/s10115-020-01451-6Online publication date: 20-Mar-2020
      • (2020)A Text Extraction-Based Smart Knowledge Graph Composition for Integrating Lessons Learned During the Microchip DesignIntelligent Systems and Applications10.1007/978-3-030-55187-2_43(594-610)Online publication date: 25-Aug-2020
      • (2019)Mini-Map: Mixed-Initiative Mind-Mapping via Contextual Query ExpansionAIAA Scitech 2019 Forum10.2514/6.2019-2347Online publication date: 6-Jan-2019
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media