Skip to main content

Improving Difficult Queries by Leveraging Clusters in Term Graph

  • Conference paper
  • First Online:
Information Retrieval Technology (AIRS 2015)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9460))

Included in the following conference series:

Abstract

Term graphs, in which the nodes correspond to distinct lexical units (words or phrases) and the weighted edges represent semantic relatedness between those units, have been previously shown to be beneficial for ad-hoc IR. In this paper, we experimentally demonstrate that indiscriminate utilization of term graphs for query expansion limits their retrieval effectiveness. To address this deficiency, we propose to apply graph clustering to identify coherent structures in term graphs and utilize these structures to derive more precise query expansion language models. Experimental evaluation of the proposed methods using term association graphs derived from document collections and popular knowledge bases (ConceptNet and Wikipedia) on TREC datasets indicates that leveraging semantic structure in term graphs allows to improve the results of difficult queries through query expansion.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://wiki.dbpedia.org/Downloads39.

  2. 2.

    http://conceptnet5.media.mit.edu/downloads/20130917/associations.txt.gz.

References

  1. Bai, J., Song, D., Bruza, P., Nie, J.-Y., Cao, G.: Query expansion using term relationships in language models for information retrieval. In: Proceedings of CIKM 2005, pp. 688–695 (2005)

    Google Scholar 

  2. Karimzadehgan, M., Zhai, C.: Estimation of statistical translation models based on mutual information for ad hoc information retrieval. In: Proceedings of SIGIR 2010, pp. 323–330 (2010)

    Google Scholar 

  3. Kotov, A., Zhai, C.: Interactive sense feedback for difficult queries. In: Proceedings of CIKM 2011, pp. 59–66 (2009)

    Google Scholar 

  4. Kotov, A., Zhai, C.: Tapping into knowledge base for concept feedback: leveraging conceptnet to improve search results for difficult queries. In: Proceedings of WSDM 2012, pp. 403–412 (2012)

    Google Scholar 

  5. Kurland, O.: The opposite of smoothing: a language model approach to ranking query-specific document clusters. In: Proceedings of SIGIR 2008, pp. 171–178 (2008)

    Google Scholar 

  6. Liu, H., Singh, P.: Conceptnet - a practical commonsense reasoning tool-kit. BT Technol. J. 22(4), 211–226 (2004)

    Article  MathSciNet  Google Scholar 

  7. Liu, X., Croft, B.: Cluster-based retrieval using language models. In: Proceedings of SIGIR 2004, pp. 186–193 (2004)

    Google Scholar 

  8. Manning, C., Schütze, H.: Foundations of statistical natural language processing. The MIT Press, Cambridge (1999)

    Google Scholar 

  9. Rosvall, M., Bergstrom, C.: Maps of information flow reveal community structure in complex network. PNAS 105, 1118–1123 (2008)

    Article  Google Scholar 

  10. Xu, Y., Jones, G., Wang, B.: Query dependent pseudo-relevance feedback based on Wikipedia. In: Proceedings of SIGIR 2009, pp. 59–66 (2009)

    Google Scholar 

  11. Zhai, C., Lafferty, J.: Document language models, query models, and risk minimization for information retrieval. In: Proceedings of SIGIR 2001, pp. 111–119 (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alexander Kotov .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Anand, R., Kotov, A. (2015). Improving Difficult Queries by Leveraging Clusters in Term Graph. In: Zuccon, G., Geva, S., Joho, H., Scholer, F., Sun, A., Zhang, P. (eds) Information Retrieval Technology. AIRS 2015. Lecture Notes in Computer Science(), vol 9460. Springer, Cham. https://doi.org/10.1007/978-3-319-28940-3_37

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-28940-3_37

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-28939-7

  • Online ISBN: 978-3-319-28940-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics