research-article

Co-PACRR: A Context-Aware Neural IR Model for Ad-hoc Retrieval

Authors:
Kai Hui

Max Planck Institute for Informatics & SAP SE, Saarbruecken, Germany

Max Planck Institute for Informatics & SAP SE, Saarbruecken, Germany
View Profile

,
Andrew Yates

Max Planck Institute for Informatics, Saarbruecken, Germany

Max Planck Institute for Informatics, Saarbruecken, Germany
View Profile

,
Klaus Berberich

Max Planck Institute for Informatics & htw saar, Saarbruecken, Germany

Max Planck Institute for Informatics & htw saar, Saarbruecken, Germany
View Profile

,
Gerard de Melo

Rutgers University-New Brunswick, New Brunswick, NJ, USA

Rutgers University-New Brunswick, New Brunswick, NJ, USA
View Profile

WSDM '18: Proceedings of the Eleventh ACM International Conference on Web Search and Data MiningFebruary 2018Pages 279–287https://doi.org/10.1145/3159652.3159689

Published:02 February 2018Publication History

WSDM '18: Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining

Pages 279–287

ABSTRACT

Neural IR models, such as DRMM and PACRR, have achieved strong results by successfully capturing relevance matching signals. We argue that the context of these matching signals is also important. Intuitively, when extracting, modeling, and combining matching signals, one would like to consider the surrounding text(local context) as well as other signals from the same document that can contribute to the overall relevance score. In this work, we highlight three potential shortcomings caused by not considering context information and propose three neural ingredients to address them: a disambiguation component, cascade k-max pooling, and a shuffling combination layer. Incorporating these components into the PACRR model yields Co-PACER, a novel context-aware neural IR model. Extensive comparisons with established models on TREC Web Track data confirm that the proposed model can achieve superior search results. In addition, an ablation analysis is conducted to gain insights into the impact of and interactions between different components. We release our code to enable future comparisons.

References

Omar Alonso and Stefano Mizzaro . 2012. Using crowdsourcing for TREC relevance assessment. Information Processing & Management Vol. 48, 6(2012), 1053--1066. Google ScholarDigital Library
Olivier Chapelle, Donald Metlzer, Ya Zhang, and Pierre Grinspan . 2009. Expected reciprocal rank for graded relevance. In Proceedings of the 18th ACM conference on Information and knowledge management(CIKM '09). ACM, New York, NY, USA, 621--630. Google ScholarDigital Library
Kevyn Collins-Thompson, Craig Macdonald, Paul Bennett, Fernando Diaz, and Ellen M Voorhees . 2015. TREC 2014 web track overview. Technical Report. DTIC Document.Google Scholar
Nick Craswell, Onno Zoeter, Michael Taylor, and Bill Ramsey . 2008. An experimental comparison of click position-bias models Proceedings of the 2008 International Conference on Web Search and Data Mining. ACM, 87--94. Google ScholarDigital Library
Mostafa Dehghani, Hamed Zamani, Aliaksei Severyn, Jaap Kamps, and W Bruce Croft . 2017. Neural Ranking Models with Weak Supervision. arXiv preprint arXiv:1704.08803(2017). Google ScholarDigital Library
Ian Goodfellow, Yoshua Bengio, and Aaron Courville . 2016. Deep Learning. MIT Press. http://www.deeplearningbook.org Google ScholarDigital Library
Jiafeng Guo, Yixing Fan, Qingyao Ai, and W Bruce Croft . 2016. A deep relevance matching model for ad-hoc retrieval Proceedings of the 25th ACM International on Conference on Information and Knowledge Management. ACM, 55--64. Google ScholarDigital Library
Baotian Hu, Zhengdong Lu, Hang Li, and Qingcai Chen . 2014. Convolutional Neural Network Architectures for Matching Natural Language Sentences. Advances in Neural Information Processing Systems 27. 2042--2050. Google ScholarDigital Library
Po-Sen Huang, Xiaodong He, Jianfeng Gao, Li Deng, Alex Acero, and Larry Heck . 2013. Learning Deep Structured Semantic Models for Web Search Using Clickthrough Data Proceedings of the 22nd ACM International Conference on Information & Knowledge Management(CIKM '13). 2333--2338. Google ScholarDigital Library
Kai Hui, Andrew Yates, Klaus Berberich, and Gerard de Melo . 2017 a. A Position-Aware Deep Model for Relevance Matching in Information Retrieval EMNLP '17.Google Scholar
Kai Hui, Andrew Yates, Klaus Berberich, and Gerard de Melo . 2017 b. Position-Aware Representations for Relevance Matching in Neural Information Retrieval Proceedings of the 26th International Conference on World Wide Web Companion. International World Wide Web Conferences Steering Committee, 799--800. Google ScholarDigital Library
Samuel Huston and W. Bruce Croft . 2014. A Comparison of Retrieval Models using Term Dependencies Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management(CIKM'14). 111--120. Google ScholarDigital Library
Tie-Yan Liu et almbox. . 2009. Learning to rank for information retrieval. Foundations and Trends® in Information Retrieval, Vol. 3, 3(2009), 225--331. Google ScholarDigital Library
Donald Metzler and W Bruce Croft . 2005. A Markov random field model for term dependencies. Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 472--479. Google ScholarDigital Library
Bhaskar Mitra, Fernando Diaz, and Nick Craswell . 2017. Learning to Match Using Local and Distributed Representations of Text for Web Search Proceedings of WWW 2017. ACM. Google ScholarDigital Library
Bhaskar Mitra, Eric Nalisnick, Nick Craswell, and Rich Caruana . 2016. A dual embedding space model for document ranking. arXiv preprint arXiv:1602.01137(2016).Google Scholar
I. Ounis, G. Amati, V. Plachouras, B. He, C. Macdonald, and C. Lioma . 2006. Terrier: A High Performance and Scalable Information Retrieval Platform Proceedings of ACM SIGIR'06 Workshop on Open Source Information Retrieval(OSIR 2006). Google ScholarDigital Library
Liang Pang, Yanyan Lan, Jiafeng Guo, Jun Xu, and Xueqi Cheng . 2016 a. A Study of MatchPyramid Models on Ad-hoc Retrieval. CoRR Vol. abs/1606.04648(2016). http://arxiv.org/abs/1606.04648Google Scholar
Liang Pang, Yanyan Lan, Jiafeng Guo, Jun Xu, Shengxian Wan, and Xueqi Cheng . 2016 b. Text Matching As Image Recognition. In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence(AAAI'16). 2793--2799. Google ScholarDigital Library
Yelong Shen, Xiaodong He, Jianfeng Gao, Li Deng, and Grégoire Mesnil . 2014. Learning Semantic Representations Using Convolutional Neural Networks for Web Search Proceedings of the 23rd International Conference on World Wide Web (WWW '14 Companion). Google ScholarDigital Library
Karen Simonyan and Andrew Zisserman . 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556(2014).Google Scholar
Tao Tao and ChengXiang Zhai . 2007. An exploration of proximity measures in information retrieval Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 295--302. Google ScholarDigital Library
Chenyan Xiong, Zhuyun Dai, Jamie Callan, Zhiyuan Liu, and Russell Power . 2017. End-to-End Neural Ad-hoc Ranking with Kernel Pooling Proceedings of the 40th International ACM SIGIR Conference(SIGIR '17). ACM. Google ScholarDigital Library
Hamed Zamani and W Bruce Croft . 2017. Relevance-based Word Embedding. arXiv preprint arXiv:1705.03556(2017). Google ScholarDigital Library

Index Terms

Co-PACRR: A Context-Aware Neural IR Model for Ad-hoc Retrieval
1. Computing methodologies
  1. Machine learning
    1. Machine learning approaches
      1. Neural networks
2. Information systems
  1. Information retrieval
    1. Retrieval models and ranking
      1. Learning to rank

Recommendations

Exploration of query context for information retrieval
WWW '07: Proceedings of the 16th international conference on World Wide Web

A number of existing information retrieval systems propose the notion of query context to combine the knowledge of query and user into retrieval to reveal the most exact description of user's information needs. In this paper we interpret query context ...
Read More
A context-dependent relevance model

Numerous past studies have demonstrated the effectiveness of the relevance modelRM for information retrieval IR. This approach enables relevance or pseudo-relevance feedback to be incorporated within the language modeling framework of IR. In the ...
Read More
Improvement of vector space information retrieval model based on supervised learning
IRAL '00: Proceedings of the fifth international workshop on on Information retrieval with Asian languages

This paper proposes and method to improve retrieval performance of the vector space model (VSM) by utilizing user-supplied information of those documents that are relevant to the query in question. In addition to the user's relevance feedback ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
WSDM '18: Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining
February 2018
821 pages
ISBN:9781450355810
DOI:10.1145/3159652
General Chairs:
Yi Chang
Jilin University, Huawei Inc.
,
Chengxiang Zhai
University of Illinois Urbana-Champaign
,
Program Chairs:
Yan Liu
University of Southern California
,
Yoelle Maarek
Amazon
Copyright © 2018 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 2 February 2018
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Qualifiers
- research-article
Conference

Acceptance Rates
WSDM '18 Paper Acceptance Rate81of514submissions,16%Overall Acceptance Rate498of2,863submissions,17%
More
Upcoming Conference
WSDM '25

Sponsor:

sigir

sigir

sigir

sigir

The Eighteenth ACM International Conference on Web Search and Data Mining

April 7 - 11, 2025

Hannover , Germany
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 46
  Total Citations
  View Citations
- 349
  Total Downloads
- Downloads (Last 12 months)13
- Downloads (Last 6 weeks)3
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Co-PACRR: A Context-Aware Neural IR Model for Ad-hoc Retrieval

WSDM '18: Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining

ABSTRACT

References

Cited By

Index Terms

Recommendations

Exploration of query context for information retrieval

A context-dependent relevance model

Improvement of vector space information retrieval model based on supervised learning