skip to main content
10.1145/3159652.3159689acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article

Co-PACRR: A Context-Aware Neural IR Model for Ad-hoc Retrieval

Authors Info & Claims
Published:02 February 2018Publication History

ABSTRACT

Neural IR models, such as DRMM and PACRR, have achieved strong results by successfully capturing relevance matching signals. We argue that the context of these matching signals is also important. Intuitively, when extracting, modeling, and combining matching signals, one would like to consider the surrounding text(local context) as well as other signals from the same document that can contribute to the overall relevance score. In this work, we highlight three potential shortcomings caused by not considering context information and propose three neural ingredients to address them: a disambiguation component, cascade k-max pooling, and a shuffling combination layer. Incorporating these components into the PACRR model yields Co-PACER, a novel context-aware neural IR model. Extensive comparisons with established models on TREC Web Track data confirm that the proposed model can achieve superior search results. In addition, an ablation analysis is conducted to gain insights into the impact of and interactions between different components. We release our code to enable future comparisons.

References

  1. Omar Alonso and Stefano Mizzaro . 2012. Using crowdsourcing for TREC relevance assessment. Information Processing & Management Vol. 48, 6(2012), 1053--1066. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Olivier Chapelle, Donald Metlzer, Ya Zhang, and Pierre Grinspan . 2009. Expected reciprocal rank for graded relevance. In Proceedings of the 18th ACM conference on Information and knowledge management(CIKM '09). ACM, New York, NY, USA, 621--630. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Kevyn Collins-Thompson, Craig Macdonald, Paul Bennett, Fernando Diaz, and Ellen M Voorhees . 2015. TREC 2014 web track overview. Technical Report. DTIC Document.Google ScholarGoogle Scholar
  4. Nick Craswell, Onno Zoeter, Michael Taylor, and Bill Ramsey . 2008. An experimental comparison of click position-bias models Proceedings of the 2008 International Conference on Web Search and Data Mining. ACM, 87--94. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Mostafa Dehghani, Hamed Zamani, Aliaksei Severyn, Jaap Kamps, and W Bruce Croft . 2017. Neural Ranking Models with Weak Supervision. arXiv preprint arXiv:1704.08803(2017). Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Ian Goodfellow, Yoshua Bengio, and Aaron Courville . 2016. Deep Learning. MIT Press. http://www.deeplearningbook.org Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Jiafeng Guo, Yixing Fan, Qingyao Ai, and W Bruce Croft . 2016. A deep relevance matching model for ad-hoc retrieval Proceedings of the 25th ACM International on Conference on Information and Knowledge Management. ACM, 55--64. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Baotian Hu, Zhengdong Lu, Hang Li, and Qingcai Chen . 2014. Convolutional Neural Network Architectures for Matching Natural Language Sentences. Advances in Neural Information Processing Systems 27. 2042--2050. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Po-Sen Huang, Xiaodong He, Jianfeng Gao, Li Deng, Alex Acero, and Larry Heck . 2013. Learning Deep Structured Semantic Models for Web Search Using Clickthrough Data Proceedings of the 22nd ACM International Conference on Information & Knowledge Management(CIKM '13). 2333--2338. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Kai Hui, Andrew Yates, Klaus Berberich, and Gerard de Melo . 2017 a. A Position-Aware Deep Model for Relevance Matching in Information Retrieval EMNLP '17.Google ScholarGoogle Scholar
  11. Kai Hui, Andrew Yates, Klaus Berberich, and Gerard de Melo . 2017 b. Position-Aware Representations for Relevance Matching in Neural Information Retrieval Proceedings of the 26th International Conference on World Wide Web Companion. International World Wide Web Conferences Steering Committee, 799--800. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Samuel Huston and W. Bruce Croft . 2014. A Comparison of Retrieval Models using Term Dependencies Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management(CIKM'14). 111--120. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Tie-Yan Liu et almbox. . 2009. Learning to rank for information retrieval. Foundations and Trends® in Information Retrieval, Vol. 3, 3(2009), 225--331. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Donald Metzler and W Bruce Croft . 2005. A Markov random field model for term dependencies. Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 472--479. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Bhaskar Mitra, Fernando Diaz, and Nick Craswell . 2017. Learning to Match Using Local and Distributed Representations of Text for Web Search Proceedings of WWW 2017. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Bhaskar Mitra, Eric Nalisnick, Nick Craswell, and Rich Caruana . 2016. A dual embedding space model for document ranking. arXiv preprint arXiv:1602.01137(2016).Google ScholarGoogle Scholar
  17. I. Ounis, G. Amati, V. Plachouras, B. He, C. Macdonald, and C. Lioma . 2006. Terrier: A High Performance and Scalable Information Retrieval Platform Proceedings of ACM SIGIR'06 Workshop on Open Source Information Retrieval(OSIR 2006). Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Liang Pang, Yanyan Lan, Jiafeng Guo, Jun Xu, and Xueqi Cheng . 2016 a. A Study of MatchPyramid Models on Ad-hoc Retrieval. CoRR Vol. abs/1606.04648(2016). http://arxiv.org/abs/1606.04648Google ScholarGoogle Scholar
  19. Liang Pang, Yanyan Lan, Jiafeng Guo, Jun Xu, Shengxian Wan, and Xueqi Cheng . 2016 b. Text Matching As Image Recognition. In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence(AAAI'16). 2793--2799. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Yelong Shen, Xiaodong He, Jianfeng Gao, Li Deng, and Grégoire Mesnil . 2014. Learning Semantic Representations Using Convolutional Neural Networks for Web Search Proceedings of the 23rd International Conference on World Wide Web (WWW '14 Companion). Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Karen Simonyan and Andrew Zisserman . 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556(2014).Google ScholarGoogle Scholar
  22. Tao Tao and ChengXiang Zhai . 2007. An exploration of proximity measures in information retrieval Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 295--302. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Chenyan Xiong, Zhuyun Dai, Jamie Callan, Zhiyuan Liu, and Russell Power . 2017. End-to-End Neural Ad-hoc Ranking with Kernel Pooling Proceedings of the 40th International ACM SIGIR Conference(SIGIR '17). ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Hamed Zamani and W Bruce Croft . 2017. Relevance-based Word Embedding. arXiv preprint arXiv:1705.03556(2017). Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Co-PACRR: A Context-Aware Neural IR Model for Ad-hoc Retrieval

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        WSDM '18: Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining
        February 2018
        821 pages
        ISBN:9781450355810
        DOI:10.1145/3159652

        Copyright © 2018 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 2 February 2018

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        WSDM '18 Paper Acceptance Rate81of514submissions,16%Overall Acceptance Rate498of2,863submissions,17%

        Upcoming Conference

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader