skip to main content
10.1145/2809948.2809951acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Improving Dynamic Index Pruning via Linear Programming

Published:22 October 2015Publication History

ABSTRACT

Dynamic index pruning techniques are commonly used to speed up query processing in Web search engines. In this work, we propose a linear programming technique which can further improve the performance of the state-of-the-art dynamic index pruning techniques. The experiments we conducted demonstrate that the proposed technique achieves reduction in terms of the disk access, index decompression, and scoring costs compared to the well-known Max-Score technique.

References

  1. A. Broder, D. Carmel, M. Herscovici, A. Soffer, and J. Zien. Efficient query evaluation using a two-level retrieval process. In Proceedings of the 12th International Conference on Information and Knowledge Management (CIKM), pages 426--434, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. B. B. Cambazoglu, E. Varol, E. Kayaaslan, C. Aykanat, and R. Baeza-Yates. Query forwarding in geographically distributed search engines. In Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 90--97, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. K. Chakrabarti, S. Chaudhuri, and V. Ganti. Interval-based pruning for top-k processing over compressed lists. In Proceedings of the 2011 IEEE 27th International Conference on Data Engineering (ICDE), pages 709--720, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. C. Dimopoulos, S. Nepomnyachiy, and T. Suel. Optimizing top-k document retrieval strategies for block-max indexes. In Proceedings of the Sixth ACM International Conference on Web Search and Data Mining (WSDM), pages 113--122, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. S. Ding and T. Suel. Faster top-k document retrieval using block-max indexes. In Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 993--1002, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. M. Fontoura, V. Josifovski, J. Liu, S. Venkatesan, X. Zhu, and J. Y. Zien. Evaluation strategies for top-k queries over memory-resident inverted indexes. Proceedings of the VLDB Endowment, 4(12):1213--1224, 2011.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. S. Jonassen. Efficient Query Processing in Distributed Search Engines. PhD thesis, Norwegian University of Science and Technology, Department of Computer and Information Science, pages 133--137, 2013.Google ScholarGoogle Scholar
  8. S. Jonassen and S. E. Bratsberg. Efficient compressed inverted index skipping for disjunctive text-queries. In Proceedings of the 33rd European Conference on Information Retrieval (ECIR), pages 530--542, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. S. Jonassen and S. E. Bratsberg. Intra-query concurrent pipelined processing for distributed full-text retrieval. In Proceedings of the 34th European Conference on Information Retrieval (ECIR), pages 413--425, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. S. Jonassen and S. E. Bratsberg. Improving the performance of pipelined query processing with skipping - and its comparison to document-wise partitioning. World Wide Web, 17(5):949--967, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. C. Macdonald, I. Ounis, and N. Tonellotto. Upper-bound approximations for dynamic pruning. ACM Transactions on Information Systems, 29(4):17:1--17:28, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. C. Macdonald, N. Tonellotto, and I. Ounis. Learning to predict response times for online query scheduling. In Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 621--630, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. O. Rojas, V. Gil-Costa and M. Marin. Efficient Parallel Block-Max WAND Algorithm. In Proceedings of the 19th International Conference on Parallel Processing (Euro-Par), pages 394--405, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. D. Shan, S. Ding, J. He, H. Yan, and X. Li. Optimized top-k processing with global page scores on block-max indexes. In Proceedings of the 5th International Conference on Web Search and Data Mining (WSDM), pages 423--432, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. T. Strohman, H. Turtle, and W. B. Croft. Optimization strategies for complex queries. In Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 219--225, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. H. Turtle and J. Flood. Query evaluation: strategies and optimizations. Information Processing and Management, 31(6):831--850, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Improving Dynamic Index Pruning via Linear Programming

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      LSDS-IR '15: Proceedings of the 2015 Workshop on Large-Scale and Distributed System for Information Retrieval
      October 2015
      32 pages
      ISBN:9781450337816
      DOI:10.1145/2809948

      Copyright © 2015 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 22 October 2015

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      LSDS-IR '15 Paper Acceptance Rate3of5submissions,60%Overall Acceptance Rate3of5submissions,60%

      Upcoming Conference

    • Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader