skip to main content
10.1145/2452376.2452433acmotherconferencesArticle/Chapter ViewAbstractPublication PagesedbtConference Proceedingsconference-collections
research-article

Efficient top-k query answering using cached views

Published:18 March 2013Publication History

ABSTRACT

Top-k query processing has recently received a significant amount of attention due to its wide application in information retrieval, multimedia search and recommendation generation. In this work, we consider the problem of how to efficiently answer a top-k query by using previously cached query results. While there has been some previous work on this problem, existing algorithms suffer from either limited scope or lack of scalability. In this paper, we propose two novel algorithms for handling this problem. The first algorithm LPTA+ provides significantly improved efficiency compared to the state-of-the-art LPTA algorithm [26] by reducing the number of expensive linear programming problems that need to be solved. The second algorithm we propose leverages a standard space partition-based index structure in order to avoid many of the drawbacks of LPTA-based algorithms, thereby further improving the efficiency of query processing. Through extensive experiments on various datasets, we demonstrate that our algorithms significantly outperform the state of the art.

References

  1. Auto123 consumer car ratings. http://www.auto123.com/en/car-reviews/consumer-ratings/.Google ScholarGoogle Scholar
  2. Flickr. http://www.flickr.com.Google ScholarGoogle Scholar
  3. IMDB. http://imdb.com.Google ScholarGoogle Scholar
  4. Memcached. http://memcached.org.Google ScholarGoogle Scholar
  5. Metacritics. http://www.metacritic.com.Google ScholarGoogle Scholar
  6. Metascore. http://www.metacritic.com/about-metascores.Google ScholarGoogle Scholar
  7. Nba basketball statistics. http://www.databasebasketball.com.Google ScholarGoogle Scholar
  8. Rottentomatoes. http://www.rottentomatoes.com.Google ScholarGoogle Scholar
  9. Twitter. http://twitter.com.Google ScholarGoogle Scholar
  10. U.s. news best cars. http://usnews.rankingsandreviews.com/cars-trucks/rankings/cars/.Google ScholarGoogle Scholar
  11. U.s. news best collage rankings. http://www.usnews.com/rankings.Google ScholarGoogle Scholar
  12. Wikipedia. http://www.wikipedia.org.Google ScholarGoogle Scholar
  13. World university rankings. http://www.timeshighereducation.co.uk/world-university-rankings/.Google ScholarGoogle Scholar
  14. Youtube. http://www.youtube.com.Google ScholarGoogle Scholar
  15. S. Abiteboul and O. M. Duschka. Complexity of answering queries using materialized views. In PODS, pages 254--263, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. R. Akbarinia, E. Pacitti, and P. Valduriez. Best position algorithms for top-k queries. In VLDB, pages 495--506, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. G. J. Badros, A. Borning, and P. J. Stuckey. The Cassowary linear arithmetic constraint solving algorithm. ACM Trans. Comput.-Hum. Interact., 8(4):267--306, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. E. Baikousi and P. Vassiliadis. View usability and safety for the answering of top-k queries via materialized views. In DOLAP, pages 97--104, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. C. Böhm, S. Berchtold, and D. A. Keim. Searching in high-dimensional spaces: Index structures for improving the performance of multimedia databases. ACM Comput. Surv., 33(3):322--373, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. S. Börzsönyi, D. Kossmann, and K. Stocker. The skyline operator. In ICDE, pages 421--430, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. A. K. Chandra and P. M. Merlin. Optimal implementation of conjunctive queries in relational data bases. In STOC, pages 77--90, 1977. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Y.-C. Chang, L. D. Bergman, V. Castelli, C.-S. Li, M.-L. Lo, and J. R. Smith. The onion technique: Indexing for linear optimization queries. In SIGMOD, pages 391--402, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. H.-T. Chou and D. J. DeWitt. An evaluation of buffer management strategies for relational database systems. In VLDB, pages 127--141, 1985. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. G. Dantzig. Linear Programming and Extensions. Princeton University, 1998.Google ScholarGoogle Scholar
  25. G. Das, D. Gunopulos, N. Koudas, and N. Sarkas. Ad-hoc top-k query answering for data streams. In VLDB, pages 183--194, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. G. Das, D. Gunopulos, N. Koudas, and D. Tsirogiannis. Answering top-k queries using views. In VLDB, pages 451--462, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. R. Fagin, A. Lotem, and M. Naor. Optimal aggregation algorithms for middleware. J. Comput. Syst. Sci., 66(4):614--656, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. A. Y. Halevy. Answering queries using views: A survey. VLDB J., 10(4):270--294, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. J.-S. Heo, J. Cho, and K.-Y. Whang. The hybrid-layer index: A synergic approach to answering top-k queries in arbitrary subspaces. In ICDE, pages 445--448, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  30. V. Hristidis, N. Koudas, and Y. Papakonstantinou. PREFER: A system for the efficient execution of multi-parametric ranked queries. In SIGMOD, pages 259--270, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. I. F. Ilyas, W. G. Aref, and A. K. Elmagarmid. Supporting top-k join queries in relational databases. In VLDB, pages 754--765, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. I. F. Ilyas, G. Beskales, and M. A. Soliman. A survey of top-k query processing techniques in relational database systems. ACM Comput. Surv., 40(4), 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Y. E. Ioannidis. The history of histograms. In VLDB, pages 19--30, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. T. Johnson and D. Shasha. 2Q: A low overhead high performance buffer management replacement algorithm. In VLDB, pages 439--450, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. H. Kellerer, U. Pferschy, and D. Pisinger. Knapsack Problems. Springer, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  36. C. T. Kwok and D. S. Weld. Planning to gather information. In AAAI/IAAI, Vol. 1, pages 32--39, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. J. Lee, H. Cho, and S. won Hwang. Efficient dual-resolution layer indexing for top-k queries. In ICDE, pages 1084--1095, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. A. Y. Levy, A. O. Mendelzon, Y. Sagiv, and D. Srivastava. Answering queries using views. In PODS, pages 95--104, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. C. Li, K. C.-C. Chang, I. F. Ilyas, and S. Song. RankSQL: Query algebra and optimization for relational top-k queries. In SIGMOD, pages 131--142, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. S. J. Russell and P. Norvig. Artificial Intelligence: A Modern Approach. Pearson Education, third edition, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. N. H. Ryeng, A. Vlachou, C. Doulkeridis, and K. Nørvåg. Efficient distributed top-k query processing with caching. In DASFAA, pages 280--295, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. D. Theodoratos and T. K. Sellis. Data warehouse configuration. In VLDB, pages 126--135, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. P. Tsaparas, T. Palpanas, Y. Kotidis, N. Koudas, and D. Srivastava. Ranked join indices. In ICDE, pages 277--288, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  44. D. Xin, C. Chen, and J. Han. Towards robust indexing for ranked queries. In VLDB, pages 235--246, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. A. Yu, P. K. Agarwal, and J. Yang. Processing a large number of continuous preference top-k queries. In SIGMOD, pages 397--408, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Efficient top-k query answering using cached views

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        EDBT '13: Proceedings of the 16th International Conference on Extending Database Technology
        March 2013
        793 pages
        ISBN:9781450315975
        DOI:10.1145/2452376

        Copyright © 2013 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 18 March 2013

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate7of10submissions,70%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader