skip to main content
10.1145/1989323.1989386acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

Efficient and generic evaluation of ranked queries

Published: 12 June 2011 Publication History

Abstract

An important feature of the existing methods for ranked top-k processing is to avoid searching all the objects in the underlying dataset, and limiting the number of random accesses to the data. However, the performance of these methods degrades rapidly as the number of random accesses increases. In this paper, we propose a novel and general sequential access scheme for top-k query evaluation, which outperforms existing methods. We extend this scheme to efficiently answer top-k queries in subspace and on dynamic data. We also study the "dual" form of top-k queries called "ranking" queries, which returns the rank of a specified record/object, and propose an exact as well as two approximate solutions. An extensive empirical evaluation validates the robustness and efficiency of our techniques.

References

[1]
R. Akbarinia, E. Pacitti, and P. Valduriez. Best position algorithms for top-k queries. In VLDB, 2007.
[2]
N. Bansal, S. Guha, and N. Koudas. Ad-hoc aggregations of ranked lists in the presence of hierarchies. In SIGMOD, 2008.
[3]
H. Bast, D. Majumdar, R. Schenkel, M. Theobald, and G. Weikum. Io-top-k: Index-access optimized top-k query processing. In VLDB, 2006.
[4]
N. Bruno, L. Gravano, and A. Marian. Evaluating top-k queries over web-accessible databases. In ICDE, 2002.
[5]
N. Bruno and H. Wang. The threshold algorithm:from middleware systems to relational engine. IEEE Trans.Knowl.Data Eng., 19(4), 2007.
[6]
K. C. C. Chang and S. W. Hwang. Minimal probing: supporting expensive predicates for top-k queries. In SIGMOD, 2002.
[7]
Y. Chang, L. Bergman, V. Castelli, C.Li, M.Lo, and J.Smith. The onion technique:indexing for linear optimization queries. In SIGMOD, 2000.
[8]
G. Das, D. Gunopulos, N. Koudas, and D. Tsirogiannis. Answering top-k queries using views. In VLDB, 2006.
[9]
R. Fagin, A. Lotem, and M. Naor. Optimal aggregation algorithms for middleware. In PODS, 2001.
[10]
R. Fagin, A. Lotem, and M. Naor. Optimal aggregation algorithms for middleware. J. Comput. Syst. Sci., 66(4), 2003.
[11]
U. Güntzer, W. T. Balke, and W. Kießling. Towards efficient multi-feature queries in heterogeneous environments. In ITCC, 2001.
[12]
P. Gurský and P. Vojtás. On top-ksearch with no random access using small memory. In ADBIS, 2008.
[13]
J. S. Heo, K. Y. Whang, M. S. Kim, Y. Kim, and I. Song. The partitioned-layer index:answering monotone top-k queries using the convex skyline and partitioning merging technique. Inf. Sci., 179(19), 2009.
[14]
V. Hristidis and Y. Papakonstantinou. Algorithms and applications for answering ranked queries using ranked views. VLDB J., 13(1), 2004.
[15]
S. W. Hwang and K. C. C. Chang. Optimizing top-k queries for middleware access: A unified cost-based approach. ACM Trans. Database Syst., 32(1), 2007.
[16]
I. F. Ilyas and W. G. Aref. Rank-aware query processing and optimization. In ICDE, 2005.
[17]
I. F. Ilyas, G. Beskales, and M. A. Soliman. A survey of top-query processing techniques in relational database systems. ACM Comput. Surv., 40(4), 2008.
[18]
I. F. Ilyas, R. Shah, W. G. Aref, J. S. Vitter, and A. K. Elmagarmid. Rank-aware query optimization. In SIGMOD, 2004.
[19]
C. Lang, Y. Chang, and J. Smith. Making the threshold algorithm access cost aware. IEEE Trans. Knowl.Data Eng., 16(10), 2004.
[20]
N. Mamoulis, K. H. Cheng, M. L. Yiu, and D. W. Cheung. Efficient aggregation of ranked inputs. In SIGMOD, 2006.
[21]
A. Marian, N. Bruno, and L. Gravano. Evaluating top-queries over web-accessible databases. ACM Trans. Database Syst., 29(2), 2004.
[22]
M. Mitzenmacher and E. Upfal. Probability and Computing: Randomized Algorithms and Probabilistic Analysis. Cambridge University Press, New York, NY, USA, 2005.
[23]
A. Natsev, Y. C. Chang, J. R. Smith, C. S. Li, and J. S. Vitter. Supporting incremental join queries on ranked inputs. In VLDB, 2001.
[24]
H. H. Pang, X. H. Ding, and B. H. Zheng. Efficient processing of exact top-queries over disk-resident sorted lists. VLDB J., 19(3), 2010.
[25]
Y. F. Tao, V. Hristidis, D. Papadias, and Y. Papakonstantinou. Branch-and-bound processing of ranked queries. Inf. Syst., 32(3), 2007.
[26]
Y. F. Tao, X. K. Xiao, and J. Pei. Efficient skyline and top-k retrieval in subspaces. IEEE Trans. Knowl. Data Eng., 19(8), 2007.
[27]
U. Üntzer, W. T. Balke, and W. Kießling. Optimizing multi-feature queries for image databases. In VLDB, 2000.
[28]
D. Xin, C. Chen, and J. W. Han. Towards robust indexing for ranked queries. In VLDB, 2006.
[29]
L. Zou and L. C. 0002. Dominant graph: An efficient indexing structure to answer top-k queries. In ICDE, 2008.

Cited By

View all
  • (2022)Efficient parallel processing of high-dimensional spatial kNN queriesSoft Computing10.1007/s00500-022-07081-026:22(12291-12316)Online publication date: 2-May-2022
  • (2020)Index-based, High-dimensional, Cosine Threshold Querying with Optimality GuaranteesTheory of Computing Systems10.1007/s00224-020-10009-6Online publication date: 26-Oct-2020
  • (2019)Multidimensional Preference Query Optimization on Infrastructure Monitoring Systems2019 IEEE International Conference on Big Data (Big Data)10.1109/BigData47090.2019.9005666(3727-3736)Online publication date: Dec-2019
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGMOD '11: Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
June 2011
1364 pages
ISBN:9781450306614
DOI:10.1145/1989323
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 June 2011

Permissions

Request permissions for this article.

Check for updates

Author Tag

  1. ranked queries

Qualifiers

  • Research-article

Conference

SIGMOD/PODS '11
Sponsor:

Acceptance Rates

Overall Acceptance Rate 785 of 4,003 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)0
Reflects downloads up to 01 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2022)Efficient parallel processing of high-dimensional spatial kNN queriesSoft Computing10.1007/s00500-022-07081-026:22(12291-12316)Online publication date: 2-May-2022
  • (2020)Index-based, High-dimensional, Cosine Threshold Querying with Optimality GuaranteesTheory of Computing Systems10.1007/s00224-020-10009-6Online publication date: 26-Oct-2020
  • (2019)Multidimensional Preference Query Optimization on Infrastructure Monitoring Systems2019 IEEE International Conference on Big Data (Big Data)10.1109/BigData47090.2019.9005666(3727-3736)Online publication date: Dec-2019
  • (2019)Subspace top-k query processing using the hybrid-layer index with a tight boundData & Knowledge Engineering10.1016/j.datak.2012.07.00183(1-19)Online publication date: 1-Jan-2019
  • (2016)Evaluating Top-N queries in n-dimensional normed spacesInformation Sciences: an International Journal10.1016/j.ins.2016.09.035374:C(255-275)Online publication date: 20-Dec-2016
  • (2015)An Experimental Evaluation of Aggregation Algorithms for Processing Top-K Queries2015 IEEE International Conference on Computer and Information Technology; Ubiquitous Computing and Communications; Dependable, Autonomic and Secure Computing; Pervasive Intelligence and Computing10.1109/CIT/IUCC/DASC/PICOM.2015.47(326-333)Online publication date: Oct-2015
  • (2014)On the Complexity of Query Result DiversificationACM Transactions on Database Systems10.1145/260213639:2(1-46)Online publication date: 26-May-2014
  • (2014)Slicing the Dimensionality: Top-k Query Processing for High-Dimensional SpacesTransactions on Large-Scale Data- and Knowledge-Centered Systems XIV10.1007/978-3-662-45714-6_2(26-50)Online publication date: 21-Nov-2014
  • (2012)Optimal top-k generation of attribute combinations based on ranked listsProceedings of the 2012 ACM SIGMOD International Conference on Management of Data10.1145/2213836.2213883(409-420)Online publication date: 20-May-2012

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media