skip to main content
10.1145/3514221.3526182acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

T-LevelIndex: Towards Efficient Query Processing in Continuous Preference Space

Published: 11 June 2022 Publication History

Abstract

Top-k related queries in continuous preference space (e.g., k-shortlist preference query kSPR, uncertain top-k query UTK, output-size specified utility-based query ORU) have numerous applications but are expensive to process. Existing algorithms process each query via specialized optimizations, which are difficult to generalize. In this work, we propose a novel and general index structure T-LevelIndex, which can be used to process various queries in continuous preference space efficiently. We devise efficient approaches to build the T-LevelIndex by fully exploiting the properties of continuous preference space. We conduct extensive experimental studies on both real- and synthetic- benchmarks. The results show that (i) our proposed index building approaches have low costs in terms of both space and time, and (ii) T-LevelIndex significantly outperforms specialized solutions for processing a spectrum of queries in continuous preference space, and the speedup can be two to three orders of magnitude.

References

[1]
2022. Booking. https://www.booking.com/
[2]
2022. Hotel dataset. www.hotels-base.com/
[3]
2022. House dataset. www.ipums.org/
[4]
2022. NBA dataset. www.basketball-reference.com/
[5]
2022. -LevelIndex implementation. https://github.com/cupermanrose/tau-LevelIndex/
[6]
2022. TripAdvisor. https://www.tripadvisor.com/
[7]
Pankaj K Agarwal. 2017. Simplex range searching and its variants: A review. A Journey Through Discrete Mathematics (2017), 1--30.
[8]
Pankaj K Agarwal, Boris Aronov, Timothy M Chan, and Micha Sharir. 1998. On Levels in Arrangements of Lines, Segments, Planes, and Triangles. Discrete & Computational Geometry 19, 3 (1998), 315--331.
[9]
Pankaj K Agarwal, Mark De Berg, Jiri Matousek, and Otfried Schwarzkopf. 1998. Constructing levels in arrangements and higher order Voronoi diagrams. SIAM journal on computing 27, 3 (1998), 654--667.
[10]
Norbert Beckmann, Hans-Peter Kriegel, Ralf Schneider, and Bernhard Seeger. 1990. The R*-tree: An efficient and robust access method for points and rectangles. In SIGMOD. 322--331.
[11]
Stephan Borzsony, Donald Kossmann, and Konrad Stocker. 2001. The skyline operator. In ICDE. 421--430.
[12]
Yuan-Chi Chang, Lawrence Bergman, Vittorio Castelli, Chung-Sheng Li, Ming- Ling Lo, and John R Smith. 2000. The onion technique: Indexing for linear optimization queries. In SIGMOD. 391--402.
[13]
Bernard Chazelle and Franco P Preparata. 1986. Halfspace range search: an algorithmic application of k-sets. Discrete & Computational Geometry 1, 1 (1986), 83--93.
[14]
Paolo Ciaccia and Davide Martinenghi. 2017. Reconciling skyline and ranking queries. PVLDB 10, 11 (2017), 1454--1465.
[15]
Vida Dujmovi¯u and Stefan Langerman. 2011. A center transversal theorem for hyperplanes and applications to graph drawing. In SoCG. 117--124.
[16]
Herbert Edelsbrunner, Joseph O'Rourke, and Raimund Seidel. 1986. Constructing arrangements of lines and hyperplanes with applications. SIAM J. Comput. 15, 2 (1986), 341--363.
[17]
Yunjun Gao, Qing Liu, Gang Chen, Baihua Zheng, and Linlin Zhou. 2015. Answering why-not questions on reverse top-k queries. PVLDB 8, 7 (2015), 738--749.
[18]
Shen Ge, Nikos Mamoulis, David W Cheung, et al. 2012. Efficient all top-k computation-a unified solution for all top-k, reverse top-k and top-m influential queries. TKDE 25, 5 (2012), 1015--1027.
[19]
Antonin Guttman. 1984. R-trees: A dynamic index structure for spatial searching. In SIGMOD. 47--57.
[20]
Zhian He and Eric Lo. 2012. Answering why-not questions on top-k queries. TKDE 26, 6 (2012), 1300--1315.
[21]
Vagelis Hristidis, Nick Koudas, and Yannis Papakonstantinou. 2001. PREFER: A system for the efficient execution of multi-parametric ranked queries. SIGMOD 30, 2 (2001), 259--270.
[22]
Ihab F Ilyas, George Beskales, and Mohamed A Soliman. 2008. A survey of top-k query processing techniques in relational database systems. Comput. Surveys 40, 4 (2008), 1--58.
[23]
Haim Kaplan, Wolfgang Mulzer, Liam Roditty, Paul Seiferth, and Micha Sharir. 2020. Dynamic planar Voronoi diagrams for general distance functions and their algorithmic applications. Discrete & Computational Geometry 64, 3 (2020), 838--904.
[24]
Jinfei Liu, Li Xiong, Jian Pei, Jun Luo, and Haoyu Zhang. 2015. Finding pareto optimal groups: Group-based skyline. PVLDB 8, 13 (2015), 2086--2097.
[25]
Qing Liu, Yunjun Gao, Gang Chen, Baihua Zheng, and Linlin Zhou. 2016. Answering why-not and why questions on reverse top-k queries. The VLDB Journal 25, 6 (2016), 867--892.
[26]
Nikos Mamoulis, Man Lung Yiu, Kit Hung Cheng, and David W Cheung. 2007. Efficient top-k aggregation of ranked inputs. TODS 32, 3 (2007), 19.
[27]
Tamara Mchedlidze, Marcel Radermacher, and Ignaz Rutter. 2017. Aligned drawings of planar graphs. In International Symposium on Graph Drawing and Network Visualization. 3--16.
[28]
Kyriakos Mouratidis, Keming Li, and Bo Tang. 2021. Marrying Top-k with Skyline Queries: Relaxing the Preference Input while Producing Output of Controllable Size. In SIGMOD. 1317--1330.
[29]
Kyriakos Mouratidis and Hwee Hwa Pang. 2013. Computing immutable regions for subspace top-k queries. PVLDB 6, 2 (2013), 73--84.
[30]
Kyriakos Mouratidis and Bo Tang. 2018. Exact processing of uncertain top-k queries in multi-criteria settings. PVLDB 11, 8 (2018), 866--879.
[31]
Kyriakos Mouratidis, Jilian Zhang, and Hwee Hwa PANG. 2015. Maximum rank query. PVLDB 8, 12 (2015), 1554--1565.
[32]
Dimitris Papadias, Yufei Tao, Greg Fu, and Bernhard Seeger. 2005. Progressive skyline computation in database systems. TODS 30, 1 (2005), 41--82.
[33]
Peng Peng and Raymong Chi-Wing Wong. 2015. k-hit query: Top-k query with probabilistic utility function. In SIGMOD. 577--592.
[34]
Jörg-Rüdiger Sack and Jorge Urrutia. 1999. Handbook of computational geometry. Elsevier.
[35]
Mohamed A Soliman, Ihab F Ilyas, Davide Martinenghi, and Marco Tagliasacchi. 2011. Ranking with uncertain scoring functions: semantics and sensitivity measures. In SIGMOD. 805--816.
[36]
Bo Tang, Kyriakos Mouratidis, and Mingji Han. 2021. On m-Impact Regions and Standing Top-k Influence Problems. In SIGMOD. 1784--1796.
[37]
Bo Tang, Kyriakos Mouratidis, and Man Lung Yiu. 2017. Determining the impact regions of competing options in preference space. In SIGMOD. 805--820.
[38]
Bo Tang, Kyriakos Mouratidis, Man Lung Yiu, and Zhenyu Chen. 2019. Creating top ranking options in the continuous option and preference space. PVLDB 12, 10 (2019), 1181--1194.
[39]
Yufei Tao, Vagelis Hristidis, Dimitris Papadias, and Yannis Papakonstantinou. 2007. Branch-and-bound processing of ranked queries. Information Systems 32, 3 (2007), 424--445.
[40]
Yufei Tao, Xiaokui Xiao, and Jian Pei. 2007. Efficient skyline and top-k retrieval in subspaces. TKDE 19, 8 (2007), 1072--1088.
[41]
Akrivi Vlachou, Christos Doulkeridis, Yannis Kotidis, and Kjetil Nørvåg. 2010. Reverse top-k queries. In ICDE. 365--376.
[42]
Akrivi Vlachou, Christos Doulkeridis, Yannis Kotidis, and Kjetil Norvag. 2011. Monochromatic and bichromatic reverse top-k queries. TKDE 23, 8 (2011), 1215--1229.
[43]
Akrivi Vlachou, Christos Doulkeridis, Kjetil Nørvåg, and Yannis Kotidis. 2010. Identifying the most influential data objects with reverse top-k queries. PVLDB 3, 1--2 (2010), 364--372.
[44]
Akrivi Vlachou, Christos Doulkeridis, Kjetil Nørvåg, and Yannis Kotidis. 2013. Branch-and-bound algorithm for reverse top-k queries. In SIGMOD. 481--492.
[45]
Guolei Yang and Ying Cai. 2017. Querying Improvement Strategies. In EDBT. 294--305.
[46]
Jianye Yang, Ying Zhang, Wenjie Zhang, and Xuemin Lin. 2016. Influence based cost optimization on user preference. In ICDE. 709--720.
[47]
Jianye Yang, Ying Zhang,Wenjie Zhang, and Xuemin Lin. 2019. Cost optimization based on influence and user preference. Knowledge and Information Systems 61, 2 (2019), 695--732.
[48]
Albert Yu, Pankaj K Agarwal, and Jun Yang. 2012. Processing a large number of continuous preference top-k queries. In SIGMOD. 397--408.
[49]
Jilian Zhang, Kyriakos Mouratidis, and HweeHwa Pang. 2014. Global immutable region computation. In SIGMOD. 1151--1162.
[50]
Lei Zou and Lei Chen. 2010. Pareto-based dominant graph: An efficient indexing structure to answer top-k queries. TKDE 23, 5 (2010), 727--741.

Cited By

View all
  • (2024)Reverse Regret Query2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00314(4100-4112)Online publication date: 13-May-2024
  • (2024)Multiple Continuous Top-K Queries Over Data Stream2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00129(1575-1588)Online publication date: 13-May-2024

Index Terms

  1. T-LevelIndex: Towards Efficient Query Processing in Continuous Preference Space

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGMOD '22: Proceedings of the 2022 International Conference on Management of Data
    June 2022
    2597 pages
    ISBN:9781450392495
    DOI:10.1145/3514221
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 11 June 2022

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. k-level index
    2. preference space
    3. top-k query processing

    Qualifiers

    • Research-article

    Funding Sources

    • Guangdong Provincial Key Laboratory
    • General Research Funds

    Conference

    SIGMOD/PODS '22
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 785 of 4,003 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)56
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 02 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Reverse Regret Query2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00314(4100-4112)Online publication date: 13-May-2024
    • (2024)Multiple Continuous Top-K Queries Over Data Stream2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00129(1575-1588)Online publication date: 13-May-2024

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media