skip to main content
10.1145/1142473.1142531acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
Article

Efficient reverse k-nearest neighbor search in arbitrary metric spaces

Published: 27 June 2006 Publication History

Abstract

The reverse k-nearest neighbor (RkNN) problem, i.e. finding all objects in a data set the k-nearest neighbors of which include a specified query object, is a generalization of the reverse 1-nearest neighbor problem which has received increasing attention recently. Many industrial and scientific applications call for solutions of the RkNN problem in arbitrary metric spaces where the data objects are not Euclidean and only a metric distance function is given for specifying object similarity. Usually, these applications need a solution for the generalized problem where the value of k is not known in advance and may change from query to query. However, existing approaches, except one, are designed for the specific R1NN problem. In addition - to the best of our knowledge - all previously proposed methods, especially the one for generalized RkNN search, are only applicable to Euclidean vector data but not for general metric objects. In this paper, we propose the first approach for efficient RkNN search in arbitrary metric spaces where the value of k is specified at query time. Our approach uses the advantages of existing metric index structures but proposes to use conservative and progressive distance approximations in order to filter out true drops and true hits. In particular, we approximate the k-nearest neighbor distance for each data object by upper and lower bounds using two functions of only two parameters each. Thus, our method does not generate any considerable storage overhead. We show in a broad experimental evaluation on real-world data the scalability and the usability of our novel approach.

References

[1]
{1} A. M. Andrew. Another efficient algorithm for convex hulls in two dimensions. Information Processing Letters, 9, 1979.
[2]
{2} N. Beckmann, H.-P. Kriegel, R. Schneider, and B. Seeger. The R*-Tree: An efficient and robust access method for points and rectangles. In Proc. SIGMOD, 1990.
[3]
{3} S. Berchtold, D. A. Keim, and H.-P. Kriegel. The X-Tree: An index structure for high-dimensional data. In Proc. VLDB, 1996.
[4]
{4} P. Ciaccia, M. Patella, and P. Zezula. M-Tree: an efficient access method for similarity search in metric spaces. In Proc. VLDB, 1997.
[5]
{5} C. Ding and H. Peng. Minimum redundancy feature selection from microarray gene expression data. In CSB03, 2003.
[6]
{6} A. Guttman. R-Trees: A dynamic index structure for spatial searching. In Proc. SIGMOD, 1984.
[7]
{7} F. Korn and S. Muthukrishnan. Influenced sets based on reverse nearest neighbor queries. In Proc. SIGMOD, 2000.
[8]
{8} M. Schroeder. Fractals, Chaos, Power Laws: Minutes from an infinite paradise. W.H. Freeman and company, New York, 1991.
[9]
{9} A. Singh, H. Ferhatosmanoglu, and A. S. Tosun. High dimensional reverse nearest neighbor queries. In Proc. CIKM, 2003.
[10]
{10} I. Stanoi, D. Agrawal, and A. E. Abbadi. Reverse nearest neighbor queries for dynamic databases. In Proc. DMKD, 2000.
[11]
{11} Y. Tao, D. Papadias, and X. Lian. Reverse kNN search in arbitrary dimensionality. In Proc. VLDB, 2004.
[12]
{12} C. Yang and K.-I. Lin. An index structure for efficient reverse nearest neighbor queries. In Proc. ICDE, 2001.

Cited By

View all
  • (2024)Towards Effective, Robust and Utility-preserving Watermarking of GPS TrajectoriesACM Transactions on Spatial Algorithms and Systems10.1145/370155811:1(1-25)Online publication date: 25-Oct-2024
  • (2024)Efficient Reverse $k$ Approximate Nearest Neighbor Search Over High-Dimensional Vectors2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00325(4262-4274)Online publication date: 13-May-2024
  • (2023)NanBDOSKnowledge-Based Systems10.1016/j.knosys.2023.110665274:COnline publication date: 15-Aug-2023
  • Show More Cited By

Index Terms

  1. Efficient reverse k-nearest neighbor search in arbitrary metric spaces

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGMOD '06: Proceedings of the 2006 ACM SIGMOD international conference on Management of data
    June 2006
    830 pages
    ISBN:1595934340
    DOI:10.1145/1142473
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 27 June 2006

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. metric index structure
    2. reverse nearest neighbor search

    Qualifiers

    • Article

    Conference

    SIGMOD/PODS06
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 785 of 4,003 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)21
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 05 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Towards Effective, Robust and Utility-preserving Watermarking of GPS TrajectoriesACM Transactions on Spatial Algorithms and Systems10.1145/370155811:1(1-25)Online publication date: 25-Oct-2024
    • (2024)Efficient Reverse $k$ Approximate Nearest Neighbor Search Over High-Dimensional Vectors2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00325(4262-4274)Online publication date: 13-May-2024
    • (2023)NanBDOSKnowledge-Based Systems10.1016/j.knosys.2023.110665274:COnline publication date: 15-Aug-2023
    • (2023)Solving imbalanced learning with outlier detection and features reductionMachine Learning10.1007/s10994-023-06448-0113:8(5273-5330)Online publication date: 7-Dec-2023
    • (2023)Quantifying the competitiveness of a dataset in relation to general preferencesThe VLDB Journal10.1007/s00778-023-00804-133:1(231-250)Online publication date: 8-Aug-2023
    • (2022)Reverse spatial top-k keyword queriesThe VLDB Journal10.1007/s00778-022-00759-932:3(501-524)Online publication date: 25-Jul-2022
    • (2022)Mining Top-K Competitors by Eliminating the K-Least Items from Unstructured DatasetInnovations in Electronics and Communication Engineering10.1007/978-981-16-8512-5_54(505-514)Online publication date: 13-Mar-2022
    • (2021)Optimal school site selection in Urban areas using deep neural networksJournal of Ambient Intelligence and Humanized Computing10.1007/s12652-021-02903-913:1(313-327)Online publication date: 6-Feb-2021
    • (2021)Computing reverse nearest neighbourhood on road mapsWorld Wide Web10.1007/s11280-021-00969-1Online publication date: 23-Nov-2021
    • (2021)A Cost Model for Reverse Nearest Neighbor Query Processing on R-Trees Using Self PruningSimilarity Search and Applications10.1007/978-3-030-89657-7_4(45-53)Online publication date: 22-Oct-2021
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media