skip to main content
10.1145/2582112.2582161acmotherconferencesArticle/Chapter ViewAbstractPublication PagessocgConference Proceedingsconference-collections
tutorial

Better ϵ-Dependencies for Offline Approximate Nearest Neighbor Search, Euclidean Minimum Spanning Trees, and ϵ-Kernels

Published: 08 June 2014 Publication History

Abstract

Recently, Arya, da Fonseca, and Mount [STOC 2011, SODA 2012] made notable progress in improving the ϵ-dependencies in the space/query-time tradeoffs for (1 + ϵ)-factor approximate nearest neighbor search in fixed-dimensional Euclidean spaces. However, ϵ-dependencies in the preprocessing time were not considered, and so their data structures cannot be used to derive faster algorithms for offline proximity problems. Known algorithms for many such problems, including approximate bichromatic closest pair (BCP) and approximate Euclidean minimum spanning trees (EMST), typically have factors near (1/ϵ)d/2±O(1) in the running time when the dimension d is a constant.
We describe a technique that breaks the (1/ϵ)d/2 barrier and yields new results for many well-known proximity problems, including:
• an O((1/ϵ)d/3+O(1) n)-time randomized algorithm for approximate BCP,
• an O((1/ϵ)d/3+O(1) n log n)-time algorithm for approximate EMST, and
• an O(n log n + (1/ϵ)d/3+O(1) n)-time algorithm to answer n approximate nearest neighbor queries on n points.
Using additional bit-packing tricks, we can shave off the log n factor for EMST, and even move most of the ϵ-factors to a sublinear term.
The improvement arises from a new time bound for exact "discrete Voronoi diagrams", which were previously used in the construction of ϵ-kernels (or extent-based coresets), a well-known tool for another class of fundamental problems. This connection leads to more results, including:
• a streaming algorithm to maintain an approximate diameter in O((1/ϵ)d/3+O(1)) time per point using O((1/ϵ)d/2+O(1)) space, and
• a streaming algorithm to maintain an ϵ-kernel in O((1/ϵ)d/4+O(1)) time per point using O((1/ϵ)d/2+O(1)) space.

References

[1]
P. K. Agarwal, H. Edelsbrunner, O. Schwarzkopf, and E. Welzl. Euclidean minimum spanning trees and bichromatic closest pairs. In Proc. Sixth Annu. Sympos. Comput. Geom., pages 203--210, 1990.
[2]
P. K. Agarwal, S. Har-Peled, and K. R. Varadarajan. Approximating extent measures of points. J. Assoc. Comput. Mach., 51(4):606--635, 2004.
[3]
P. K. Agarwal, J. Matousek, and S. Suri. Farthest neighbors, maximum spanning trees and related problems in higher dimensions. Comput. Geom. Theory Appl., 1:189--201, 1991.
[4]
A. Andoni, P. Indyk, H. L. Nguyen, and I. Razenshteyn. Beyond locality sensitive hashing. In Proc. 25th Annu. ACM-SIAM Sympos. Discrete Algorithms, pages 1018--1028, 2014.
[5]
S. Arya, G. D. da Fonseca, and D. M. Mount. Approximate polytope membership queries. In Proc. 43rd Annu. ACM Sympos. Theory Comput., pages 579--586, 2011.
[6]
S. Arya, G. D. da Fonseca, and D. M. Mount. Polytope approximation and the Mahler volume. In Proc. 23rd Annu. ACM-SIAM Sympos. Discrete Algorithms, pages 29--42, 2012.
[7]
S. Arya, T. Malamatos, and D. M. Mount. Space-time tradeoffs for approximate nearest neighbor searching. J. Assoc. Comput. Mach., 57:1--54, 2009.
[8]
S. Arya, D. M. Mount, N. Netanyahu, R. Silverman, and A. Y. Wu. An optimal algorithm for approximate nearest neighbor searching in fixed dimensions. J. Assoc. Comput. Mach., 45:891--923, 1998.
[9]
J. Augustine, D. Eppstein, and K. A. Wortman. Approximate weighted farthest neighbors and minimum dilation stars. Discrete Math., Algorithms and Applications, 2(4):553--566, 2010.
[10]
J. L. Bentley and J. B. Saxe. Decomposable searching problems I: Static-to-dynamic transformation. J. Algorithms, 1(4):301--358, 1980.
[11]
H. Breu, J. Gil, D. Kirkpatrick, and M. Werman. Linear time Euclidean distance transform algorithms. IEEE Trans. Pattern Analysis and Machine Intelligence, 17:529--533, 1995.
[12]
E. M. Bronshteyn and L. D. Ivanov. The approximation of convex sets by polyhedra. Siberian Math. J., 16:852--853, 1976.
[13]
P. B. Callahan and S. R. Kosaraju. Faster algorithms for some geometric graphs problems in higher dimensions. In Proc. Fourth Annu. ACM-SIAM Sympos. Discrete Algorithms, pages 291--300, 1993.
[14]
P. B. Callahan and S. R. Kosaraju. A decomposition of multidimensional point sets with applications to k-nearest-neighbors and n-body potential fields. J. Assoc. Comput. Mach., 42:67--90, 1995.
[15]
T. M. Chan. Approximate nearest neighbor queries revisited. Discrete Comput. Geom., 20:359--373, 1998.
[16]
T. M. Chan. Geometric applications of a randomized optimization technique. Discrete Comput. Geom., 22(4):547--567, 1999.
[17]
T. M. Chan. Approximating the diameter, width, smallest enclosing cylinder, and minimum-width annulus. Internat. J. Comput. Geom. Appl., 12(1-2):67--85, 2002.
[18]
T. M. Chan. Faster core-set constructions and data-stream algorithms in fixed dimensions. Comput. Geom. Theory Appl., 35(1-2):20--35, 2006.
[19]
T. M. Chan. Well-separated pair decomposition in linear time? Inf. Process. Lett., 107(5):138--141, 2008.
[20]
K. L. Clarkson. An algorithm for approximate closest-point queries. In Proc. Tenth Annu. Sympos. Comput. Geom., pages 160--164, 1994.
[21]
A. Czumaj, F. Ergün, L. Fortnow, A. Magen, I. Newman, R. Rubinfeld, and C. Sohler. Approximating the weight of the Euclidean minimum spanning tree in sublinear time. SIAM J. Comput., 35(1):91--109, 2005.
[22]
A. Czumaj and C. Sohler. Estimating the weight of metric minimum spanning trees in sublinear time. SIAM J. Comput., 39(3):904--922, 2009.
[23]
R. M. Dudley. Metric entropy of some classes of sets with differentiable boundaries. J. Approx. Theory, 10:227--236, 1974.
[24]
A. Goel, P. Indyk, and K. R. Varadarajan. Reductions among high dimensional proximity problems. In Proc. 12th Annu. ACM-SIAM Sympos. Discrete Algorithms, pages 769--778, 2001.
[25]
S. Har-Peled. Geometric Approximation Algorithms. AMS Press, 2011.
[26]
P. Indyk and R. Motwani. Approximate nearest neighbors: Towards removing the curse of dimensionality. In Proc. 30th Annu. ACM Sympos. Theory Comput., pages 604--613, 1998.
[27]
S. Khuller and Y. Matias. A simple randomized sieve algorithm for the closest-pair problem. Inf. Comput., 118(1):34--37, 1995.
[28]
D. Krznaric, C. Levcopoulos, and B. J. Nilsson. Minimum spanning trees in d dimensions. Nord. J. Comput., 6(4):446--461, 1999.
[29]
M. H. Overmars. The Design of Dynamic Data Structures, volume 156 of Lecture Notes in Computer Science. Springer, 1983.
[30]
M. H. Overmars and J. van Leeuwen. Worst-case optimal insertion and deletion methods for decomposable searching problems. Inf. Process. Lett., 12(4):168--173, 1981.
[31]
P. M. Vaidya. Minimum spanning trees in k-dimensional space. SIAM J. Comput., 17(3):572--582, 1988.
[32]
H. Zarrabi-Zadeh. An almost space-optimal streaming algorithm for coresets in fixed dimensions. Algorithmica, 60(1):46--59, 2011.

Cited By

View all
  • (2024)NMNN: Newtonian Mechanics-based Natural Neighbor algorithmInformation Sciences10.1016/j.ins.2024.121200(121200)Online publication date: Jul-2024
  • (2023)Communication costs in a geometric communication networkTheoretical Computer Science10.1016/j.tcs.2023.114161977(114161)Online publication date: Oct-2023
  • (2022)Minimum Epsilon-Kernel Computation for Large-Scale Data ProcessingJournal of Computer Science and Technology10.1007/s11390-022-2429-637:6(1398-1411)Online publication date: 30-Nov-2022
  • Show More Cited By

Index Terms

  1. Better ϵ-Dependencies for Offline Approximate Nearest Neighbor Search, Euclidean Minimum Spanning Trees, and ϵ-Kernels

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    SOCG'14: Proceedings of the thirtieth annual symposium on Computational geometry
    June 2014
    588 pages
    ISBN:9781450325943
    DOI:10.1145/2582112
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 08 June 2014

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. ϵ-dependencies
    2. bichromatic closest pair
    3. coresets
    4. minimum spanning tree
    5. nearest neighbor

    Qualifiers

    • Tutorial
    • Research
    • Refereed limited

    Conference

    SOCG'14

    Acceptance Rates

    SOCG'14 Paper Acceptance Rate 60 of 175 submissions, 34%;
    Overall Acceptance Rate 625 of 1,685 submissions, 37%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)4
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 27 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)NMNN: Newtonian Mechanics-based Natural Neighbor algorithmInformation Sciences10.1016/j.ins.2024.121200(121200)Online publication date: Jul-2024
    • (2023)Communication costs in a geometric communication networkTheoretical Computer Science10.1016/j.tcs.2023.114161977(114161)Online publication date: Oct-2023
    • (2022)Minimum Epsilon-Kernel Computation for Large-Scale Data ProcessingJournal of Computer Science and Technology10.1007/s11390-022-2429-637:6(1398-1411)Online publication date: 30-Nov-2022
    • (2021)Communication Costs in a Geometric Communication NetworkProceedings of the 22nd International Conference on Distributed Computing and Networking10.1145/3427796.3427800(36-45)Online publication date: 5-Jan-2021
    • (2019)A simple approximation algorithm for the diameter of a set of points in an Euclidean planePLOS ONE10.1371/journal.pone.021120114:2(e0211201)Online publication date: 8-Feb-2019
    • (2018)Hardness of approximate nearest neighbor searchProceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing10.1145/3188745.3188916(1260-1268)Online publication date: 20-Jun-2018
    • (2018)Fast Euclidean OPTICS with Bounded Precision in Low Dimensional SpaceProceedings of the 2018 International Conference on Management of Data10.1145/3183713.3196922(1067-1082)Online publication date: 27-May-2018
    • (2018)Efficient Approximation Algorithm With Partition Technique For The Diameter Of A Set Of Points In 2D PlaneJournal of Physics: Conference Series10.1088/1742-6596/1087/2/0220351087(022035)Online publication date: 2-Oct-2018
    • (2017)Coresets-Methods and History: A Theoreticians Design Pattern for Approximation and Streaming AlgorithmsKI - Künstliche Intelligenz10.1007/s13218-017-0519-332:1(37-53)Online publication date: 19-Dec-2017
    • (2016)A fast and simple algorithm for computing approximate euclidean minimum spanning treesProceedings of the twenty-seventh annual ACM-SIAM symposium on Discrete algorithms10.5555/2884435.2884520(1220-1233)Online publication date: 10-Jan-2016

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media