Better ϵ-Dependencies for Offline Approximate Nearest Neighbor Search, Euclidean Minimum Spanning Trees, and ϵ-Kernels

Authors:

Timothy M. Chan

SOCG'14: Proceedings of the thirtieth annual symposium on Computational geometry

Pages 416 - 425

https://doi.org/10.1145/2582112.2582161

Published: 08 June 2014

Abstract

Recently, Arya, da Fonseca, and Mount [STOC 2011, SODA 2012] made notable progress in improving the ϵ-dependencies in the space/query-time tradeoffs for (1 + ϵ)-factor approximate nearest neighbor search in fixed-dimensional Euclidean spaces. However, ϵ-dependencies in the preprocessing time were not considered, and so their data structures cannot be used to derive faster algorithms for offline proximity problems. Known algorithms for many such problems, including approximate bichromatic closest pair (BCP) and approximate Euclidean minimum spanning trees (EMST), typically have factors near (1/ϵ)^{d/2±O(1)} in the running time when the dimension d is a constant.
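To make the (1 + ϵ)-approximate BCP problem concrete, here is a minimal Python sketch of the problem statement only. It is a quadratic brute-force baseline, not the technique of the paper, and the names exact_bcp and is_approx_bcp are hypothetical helpers introduced for illustration.

```python
import math
from itertools import product


def exact_bcp(red, blue):
    """Brute-force bichromatic closest pair: checks all |red| * |blue| pairs."""
    return min(product(red, blue), key=lambda pair: math.dist(*pair))


def is_approx_bcp(pair, red, blue, eps):
    """True if `pair` is a red-blue pair within a (1 + eps) factor of the optimum."""
    r, b = pair
    if r not in red or b not in blue:
        return False
    best = math.dist(*exact_bcp(red, blue))
    return math.dist(r, b) <= (1 + eps) * best


# Illustrative input (assumed data, not from the paper).
red = [(0.0, 0.0), (5.0, 5.0)]
blue = [(1.0, 0.0), (0.0, 1.05), (9.0, 9.0)]
candidate = ((0.0, 0.0), (0.0, 1.05))
print(exact_bcp(red, blue))
print(is_approx_bcp(candidate, red, blue, eps=0.1))
print(is_approx_bcp(candidate, red, blue, eps=0.01))
```

The candidate pair has distance 1.05 against an optimum of 1, so it is acceptable for ϵ = 0.1 but not for ϵ = 0.01. An approximation algorithm is free to return any pair that passes this check.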

We describe a technique that breaks the (1/ϵ)^{d/2} barrier and yields new results for many well-known proximity problems, including:

• an O((1/ϵ)^{d/3+O(1)} n)-time randomized algorithm for approximate BCP,

• an O((1/ϵ)^{d/3+O(1)} n log n)-time algorithm for approximate EMST, and

• an O(n log n + (1/ϵ)^{d/3+O(1)} n)-time algorithm to answer n approximate nearest neighbor queries on n points.

Using additional bit-packing tricks, we can shave off the log n factor for EMST, and even move most of the ϵ-factors to a sublinear term.
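As a rough illustration of what moving the exponent from d/2 to d/3 means for the bounds listed above, the following Python sketch compares the two leading ϵ-factors. It deliberately ignores the +O(1) terms in the exponents and all hidden constants, so the numbers indicate growth rates only, not actual running times; eps_factor is a hypothetical helper.

```python
def eps_factor(eps, d, denom):
    """Leading (1/eps)^(d/denom) term, ignoring the +O(1) in the exponent."""
    return (1.0 / eps) ** (d / denom)


for d in (3, 6, 12):
    old = eps_factor(0.01, d, 2)  # (1/eps)^(d/2) barrier
    new = eps_factor(0.01, d, 3)  # (1/eps)^(d/3) leading term
    print(f"d={d}: d/2 term {old:.3g}, d/3 term {new:.3g}, ratio {old / new:.3g}")
```

The ratio of the two terms is (1/ϵ)^{d/6}. For d = 6 and ϵ = 0.01, for example, the leading terms are about 10^6 and 10^4.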

The improvement arises from a new time bound for exact "discrete Voronoi diagrams", which were previously used in the construction of ϵ-kernels (or extent-based coresets), a well-known tool for another class of fundamental problems. This connection leads to more results, including:

• a streaming algorithm to maintain an approximate diameter in O((1/ϵ)^{d/3+O(1)}) time per point using O((1/ϵ)^{d/2+O(1)}) space, and

• a streaming algorithm to maintain an ϵ-kernel in O((1/ϵ)^{d/4+O(1)}) time per point using O((1/ϵ)^{d/2+O(1)}) space.
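For readers unfamiliar with the streaming setting of the last two results, the sketch below shows a simple classical baseline for maintaining an approximate diameter in the plane. It is not the algorithm of the paper: it keeps only the extreme projections along k fixed directions, and the class name DirectionalDiameter and the parameter k are assumptions made for illustration.

```python
import math


class DirectionalDiameter:
    """Streaming diameter estimate in the plane from k fixed directions.

    Stores the min and max projection per direction, so space is O(k)
    regardless of stream length. With true diameter D, the estimate lies
    in [cos(pi / (2k)) * D, D] once at least one point has been inserted.
    """

    def __init__(self, k):
        # Directions are spaced pi/k apart, so every line through the origin
        # is within angle pi/(2k) of one of them.
        angles = [math.pi * i / k for i in range(k)]
        self.dirs = [(math.cos(a), math.sin(a)) for a in angles]
        self.lo = [math.inf] * k
        self.hi = [-math.inf] * k

    def insert(self, p):
        """Process one stream point in O(k) time."""
        for i, (ux, uy) in enumerate(self.dirs):
            t = p[0] * ux + p[1] * uy
            self.lo[i] = min(self.lo[i], t)
            self.hi[i] = max(self.hi[i], t)

    def estimate(self):
        """Largest projection width over the k directions."""
        return max(h - l for h, l in zip(self.hi, self.lo))


sketch = DirectionalDiameter(k=8)
for point in [(0.0, 0.0), (3.0, 4.0), (1.0, 1.0)]:  # true diameter is 5
    sketch.insert(point)
print(sketch.estimate())
```

Because cos(x) ≥ 1 − x²/2, choosing k on the order of 1/√ϵ makes the estimate a (1 − ϵ)-approximation in two dimensions. The results quoted above concern the much harder question of how the per-point time and space scale with ϵ in general fixed dimension d.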

References

[1] P. K. Agarwal, H. Edelsbrunner, O. Schwarzkopf, and E. Welzl. Euclidean minimum spanning trees and bichromatic closest pairs. In Proc. Sixth Annu. Sympos. Comput. Geom., pages 203--210, 1990.

[2] P. K. Agarwal, S. Har-Peled, and K. R. Varadarajan. Approximating extent measures of points. J. Assoc. Comput. Mach., 51(4):606--635, 2004.

[3] P. K. Agarwal, J. Matousek, and S. Suri. Farthest neighbors, maximum spanning trees and related problems in higher dimensions. Comput. Geom. Theory Appl., 1:189--201, 1991.

[4] A. Andoni, P. Indyk, H. L. Nguyen, and I. Razenshteyn. Beyond locality sensitive hashing. In Proc. 25th Annu. ACM-SIAM Sympos. Discrete Algorithms, pages 1018--1028, 2014.

[5] S. Arya, G. D. da Fonseca, and D. M. Mount. Approximate polytope membership queries. In Proc. 43rd Annu. ACM Sympos. Theory Comput., pages 579--586, 2011.

[6] S. Arya, G. D. da Fonseca, and D. M. Mount. Polytope approximation and the Mahler volume. In Proc. 23rd Annu. ACM-SIAM Sympos. Discrete Algorithms, pages 29--42, 2012.

[7] S. Arya, T. Malamatos, and D. M. Mount. Space-time tradeoffs for approximate nearest neighbor searching. J. Assoc. Comput. Mach., 57:1--54, 2009.

[8] S. Arya, D. M. Mount, N. Netanyahu, R. Silverman, and A. Y. Wu. An optimal algorithm for approximate nearest neighbor searching in fixed dimensions. J. Assoc. Comput. Mach., 45:891--923, 1998.

[9] J. Augustine, D. Eppstein, and K. A. Wortman. Approximate weighted farthest neighbors and minimum dilation stars. Discrete Math., Algorithms and Applications, 2(4):553--566, 2010.

[10] J. L. Bentley and J. B. Saxe. Decomposable searching problems I: Static-to-dynamic transformation. J. Algorithms, 1(4):301--358, 1980.

[11] H. Breu, J. Gil, D. Kirkpatrick, and M. Werman. Linear time Euclidean distance transform algorithms. IEEE Trans. Pattern Analysis and Machine Intelligence, 17:529--533, 1995.

[12] E. M. Bronshteyn and L. D. Ivanov. The approximation of convex sets by polyhedra. Siberian Math. J., 16:852--853, 1976.

[13] P. B. Callahan and S. R. Kosaraju. Faster algorithms for some geometric graphs problems in higher dimensions. In Proc. Fourth Annu. ACM-SIAM Sympos. Discrete Algorithms, pages 291--300, 1993.

[14] P. B. Callahan and S. R. Kosaraju. A decomposition of multidimensional point sets with applications to k-nearest-neighbors and n-body potential fields. J. Assoc. Comput. Mach., 42:67--90, 1995.

[15] T. M. Chan. Approximate nearest neighbor queries revisited. Discrete Comput. Geom., 20:359--373, 1998.

[16] T. M. Chan. Geometric applications of a randomized optimization technique. Discrete Comput. Geom., 22(4):547--567, 1999.

[17] T. M. Chan. Approximating the diameter, width, smallest enclosing cylinder, and minimum-width annulus. Internat. J. Comput. Geom. Appl., 12(1-2):67--85, 2002.

[18] T. M. Chan. Faster core-set constructions and data-stream algorithms in fixed dimensions. Comput. Geom. Theory Appl., 35(1-2):20--35, 2006.

[19] T. M. Chan. Well-separated pair decomposition in linear time? Inf. Process. Lett., 107(5):138--141, 2008.

[20] K. L. Clarkson. An algorithm for approximate closest-point queries. In Proc. Tenth Annu. Sympos. Comput. Geom., pages 160--164, 1994.

[21] A. Czumaj, F. Ergün, L. Fortnow, A. Magen, I. Newman, R. Rubinfeld, and C. Sohler. Approximating the weight of the Euclidean minimum spanning tree in sublinear time. SIAM J. Comput., 35(1):91--109, 2005.

[22] A. Czumaj and C. Sohler. Estimating the weight of metric minimum spanning trees in sublinear time. SIAM J. Comput., 39(3):904--922, 2009.

[23] R. M. Dudley. Metric entropy of some classes of sets with differentiable boundaries. J. Approx. Theory, 10:227--236, 1974.

[24] A. Goel, P. Indyk, and K. R. Varadarajan. Reductions among high dimensional proximity problems. In Proc. 12th Annu. ACM-SIAM Sympos. Discrete Algorithms, pages 769--778, 2001.

[25] S. Har-Peled. Geometric Approximation Algorithms. AMS Press, 2011.

[26] P. Indyk and R. Motwani. Approximate nearest neighbors: Towards removing the curse of dimensionality. In Proc. 30th Annu. ACM Sympos. Theory Comput., pages 604--613, 1998.

[27] S. Khuller and Y. Matias. A simple randomized sieve algorithm for the closest-pair problem. Inf. Comput., 118(1):34--37, 1995.

[28] D. Krznaric, C. Levcopoulos, and B. J. Nilsson. Minimum spanning trees in d dimensions. Nord. J. Comput., 6(4):446--461, 1999.

[29] M. H. Overmars. The Design of Dynamic Data Structures, volume 156 of Lecture Notes in Computer Science. Springer, 1983.

[30] M. H. Overmars and J. van Leeuwen. Worst-case optimal insertion and deletion methods for decomposable searching problems. Inf. Process. Lett., 12(4):168--173, 1981.

[31] P. M. Vaidya. Minimum spanning trees in k-dimensional space. SIAM J. Comput., 17(3):572--582, 1988.

[32] H. Zarrabi-Zadeh. An almost space-optimal streaming algorithm for coresets in fixed dimensions. Algorithmica, 60(1):46--59, 2011.

Index Terms

Better ϵ-Dependencies for Offline Approximate Nearest Neighbor Search, Euclidean Minimum Spanning Trees, and ϵ-Kernels
1. Theory of computation
  1. Randomness, geometry and discrete structures
    1. Computational geometry

Published In

SOCG'14: Proceedings of the thirtieth annual symposium on Computational geometry

June 2014, 588 pages

ISBN: 9781450325943

DOI: 10.1145/2582112

Program Chairs: Siu-Wing Cheng (HKUST), Olivier Devillers (INRIA)

Copyright © 2014 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Tutorial
Research
Refereed limited

Conference

SOCG'14

SOCG'14: Annual Symposium on Computational Geometry

June 8 - 11, 2014

Kyoto, Japan

Acceptance Rates

SOCG'14 Paper Acceptance Rate 60 of 175 submissions, 34%;

Overall Acceptance Rate 625 of 1,685 submissions, 37%

Cited By

Wang W, Yang L, Yang J, Zhang J, Tang D, Liu T (2024). NMNN: Newtonian Mechanics-based Natural Neighbor algorithm. Information Sciences (121200). Online publication date: Jul-2024.
https://doi.org/10.1016/j.ins.2024.121200
Hajiaghaei Shanjani S, King V (2023). Communication costs in a geometric communication network. Theoretical Computer Science 977 (114161). Online publication date: Oct-2023.
https://doi.org/10.1016/j.tcs.2023.114161
Guo H, Li J, Gao H (2022). Minimum Epsilon-Kernel Computation for Large-Scale Data Processing. Journal of Computer Science and Technology 37:6 (1398-1411). Online publication date: 30-Nov-2022.
https://doi.org/10.1007/s11390-022-2429-6
Hajiaghaei Shanjani S, King V (2021). Communication Costs in a Geometric Communication Network. Proceedings of the 22nd International Conference on Distributed Computing and Networking (36-45). Online publication date: 5-Jan-2021.
https://dl.acm.org/doi/10.1145/3427796.3427800
Hong J, Wang Z, Niu W (2019). A simple approximation algorithm for the diameter of a set of points in an Euclidean plane. PLOS ONE 14:2 (e0211201). Online publication date: 8-Feb-2019.
https://doi.org/10.1371/journal.pone.0211201
Rubinstein A, Diakonikolas I, Kempe D, Henzinger M (2018). Hardness of approximate nearest neighbor search. Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing (1260-1268). Online publication date: 20-Jun-2018.
https://dl.acm.org/doi/10.1145/3188745.3188916
Gan J, Tao Y, Das G, Jermaine C, Bernstein P (2018). Fast Euclidean OPTICS with Bounded Precision in Low Dimensional Space. Proceedings of the 2018 International Conference on Management of Data (1067-1082). Online publication date: 27-May-2018.
https://dl.acm.org/doi/10.1145/3183713.3196922
Wang Z, Hong J, Niu W (2018). Efficient Approximation Algorithm With Partition Technique For The Diameter Of A Set Of Points In 2D Plane. Journal of Physics: Conference Series 1087 (022035). Online publication date: 2-Oct-2018.
https://doi.org/10.1088/1742-6596/1087/2/022035
Munteanu A, Schwiegelshohn C (2017). Coresets-Methods and History: A Theoreticians Design Pattern for Approximation and Streaming Algorithms. KI - Künstliche Intelligenz 32:1 (37-53). Online publication date: 19-Dec-2017.
https://doi.org/10.1007/s13218-017-0519-3
Arya S, Mount D (2016). A fast and simple algorithm for computing approximate euclidean minimum spanning trees. Proceedings of the twenty-seventh annual ACM-SIAM symposium on Discrete algorithms (1220-1233). Online publication date: 10-Jan-2016.
https://dl.acm.org/doi/10.5555/2884435.2884520
