High-Dimensional Nearest Neighbor Search with Remote Data Centers

Wang, Changzhou; Wang, Xiaoyang Sean

doi:10.1007/s101150200015

Original Paper
Published: September 2002

Volume 4, pages 440–465, (2002)
Knowledge and Information Systems

Abstract.

Many data centers have archived tremendous amounts of data and have begun to publish them on the Web. Because of limited resources and a large number of service requests, data centers usually do not directly support high-cost queries. Users, on the other hand, are often overwhelmed by the huge data volume and cannot afford to download entire data sets and search them locally. To support high-dimensional nearest neighbor searches in this environment, the paper develops a multi-level approximation scheme. The coarsest-level approximations are stored locally and searched first. The result is then refined gradually through accesses to remote data centers. Data centers need only deliver data items, or their precomputed finer-level approximations, by identifier.
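The abstract does not say what form the approximations take. As a rough illustration of the local, coarsest-level step only, the sketch below assumes each data item is approximated by the grid cell that contains it, and that distance is Euclidean. The names coarse_cell, distance_bounds and candidate_ids, and the cell_width parameter, are hypothetical and are not taken from the paper.

```python
import math
from typing import List, Sequence, Tuple

# A cell is one (low, high) interval per dimension (assumed approximation form).
Cell = Tuple[Tuple[float, float], ...]


def coarse_cell(point: Sequence[float], cell_width: float) -> Cell:
    """Approximate a point by the axis-aligned grid cell that contains it."""
    lows = [math.floor(x / cell_width) * cell_width for x in point]
    return tuple((low, low + cell_width) for low in lows)


def distance_bounds(query: Sequence[float], cell: Cell) -> Tuple[float, float]:
    """Lower and upper bounds on the Euclidean distance from the query
    to any point inside the cell."""
    lower_sq = upper_sq = 0.0
    for q, (low, high) in zip(query, cell):
        nearest = min(max(q, low), high)  # clamp q into [low, high]
        farthest = low if abs(q - low) > abs(q - high) else high
        lower_sq += (q - nearest) ** 2
        upper_sq += (q - farthest) ** 2
    return math.sqrt(lower_sq), math.sqrt(upper_sq)


def candidate_ids(query: Sequence[float], cells: List[Cell]) -> List[int]:
    """Identifiers of items that may still be the nearest neighbor:
    those whose lower bound does not exceed the smallest upper bound."""
    bounds = [distance_bounds(query, cell) for cell in cells]
    best_upper = min(upper for _, upper in bounds)
    return [i for i, (lower, _) in enumerate(bounds) if lower <= best_upper]


# Only the cells are kept locally; the exact points stay at the data center.
points = [(0.2, 0.3), (5.1, 5.7), (0.9, 0.1)]
cells = [coarse_cell(p, 1.0) for p in points]
print(candidate_ids((0.5, 0.5), cells))  # [0, 2]
```

In this sketch only the surviving identifiers would need to be sent to a remote data center for finer approximations or the full items, which is the kind of by-identifier access the abstract describes.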

The search process is usually long in this environment because it involves remote sites. This paper describes an online search process: the system periodically reports a data item and a positive integer M. The reported item is guaranteed to be one of the M nearest neighbors of the query item. The paper proposes two algorithms to minimize M in each period. Experiments show that one of them performs similarly to a theoretical a posteriori algorithm and significantly outperforms the online extensions of two state-of-the-art nearest neighbor search methods.
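One way such a guarantee can be derived, assuming the system holds a lower and an upper bound on each item's distance to the query, is sketched below. It is not either of the paper's two algorithms, which the abstract does not detail; it only shows how a valid (item, M) pair could be computed from the current bounds. The function name report and the example bounds are hypothetical.

```python
from typing import Dict, Tuple


def report(bounds: Dict[str, Tuple[float, float]]) -> Tuple[str, int]:
    """Return (item, M) such that item is among the M nearest neighbors.

    bounds maps an item identifier to (lower, upper) bounds on its
    distance to the query. The item with the smallest upper bound is
    reported. Only items whose lower bound is below that upper bound
    can be strictly closer, so M is one plus their count.
    """
    best = min(bounds, key=lambda item: bounds[item][1])
    best_upper = bounds[best][1]
    possibly_closer = sum(
        1
        for item, (lower, _) in bounds.items()
        if item != best and lower < best_upper
    )
    return best, 1 + possibly_closer


# Bounds from coarse approximations: "a" might still be closer than "b".
coarse = {"a": (1.0, 4.0), "b": (2.0, 3.0), "c": (3.5, 6.0)}
print(report(coarse))  # ('b', 2)

# After fetching "a" from the remote site its distance is known exactly.
refined = {"a": (3.8, 3.8), "b": (2.0, 3.0), "c": (3.5, 6.0)}
print(report(refined))  # ('b', 1)
```

Each remote access tightens some bounds, so M can only stay the same or shrink under this rule. Which items to refine in each period is the decision the paper's algorithms address.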

Author information

Authors and Affiliations

Mathematics and Computing Technology, The Boeing Company, Bellevue, WA, USA
Changzhou Wang
Department of Information and Software Engineering, George Mason University, Fairfax, VA, USA
Xiaoyang Sean Wang

Additional information

Received 25 July 2000 / Revised 25 July 2001 / Accepted in revised form 16 October 2001

Correspondence and offprint requests to: Xiaoyang Sean Wang, Department of Information and Software Engineering, George Mason University, Fairfax, VA 22030, USA. Email: xywang@gmu.edu

About this article

Cite this article

Wang, C., Wang, X. High-Dimensional Nearest Neighbor Search with Remote Data Centers. Knowl Inform Sys 4, 440–465 (2002). https://doi.org/10.1007/s101150200015

Issue Date: September 2002
DOI: https://doi.org/10.1007/s101150200015

Keywords: High-dimensional data; Nearest neighbor search; Online algorithm
