skip to main content
10.1145/3323873.3325014acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
short-paper

A Genetic Programming Approach for Searching on Nearest Neighbors Graphs

Published: 05 June 2019 Publication History

Abstract

The use of nearest neighbors graphs has been leading to considerable gains over classic approaches (e.g., tree-indexing-based methods) in approximate nearest neighbor searches. Classical searches on these graphs are conduced in a greedy way by moving at each step to the neighbor (of current vertex) with the lowest distance to the query. In this work, we explore the combination of topological properties of graphs and the distance itself through a Genetic Programming framework to obtain a better indicator for selecting the next vertex in the search process. Our objective is to minimize the scan rate needed to reach the true nearest neighbors. Experimental results, conducted with three different graph-based methods over a large textual collection, show significant gains of the proposed approach over the classic search algorithm on graphs.

References

[1]
Juan F. Hernández Albarracín, Edemir Ferreira, Jeferson A. dos Santos, and Ricardo da S. Torres. 2017. Fusion of genetic-programming-based indices in hyperspectral image classification tasks. In Proceeding of the IEEE International Geoscience and Remote Sensing Symposium. 554--557.
[2]
Alexandr Andoni and Piotr Indyk. 2008. Near-optimal Hashing Algorithms for Approximate Nearest Neighbor in High Dimensions. Commun. ACM, Vol. 51, 1 (Jan. 2008), 117--122.
[3]
Fabien André, Anne-Marie Kermarrec, and Nicolas Le Scouarnec. 2017. Accelerated Nearest Neighbor Search with Quick ADC. In Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval (ICMR'2017). ACM, New York, NY, USA, 159--166.
[4]
Martin Aumüller, Erik Bernhardsson, and Alexander Faithfull. 2017. ANN-Benchmarks: A Benchmarking Tool for Approximate Nearest Neighbor Algorithms. In Similarity Search and Applications, Christian Beecks, Felix Borutta, Peer Kröger, and Thomas Seidl (Eds.). Springer International Publishing, Cham, 34--49.
[5]
Artem Babenko and Victor Lempitsky. 2015. The Inverted Multi-Index. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 37, 6 (June 2015), 1247--1260.
[6]
Mayank Bawa, Tyson Condie, and Prasanna Ganesan. 2005. LSH Forest: Self-tuning Indexes for Similarity Search. In Proceedings of the 14th International Conference on World Wide Web (WWW'2005). 651--660.
[7]
Jon Louis Bentley. 1975. Multidimensional Binary Search Trees Used for Associative Searching. Commun. ACM, Vol. 18, 9 (Sept. 1975), 509--517.
[8]
Luciano da F. Costa, Francisco A Rodrigues, Gonzalo Travieso, and Paulino Ribeiro Villas Boas. 2007. Characterization of complex networks: A survey of measurements. Advances in physics, Vol. 56, 1 (2007), 167--242.
[9]
Ricardo da S. Torres, Alexandre X. Falcao, Marcos A. Goncalves, Joao P. Papa, Baoping Zhang, Weiguo Fan, and Edward A Fox. 2009. A genetic programming framework for content-based image retrieval. Pattern Recognition, Vol. 42, 2 (2009), 283--292.
[10]
Sanjoy Dasgupta and Yoav Freund. 2008. Random Projection Trees and Low Dimensional Manifolds. In Proceedings of the 40th Annual ACM Symposium on Theory of Computing (STOC'2008). 537--546.
[11]
Moises G. de Carvalho, Alberto H. F. Laender, Marcos A. Goncalves, and Altigran S. da Silva. 2012. A Genetic Programming Approach to Record Deduplication. IEEE Transactions on Knowledge and Data Engineering, Vol. 24, 3 (March 2012), 399--412.
[12]
Wei Dong, Charikar Moses, and Kai Li. 2011. Efficient K-nearest Neighbor Graph Construction for Generic Similarity Measures. In Proceedings of the 20th International Conference on World Wide Web (WWW'2011). 577--586.
[13]
Tiezheng Ge, Kaiming He, Qifa Ke, and Jian Sun. 2013. Optimized Product Quantization for Approximate Nearest Neighbor Search. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2946--2953.
[14]
Antonin Guttman. 1984. R-trees: A Dynamic Index Structure for Spatial Searching. In Proceedings of the 1984 ACM SIGMOD International Conference on Management of Data (SIGMOD'1984). ACM, New York, NY, USA, 47--57.
[15]
Ben Harwood and Tom Drummond. 2016. FANNG: Fast Approximate Nearest Neighbour Graphs. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 5713--5722.
[16]
Piotr Indyk and Rajeev Motwani. 1998. Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality. In Proceedings of the 30th Annual ACM Symposium on Theory of Computing. 604--613.
[17]
Masajiro Iwasaki. 2016. Pruned Bi-directed K-nearest Neighbor Graph for Proximity Search. In Similarity Search and Applications. Springer International Publishing, Cham, 20--33.
[18]
Herve Jegou, Matthijs Douze, and Cordelia Schmid. 2011. Product quantization for nearest neighbor search. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 33, 1 (2011), 117--128.
[19]
Tianxu Ji, Xianglong Liu, Cheng Deng, Lei Huang, and Bo Lang. 2014. Query-Adaptive Hash Code Ranking for Fast Nearest Neighbor Search. In Proceedings of the 22nd ACM International Conference on Multimedia (MM'2014). ACM, New York, NY, USA, 1005--1008.
[20]
Yannis Kalantidis and Yannis Avrithis. 2014. Locally Optimized Product Quantization for Approximate Nearest Neighbor Search. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2329--2336.
[21]
John R. Koza. 1992. Genetic Programming: on the programming of computers by means of natural selection. Vol. 1. MIT press.
[22]
An'isio Lacerda, Marco Cristo, Marcos André Gonccalves, Weiguo Fan, Nivio Ziviani, and Berthier Ribeiro-Neto. 2006. Learning to Advertise. In Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'2006). ACM, New York, NY, USA, 549--556.
[23]
Qin Lv, William Josephson, Zhe Wang, Moses Charikar, and Kai Li. 2007. Multi-probe LSH: Efficient Indexing for High-dimensional Similarity Search. In Proceedings of the 33rd International Conference on Very Large Data Bases (VLDB'2007). 950--961.
[24]
Yury A. Malkov, Alexander Ponomarenko, Andrey Logvinov, and Vladimir Krylov. 2014. Approximate nearest neighbor algorithm based on navigable small world graphs. Information Systems, Vol. 45 (2014), 61--68.
[25]
Yury A. Malkov and D. A. Yashunin. 2016. Efficient and robust approximate nearest neighbor search using Hierarchical Navigable Small World graphs. arXiv preprint arXiv:1603.09320 (2016).
[26]
Marius Muja and David G. Lowe. 2009. Fast Approximate Nearest Neighbors with Automatic Algorithm Configuration. In Proceedings of the International Conference on Computer Vision Theory and Application. 331--340.
[27]
Dimitris Papadias. 2000. Hill Climbing Algorithms for Content-based Retrieval of Similar Configurations. In Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'2000). ACM, New York, NY, USA, 240--247.
[28]
Jeffrey Pennington, Richard Socher, and Christopher Manning. 2014. Glove: Global Vectors for Word Representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. 1532--1543.
[29]
Chanop Silpa-Anan and Richard Hartley. 2008. Optimised KD-trees for fast image descriptor matching. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1--8.
[30]
Robert F Sproull. 1991. Refinements to nearest-neighbor searching in k-dimensional trees. Algorithmica, Vol. 6, 1--6 (1991), 579--589.
[31]
Min Wang, Wengang Zhou, Qi Tian, Junfu Pu, and Houqiang Li. 2017. Deep Supervised Quantization by Self-Organizing Map. In Proceedings of the 25th ACM International Conference on Multimedia (MM'2017). ACM, New York, NY, USA, 1707--1715.
[32]
Peter N. Yianilos. 1993. Data Structures and Algorithms for Nearest Neighbor Search in General Metric Spaces. In Proceedings of the 4th Annual ACM-SIAM Symposium on Discrete Algorithms. 311--321.

Cited By

View all
  • (2022)A genetic programming approach for searching on nearest neighbors graphsMultimedia Tools and Applications10.1007/s11042-022-12248-w81:16(23449-23472)Online publication date: 18-Mar-2022
  • (2021)A comprehensive survey and experimental comparison of graph-based approximate nearest neighbor searchProceedings of the VLDB Endowment10.14778/3476249.347625514:11(1964-1978)Online publication date: 27-Oct-2021
  • (2021)Two-stage routing with optimized guided search and greedy algorithm on proximity graph▪Knowledge-Based Systems10.1016/j.knosys.2021.107305229:COnline publication date: 11-Oct-2021

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ICMR '19: Proceedings of the 2019 on International Conference on Multimedia Retrieval
June 2019
427 pages
ISBN:9781450367653
DOI:10.1145/3323873
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 June 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. distance learning
  2. genetic programming
  3. graph topological features
  4. nearest neighbor search
  5. nearest neighbors graph

Qualifiers

  • Short-paper

Funding Sources

  • Fundação de Amparo à Pesquisa do Estado de São Paulo
  • Conselho Nacional de Desenvolvimento Científico e Tecnológico

Conference

ICMR '19
Sponsor:

Acceptance Rates

Overall Acceptance Rate 177 of 599 submissions, 30%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)1
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2022)A genetic programming approach for searching on nearest neighbors graphsMultimedia Tools and Applications10.1007/s11042-022-12248-w81:16(23449-23472)Online publication date: 18-Mar-2022
  • (2021)A comprehensive survey and experimental comparison of graph-based approximate nearest neighbor searchProceedings of the VLDB Endowment10.14778/3476249.347625514:11(1964-1978)Online publication date: 27-Oct-2021
  • (2021)Two-stage routing with optimized guided search and greedy algorithm on proximity graph▪Knowledge-Based Systems10.1016/j.knosys.2021.107305229:COnline publication date: 11-Oct-2021

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media