An efficiency study of a pivot-based algorithm for similarity search on a heterogeneous platform

Uribe-Paredes, R.; Arias, E.; Cazorla, D.; Sánchez, J. L.

doi:10.1007/s11227-016-1868-7

An efficiency study of a pivot-based algorithm for similarity search on a heterogeneous platform

Published: 13 September 2016

Volume 73, pages 384–401, (2017)
Cite this article

The Journal of Supercomputing Aims and scope Submit manuscript

R. Uribe-Paredes¹,
E. Arias²,
D. Cazorla² &
…
J. L. Sánchez ORCID: orcid.org/0000-0002-3498-9174²

232 Accesses
Explore all metrics

Abstract

Graphics processing units have definitely consolidated a position of privilege in the acceleration of general purpose applications. Search algorithms in large databases are a clear example of applications that benefit from computing platforms based on these devices. To obtain an efficient implementation of a given code using these platforms, it is very important to take into account their features. However, the characteristics of the application and certain overheads still introduced by these platforms, make it not always advisable to use these devices to obtain significant time reductions. In this paper, we show how different properties of current graphics processing units are exploited for improving a version of the general metric structure similarity search algorithm introduced by the authors, and compare it with a multithreaded version of the same algorithm using conventional processors. The analysis of the results provides us relevant data to determine the most appropriate computing platform.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Improving Parallel Processing of Matrix-Based Similarity Measures on Modern GPUs

A survey of graph processing on graphics processing units

Article 09 January 2018

Accelerating Set Similarity Joins Using GPUs

Discover the latest articles and news from researchers in related subjects, suggested using machine learning.

References

Micó ML, Oncina J, Vidal E (1994) A new version of the nearest-neighbour approximating and eliminating search algorithm (aesa) with linear preprocessing time and memory requirements. Pattern Recognit Lett 15(1):9–17
Article Google Scholar
Baeza-Yates R, Cunto W, Manber U, Wu S (1994) Proximity matching using fixed-queries trees. In: 5th Combinatorial Pattern Matching (CPM’94), LNCS, vol 807. Springer, Berlin, pp 198–212
Chávez E, Marroquín JL, Baeza-Yates R (1999) Spaghettis: an array based algorithm for similarity queries in metric spaces. In: 6th International Symposium on String Processing and Information Retrieval (SPIRE’99). IEEE CS Press, pp 38–46
Chávez E, Marroquín JL, Navarro G (2001) Fixed queries array: a fast and economical data structure for proximity searching. Multimedia Tools Appl 14(2):113–135
Article MATH Google Scholar
Pedreira O, Brisaboa NR (2007) Spatial selection of sparse pivots for similarity search in metric spaces. In: 33rd Conference on Current Trends in Theory and Practice of Computer Science (SOFSEM 2007), LNCS, vol 4362. Springer, Harrachov, pp 434–445
Chávez E, Navarro G, Baeza-Yates R, Marroquín JL (2001) Searching in metric spaces. ACM Comput Surveys 33(3):273–321
Article Google Scholar
Hetland M (2009) The basic principles of metric indexing. In: Coello C, Dehuri S, Ghosh S (eds) Swarm intelligence for multi-objective problems in data mining, vol 242., Studies in Computational IntelligenceSpringer, Berlin, pp 199–232
Chapter Google Scholar
Kuang Q, Zhao L (2009) A practical GPU based kNN algorithm. In: International Symposium on Computer Science and Computational Technology (ISCSCT), pp 151–155
Garcia V, Debreuve E, Barlaud M (2008) Fast k nearest neighbor search using GPU. In: Computer Vision and Pattern Recognition Workshop, pp 1–6
Barrientos RJ, Gómez JI, Tenllado C, Matias MP, Marin M (2013) Range query processing on single and multi GPU environments. Comput Electr Eng 39(8):2656–2668
Article Google Scholar
Uribe-Paredes R, Arias E, Sánchez JL, Cazorla D, Valero-Lara P (2012) Improving the performance for the range search on metric spaces using a multi-GPU platform. In: Database and expert systems applications (DEXA), Lecture Notes in Computer Science, vol 7447. Springer, Berlin, pp 442–449
Uribe-Paredes R, Valero-Lara P, Arias E, Sanchez JL, Cazorla D (2011) Similarity search implementations for multi-core and many-core processors. In: International Conference on High Performance Computing and Simulation (HPCS), pp 656–663
Bolettieri P, Esuli A, Falchi F, Lucchese C, Perego R, Piccioli T, Rabitti F (2009) CoPhIR: a test collection for content-based image retrieval. CoRR. arXiv:0905.4627v2
Kruliš M, Skopal T, Lokoč J, Beecks C (2012) Combining cpu and gpu architectures for fast similarity search. Distrib Parallel Databases 30(3–4):179–207
Article Google Scholar
Uribe-Paredes R, Cazorla D, Arias E, Sánchez JL (2014) Towards an efficient static scheduling scheme for delivering queries to heterogeneous clusters in the similarity search problem. J Supercomput 70(2):527–540
Article Google Scholar
Wattsup.Net. https://www.wattsupmeters.com/secure/products.php?pn=0&wai=0&spec=3. Accessed 5 May 2016

Download references

Acknowledgments

This work has been partially supported by the MINECO under the Project CGL2013-48367-P, and by the JCCM and European Commission (FEDER funds) under the Project PEII-2014-028-P.

Author information

Authors and Affiliations

Computer Engineering Department, University of Magallanes, Ave. Bulnes, 01855, Punta Arenas, Chile
R. Uribe-Paredes
Computing Systems Department, University of Castilla-La Mancha, Campus Universitario s/n, Albacete, Spain
E. Arias, D. Cazorla & J. L. Sánchez

Authors

R. Uribe-Paredes
View author publications
You can also search for this author inPubMed Google Scholar
E. Arias
View author publications
You can also search for this author inPubMed Google Scholar
D. Cazorla
View author publications
You can also search for this author inPubMed Google Scholar
J. L. Sánchez
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to J. L. Sánchez.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Uribe-Paredes, R., Arias, E., Cazorla, D. et al. An efficiency study of a pivot-based algorithm for similarity search on a heterogeneous platform. J Supercomput 73, 384–401 (2017). https://doi.org/10.1007/s11227-016-1868-7

Download citation

Published: 13 September 2016
Issue Date: January 2017
DOI: https://doi.org/10.1007/s11227-016-1868-7

Keywords

Profiles

E. Arias View author profile

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An efficiency study of a pivot-based algorithm for similarity search on a heterogeneous platform

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Improving Parallel Processing of Matrix-Based Similarity Measures on Modern GPUs

A survey of graph processing on graphics processing units

Accelerating Set Similarity Joins Using GPUs

Explore related subjects

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Profiles

Subscribe and save

Buy Now