Abstract
Graphics processing units have definitely consolidated a position of privilege in the acceleration of general purpose applications. Search algorithms in large databases are a clear example of applications that benefit from computing platforms based on these devices. To obtain an efficient implementation of a given code using these platforms, it is very important to take into account their features. However, the characteristics of the application and certain overheads still introduced by these platforms, make it not always advisable to use these devices to obtain significant time reductions. In this paper, we show how different properties of current graphics processing units are exploited for improving a version of the general metric structure similarity search algorithm introduced by the authors, and compare it with a multithreaded version of the same algorithm using conventional processors. The analysis of the results provides us relevant data to determine the most appropriate computing platform.












Similar content being viewed by others
References
Micó ML, Oncina J, Vidal E (1994) A new version of the nearest-neighbour approximating and eliminating search algorithm (aesa) with linear preprocessing time and memory requirements. Pattern Recognit Lett 15(1):9–17
Baeza-Yates R, Cunto W, Manber U, Wu S (1994) Proximity matching using fixed-queries trees. In: 5th Combinatorial Pattern Matching (CPM’94), LNCS, vol 807. Springer, Berlin, pp 198–212
Chávez E, Marroquín JL, Baeza-Yates R (1999) Spaghettis: an array based algorithm for similarity queries in metric spaces. In: 6th International Symposium on String Processing and Information Retrieval (SPIRE’99). IEEE CS Press, pp 38–46
Chávez E, Marroquín JL, Navarro G (2001) Fixed queries array: a fast and economical data structure for proximity searching. Multimedia Tools Appl 14(2):113–135
Pedreira O, Brisaboa NR (2007) Spatial selection of sparse pivots for similarity search in metric spaces. In: 33rd Conference on Current Trends in Theory and Practice of Computer Science (SOFSEM 2007), LNCS, vol 4362. Springer, Harrachov, pp 434–445
Chávez E, Navarro G, Baeza-Yates R, Marroquín JL (2001) Searching in metric spaces. ACM Comput Surveys 33(3):273–321
Hetland M (2009) The basic principles of metric indexing. In: Coello C, Dehuri S, Ghosh S (eds) Swarm intelligence for multi-objective problems in data mining, vol 242., Studies in Computational IntelligenceSpringer, Berlin, pp 199–232
Kuang Q, Zhao L (2009) A practical GPU based kNN algorithm. In: International Symposium on Computer Science and Computational Technology (ISCSCT), pp 151–155
Garcia V, Debreuve E, Barlaud M (2008) Fast k nearest neighbor search using GPU. In: Computer Vision and Pattern Recognition Workshop, pp 1–6
Barrientos RJ, Gómez JI, Tenllado C, Matias MP, Marin M (2013) Range query processing on single and multi GPU environments. Comput Electr Eng 39(8):2656–2668
Uribe-Paredes R, Arias E, Sánchez JL, Cazorla D, Valero-Lara P (2012) Improving the performance for the range search on metric spaces using a multi-GPU platform. In: Database and expert systems applications (DEXA), Lecture Notes in Computer Science, vol 7447. Springer, Berlin, pp 442–449
Uribe-Paredes R, Valero-Lara P, Arias E, Sanchez JL, Cazorla D (2011) Similarity search implementations for multi-core and many-core processors. In: International Conference on High Performance Computing and Simulation (HPCS), pp 656–663
Bolettieri P, Esuli A, Falchi F, Lucchese C, Perego R, Piccioli T, Rabitti F (2009) CoPhIR: a test collection for content-based image retrieval. CoRR. arXiv:0905.4627v2
Kruliš M, Skopal T, Lokoč J, Beecks C (2012) Combining cpu and gpu architectures for fast similarity search. Distrib Parallel Databases 30(3–4):179–207
Uribe-Paredes R, Cazorla D, Arias E, Sánchez JL (2014) Towards an efficient static scheduling scheme for delivering queries to heterogeneous clusters in the similarity search problem. J Supercomput 70(2):527–540
Wattsup.Net. https://www.wattsupmeters.com/secure/products.php?pn=0&wai=0&spec=3. Accessed 5 May 2016
Acknowledgments
This work has been partially supported by the MINECO under the Project CGL2013-48367-P, and by the JCCM and European Commission (FEDER funds) under the Project PEII-2014-028-P.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Uribe-Paredes, R., Arias, E., Cazorla, D. et al. An efficiency study of a pivot-based algorithm for similarity search on a heterogeneous platform. J Supercomput 73, 384–401 (2017). https://doi.org/10.1007/s11227-016-1868-7
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-016-1868-7