Abstract
Similarity search is a central problem in many applications, such as multimedia databases and repositories of complex unstructured objects. The metric space model is very useful in these scenarios because metric indexes support efficient similarity search, but most of them are designed for main memory. In this article we introduce an improved version of the List of Clustered Permutations (iLCP), a competitive index for approximate similarity search. Our proposal is specially adapted for secondary memory and performs well in several scenarios, especially on spaces of medium and high dimensionality. We assessed this new structure on several real-life metric spaces from SISAP. The results show that the new version keeps the rewarding characteristics of LCP while achieving very good performance in terms of the number of pages read per search.
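For readers unfamiliar with permutation-based indexing, the following is a minimal sketch of the general idea that this family of indexes builds on. It is not the iLCP structure itself: it has no clustering and no disk layout. Each object is represented by the order in which it "sees" a small set of reference objects (permutants), and the objects whose orderings are closest to the query's ordering are checked first. The function names, the Euclidean distance, the use of the Spearman footrule to compare orderings, and the `budget` parameter are all assumptions made for illustration.

```python
from typing import Callable, List, Sequence, Tuple

Point = Tuple[float, ...]
Distance = Callable[[Point, Point], float]


def euclidean(a: Point, b: Point) -> float:
    """Example metric (an assumption); any metric distance could be used."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


def permutation(obj: Point, permutants: Sequence[Point], dist: Distance) -> List[int]:
    """Return rank[i] = position of permutant i when permutants are sorted
    by increasing distance to obj."""
    order = sorted(range(len(permutants)), key=lambda i: dist(obj, permutants[i]))
    rank = [0] * len(permutants)
    for pos, i in enumerate(order):
        rank[i] = pos
    return rank


def footrule(r1: Sequence[int], r2: Sequence[int]) -> int:
    """Spearman footrule: sum of absolute rank differences."""
    return sum(abs(a - b) for a, b in zip(r1, r2))


def approx_knn(query: Point, data: Sequence[Point], permutants: Sequence[Point],
               k: int, budget: int, dist: Distance = euclidean) -> List[int]:
    """Approximate k-NN: compare the query against only the `budget` objects
    whose permutations are most similar to the query's permutation."""
    # In a real index the permutations are computed once at build time.
    perms = [permutation(o, permutants, dist) for o in data]
    q = permutation(query, permutants, dist)
    candidates = sorted(range(len(data)), key=lambda i: footrule(q, perms[i]))[:budget]
    # Only the selected candidates are compared with the real distance.
    return sorted(candidates, key=lambda i: dist(query, data[i]))[:k]
```

With `budget` equal to the size of the dataset the search degenerates to an exact scan; smaller budgets trade recall for fewer distance computations, which is the sense in which such a search is approximate.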
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Figueroa, K., Reyes, N., Camarena-Ibarrola, A., Valero-Elizondo, L. (2018). Improving the List of Clustered Permutation on Metric Spaces for Similarity Searching on Secondary Memory. In: Martínez-Trinidad, J., Carrasco-Ochoa, J., Olvera-López, J., Sarkar, S. (eds) Pattern Recognition. MCPR 2018. Lecture Notes in Computer Science, vol 10880. Springer, Cham. https://doi.org/10.1007/978-3-319-92198-3_9
DOI: https://doi.org/10.1007/978-3-319-92198-3_9
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-92197-6
Online ISBN: 978-3-319-92198-3
eBook Packages: Computer Science, Computer Science (R0)