Skip to main content

Improving the List of Clustered Permutation on Metric Spaces for Similarity Searching on Secondary Memory

  • Conference paper
  • First Online:
  • 747 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 10880))

Abstract

The similarity search is a central problem to many applications, such as multimedia databases and repositories containing complex non-structured objects. The metric space model is very useful in these scenarios, because metric indexes support efficient similarity search but most of them are designed for main memory. In this article we introduce an improved version of the List of Clustered Permutations (iLCP), a competitive index for approximate similarity search. Our proposal is specially adapted for secondary memory and performs well in several scenarios, especially on spaces of medium and high dimensionality. We assessed this new structure with several real-life metric spaces from SISAP, the results show that this new version keeps the rewarding characteristics of LCP, while obtaining a very good performance in terms of number of pages read per search.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   44.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   59.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Amato, G., Savino, P.: Approximate similarity search in metric spaces using inverted files. In: 3rd International ICST Conference on Scalable Information Systems, INFOSCALE 2008, Vico Equense, Italy, 4–6 June 2008, p. 28 (2008). https://doi.org/10.4108/ICST.INFOSCALE2008.3486

  2. Chávez, E., Navarro, G.: Probabilistic proximity search: fighting the curse of dimensionality in metric spaces. Inf. Process. Lett. 85(1), 39–46 (2003)

    Article  MathSciNet  Google Scholar 

  3. Chávez, E., Navarro, G.: A compact space decomposition for effective metric indexing. Pattern Recogn. Lett. 26(9), 1363–1376 (2005)

    Article  Google Scholar 

  4. Chavez, E., Figueroa, K., Navarro, G.: Effective proximity retrieval by ordering permutations. IEEE Trans. Pattern Anal. Mach. Intell. 30(9), 1647–1658 (2008)

    Article  Google Scholar 

  5. Esuli, A.: Use of permutation prefixes for efficient and scalable approximate similarity search. Inf. Process. Manag. 48(5), 889–902 (2012). https://doi.org/10.1016/j.ipm.2010.11.011

    Article  Google Scholar 

  6. Figueroa, K., Paredes, R.: List of clustered permutations for proximity searching. In: Brisaboa, N., Pedreira, O., Zezula, P. (eds.) SISAP 2013. LNCS, vol. 8199, pp. 50–58. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-41062-8_6

    Chapter  Google Scholar 

  7. Figueroa, K., Navarro, G., Chávez, E.: Metric spaces library (2007). http://www.sisap.org/Metric_Space_Library.html

  8. Jin, S., Kim, O., Feng, W.: \({\rm M}^{\rm X}\)-tree: a double hierarchical metric index with overlap reduction. In: Murgante, B., Misra, S., Carlini, M., Torre, C.M., Nguyen, H.-Q., Taniar, D., Apduhan, B.O., Gervasi, O. (eds.) ICCSA 2013. LNCS, vol. 7975, pp. 574–589. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-39640-3_42

    Chapter  Google Scholar 

  9. Leskovec, J., Rajaraman, A., Ullman, J.D.: Mining of Massive Datasets, 2nd edn. Cambridge University Press, New York (2014)

    Book  Google Scholar 

  10. Lokoč, J., Moško, J., Čech, P., Skopal, T.: On indexing metric spaces using cut-regions. Inf. Syst. 43, 1–19 (2014). http://www.sciencedirect.com/science/article/pii/S0306437914000258

    Article  Google Scholar 

  11. Navarro, G., Reyes, N.: New dynamic metric indices for secondary memory. Inf. Syst. 59, 48–78 (2016)

    Article  Google Scholar 

  12. Roggero, P., Reyes, N., Figueroa, K., Paredes, R.: List of clustered permutations in secondary memory for proximity searching. J. Comput. Sci. Technol. 15(02), 107–113 (2015)

    Google Scholar 

  13. Samet, H.: Foundations of Multidimensional and Metric Data Structures, 1st edn. The Morgan Kaufman Series in Computer Graphics and Geometic Modeling, Morgan Kaufmann Publishers, University of Maryland at College Park (2006)

    Google Scholar 

  14. Uhlmann, J.: Satisfying general proximity/similarity queries with metric trees. Inf. Process. Lett. 40(4), 175–179 (1991)

    Article  Google Scholar 

  15. Zezula, P., Amato, G., Dohnal, V., Batko, M.: Similarity Search: The Metric Space Approach. Advances in Database Systems, vol. 32. Springer, Heidelberg (2006). https://doi.org/10.1007/0-387-29151-2

    Book  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Karina Figueroa .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Figueroa, K., Reyes, N., Camarena-Ibarrola, A., Valero-Elizondo, L. (2018). Improving the List of Clustered Permutation on Metric Spaces for Similarity Searching on Secondary Memory. In: Martínez-Trinidad, J., Carrasco-Ochoa, J., Olvera-López, J., Sarkar, S. (eds) Pattern Recognition. MCPR 2018. Lecture Notes in Computer Science(), vol 10880. Springer, Cham. https://doi.org/10.1007/978-3-319-92198-3_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-92198-3_9

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-92197-6

  • Online ISBN: 978-3-319-92198-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics