Skip to main content
Log in

Locality sensitive hashing with bit selection

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Locality sensitive hashing (LSH), one of the most popular hashing techniques, has attracted considerable attention for nearest neighbor search in the field of image retrieval. It can achieve promising performance only if the number of the generated hash bits is large enough. However, more hash bits assembled to the binary codes contain massive redundant information and require more time cost and storage spaces. To alleviate this limitation, we propose a novel bit selection framework to pick important bits out of the hash bits generated by hashing techniques. Within the bit selection framework, we further exploit eleven evaluation criteria to measure the importance and similarity of each bit generated by LSH, so that the bits with high importance and less similarity are selected to assemble new binary codes. To demonstrate the effectiveness of the proposed framework of bit selection, we evaluated the proposed framework with the evaluation criteria on five commonly used data sets. Experimental results show the proposed bit selection framework works effectively in different cases, and the performance of LSH has not been degraded significantly after redundant hash bits reduced by the evaluation criteria.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

References

  1. Wang J, Zhang T, Song J, Sebe N, Shen HT (2018) A survey on learning to hash. IEEE Trans Pattern Anal Mach Intell 40(4):769–790

    Article  Google Scholar 

  2. Zhang B, Qian J (2021) Autoencoder-based unsupervised clustering and hashing. Appl Intell 51(1):493–505

    Article  Google Scholar 

  3. Liu H, Li E, Liu X, Su K, Zhang S (2021) Anomaly detection with kernel preserving embedding. ACM Transactions on Knowledge Discovery from Data 15(5):91:1–91:

    Article  Google Scholar 

  4. Zhang S, Li X, Zong M, Zhu X, Wang R (2018) Efficient knn classification with different numbers of nearest neighbors. IEEE Transactions on Neural Networks and Learning Systems 29(5):1774–1785

    Article  MathSciNet  Google Scholar 

  5. Quynh NH, Thuy QDT, Van CP, Van CN, Tao NQ (2018) An efficient image retrieval method using adaptive weights. Appl Intell 48(10):3807–3826

    Article  Google Scholar 

  6. Silpa-Anan C, Hartley RI (2008) Optimised kd-trees for fast image descriptor matching. In: Proc. of the 2008 IEEE conf computer vision and pattern recognition (CVPR08), pp 24–26

  7. Bentley JL (1975) Multidimensional binary search trees used for associative searching. Commun ACM 18(9):509–517

    Article  Google Scholar 

  8. Nielsen F, Piro P, Barlaud M (2009) Tailored bregman ball trees for effective nearest neighbors. In: Proc of IEEE european workshop on computational geometry, pp 29–32

  9. Liu H, Xu X, Li E, Li X, Zhang S (2021) Anomaly detection with representative neighbors. IEEE Transactions on Neural Networks and Learning Systems, 32(12). https://doi.org/10.1109/TNNLS.2021.3109898

  10. Liu H, Li X, Zhang S, Tian Q (2020) Adaptive hashing with sparse matrix factorization. IEEE Transactions on Neural Networks and Learning Systems 31(10):4318–4329

    Article  MathSciNet  Google Scholar 

  11. Cai D (2021) A revisit of hashing algorithms for approximate nearest neighbor search. IEEE Trans Knowl Data Eng 33(6):2337–2348

    Article  Google Scholar 

  12. Kulis B, Grauman K (2009) Kernelized locality-sensitive hashing for scalable image search. In: Proc of IEEE int conf computer vision (ICCV09), pp 2130–2137

  13. Raginsky M, Lazebnik S (2009) Locality-sensitive binary codes from shift-invariant kernels. In: Proc of the annual conf neural information processing systems (NIPS09), pp 1509–1517

  14. Chi L, Zhu X (2017) Hashing techniques: A survey and taxonomy. ACM Computting Surveys 50(1):1–36

    Google Scholar 

  15. Jégou H, Perronnin F, Douze M, Sánchez J, Pérez P, Schmid C (2012) Aggregating local image descriptors into compact codes. IEEE Trans Pattern Anal Mach Intell 34(9):1704–1716

    Article  Google Scholar 

  16. Gong Y, Lazebnik S, Gordo A, Perronnin F (2013) Iterative quantization: a procrustean approach to learning binary codes for large-scale image retrieval. IEEE Trans Pattern Anal Mach Intell 35 (12):2916–2929

    Article  Google Scholar 

  17. Hoang T, Do TT, Nguyen TV, Cheung NM (2020) Unsupervised deep cross-modality spectral hashing. IEEE Trans Image Process 29:8391–8406

    Article  MathSciNet  Google Scholar 

  18. Liu W, Wang J, Kumar S, Chang SF (2011) Hashing with graphs. In: Proc of Int Conf Machine Learning (ICML2011), pp 1–8

  19. Zhu X, Zhang L, Huang Z (2014) A sparse embedding and least variance encoding approach to hashing. IEEE Trans Image Process 23(9):3737–3750

    Article  MathSciNet  Google Scholar 

  20. Irie G, Li Z, Wu XM, Chang SF (2014) Locally linear hashing for extracting non-linear manifolds. In: Proc of the 2014 IEEE conf computer vision and pattern recognition (CVPR2014), pp 2123–2130

  21. Pang QQ, Zhang L (2020) Fast backward iterative laplacian score for unsupervised feature selection. In: Proc of int conf knowledge science, engineering and management (KSEM2020), pp 409–420

  22. Xie X, Chen H, Qian J (2019) Twin maximum entropy discriminations for classification. Appl Intell 49(6):2391–2399

    Article  Google Scholar 

  23. Pan H, You X, Liu S, Zhang D (2021) Pearson correlation coefficient-based pheromone refactoring mechanism for multi-colony ant colony optimization. Appl Intell 51(2):752–774

    Article  Google Scholar 

  24. Liu H, Sun J, Liu L, Zhang H (2009) Feature selection with dynamic mutual information. Pattern Recogn 42(7):1330–1339

    Article  Google Scholar 

  25. Zhang D, Wang J, Cai D, Lu J (2010) Self-taught hashing for fast similarity search. In: Proc of the 33rd Int ACM SIGIR conf research and development in information, pp 18–25

  26. Zhu X, Huang Z, Cheng H, Cui J, Shen HT (2013) Sparse hashing for fast multimedia search. ACM Trans Inf Syst 31(2):1–24

    Article  Google Scholar 

  27. Liu X, He J, Chang SF (2017a) Hash bit selection for nearest neighbor search. IEEE Trans Image Process 26(11):5367–5380

    Article  MathSciNet  Google Scholar 

  28. Liu H, Liu L, Le TD, Lee I, Sun S, Li J (2017b) Nonparametric sparse matrix decomposition for cross-view dimensionality reduction. IEEE Transactions on Multimedia 19(8):1848– 1859

    Article  Google Scholar 

  29. Nie X, Jing W, Cui C, Zhang CJ, Zhu L, Yin Y (2020) Joint multi-view hashing for large-scale near-duplicate video retrieval. IEEE Trans Knowl Data Eng 32(10):1951–1965

    Article  Google Scholar 

  30. Shen HT, Liu L, Yang Y, Xu X, Huang Z, Shen F, Hong R (2021) Exploiting subspace relation in semantic labels for cross-modal hashing. IEEE Trans Knowl Data Eng 33(10):3351–3365

    Article  Google Scholar 

  31. Kong W, Li WJ, Guo M (2012) Manhattan hashing for large-scale image retrieval. In: Proc of the 35rd Int ACM SIGIR conf research and development in information, pp 45–54

  32. Liu X, Nie X, Zhou Q, Nie L, Yin Y (2020) Model optimization boosting framework for linear model hash learning. IEEE Trans Image Process 29:4254–4268

    Article  Google Scholar 

  33. Hasan MM, Srizon AY, Sayeed A, Hasan MAM (2021) High performance classification of caltech-101 with a transfer learned deep convolutional neural network. In: Proc of the 2021 IEEE int conf information and communication technology for sustainable development (ICICT4SD), pp 35–39

  34. Gui J, Liu T, Sun Z, Tao D, Tan T (2018) Fast supervised fiscrete hashing. IEEE Trans Pattern Anal Mach Intell 40(2):490– 496

    Article  Google Scholar 

  35. Yan C, Bai X, Wang S, Zhou J, Hancock ER (2019) Cross-modal hashing with semantic deep embedding. Neurocomputing 337:58–66

    Article  Google Scholar 

Download references

Acknowledgments

The authors would like to thank the anonymous referees and the editors for their valuable comments and suggestions, helping to improve the paper significantly. This work was partially supported by the national NSF of China (NSFC) (61976195) and the NSF of Zhejiang Province (LY18F020019).

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Huawen Liu or Xin Chen.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article belongs to the Topical Collection: Special Issue on Multi-view Learning

Guest Editors: Guoqing Chao, Xingquan Zhu, Weiping Ding, Jinbo Bi and Shiliang Sun

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhou, W., Liu, H., Lou, J. et al. Locality sensitive hashing with bit selection. Appl Intell 52, 14724–14738 (2022). https://doi.org/10.1007/s10489-022-03546-9

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-022-03546-9

Keywords

Navigation