Skip to main content
Log in

Unsupervised deep triplet hashing with pseudo triplets for scalable image retrieval

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Deep learning based hashing methods have been proven to be effective in the field of image retrieval recently. Among them, most high-performance methods are supervised frameworks, which require annotated labels by humans. Considering the difficulty of labeling large-scale image datasets, unsupervised methods, which just need images themselves for training, are more suitable for practical applications. However, how to improve the discriminative ability of hash codes generated by unsupervised models still remains as a challenging problem. In this paper, we present a novel deep framework called Unsupervised Deep Triplet Hashing (UDTH) for scalable image retrieval. UDTH builds pseudo triplets based on the neighborhood structure in the high-dimensional visual feature space, and then solves two problems through the proposed objective function: 1) Triplet network is utilized to maximize the distance between different classes of binary representation; 2) Autoencoder and Binary quantization are exploited to learn hash codes which maintain the structural information of original samples. Extensive experiments on the datasets of CIFAR-10, NUS-WIDE and MIRFLICKR-25K are conducted, and the results show that our proposed UDTH is superior to the state-of-the-art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  1. Bengio Y, Lamblin P, Popovici D, Larochelle H (2007) Greedy layer-wise training of deep networks. In: Advances in neural information processing systems, pp 153–160

  2. Cao Z, Long M, Wang J, Yu PS (2017) Hashnet: Deep learning to hash by continuation. In: Proceedings of the IEEE International Conference on Computer Vision, pp 5608–5617

  3. Chua T, Tang J, Hong R, Li H, Luo Z, Zheng Y (2009) Nus-wide: a real-world web image database from national university of singapore. In: ACM Multimedia, pp 48–56

  4. Datar M, Immorlica N, Indyk P, Mirrokni VS (2004) Locality-sensitive hashing scheme based on p-stable distributions. In: Proceedings of the twentieth annual symposium on Computational geometry, pp 253–262

  5. Ding S, Lin L, Wang G, Chao H (2015) Deep feature learning with relative distance comparison for person re-identification. Pattern Recogn 48(10):2993–3003

    Article  Google Scholar 

  6. Do TT, Doan AD, Cheung NM (2016) Learning to hash with binary deep neural network. In: European Conference on Computer Vision, pp 219–234

  7. Do TT, Tan DKL, Hoang T, Cheung NM (2016) Learning to hash with binary deep neural network. In: European Conference on Computer Vision, pp 219–234

  8. Erin Liong V, Lu J, Wang G, Moulin P, Zhou J (2015) Deep hashing for compact binary codes learning. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2475–2483

  9. Gong Y, Lazebnik S, Gordo A, Perronnin F (2013) Iterative quantization: a procrustean approach to learning binary codes for large-scale image retrieval. IEEE Trans Pattern Anal Mach Intell 35(12):2916–2929

    Article  Google Scholar 

  10. Gong M, Zhao J, Liu J, Miao Q, Jiao L (2016) Change detection in synthetic aperture radar images based on deep neural networks. IEEE Trans Neural Netw Learn Syst 27(1):125–138

    Article  MathSciNet  Google Scholar 

  11. Heo JP, Lee Y, He J, Chang S, Yoon SE (2012) Spherical hashing. In: 2012 IEEE conference on Computer vision and pattern recognition (CVPR), pp 2957–2964

  12. Hoffer E, Ailon N (2015) Deep metric learning using triplet network. In: International workshop on similarity-based pattern recognition, pp 84–92

  13. Huang S, Xiong Y, Zhang Y, Wang J (2017) Unsupervised triplet hashing for fast image retrieval, pp 84–92

  14. Huang S, Xiong Y, Zhang Y, Wang J (2017) Unsupervised triplet hashing for fast image retrieval. In: Proceedings of the on Thematic Workshops of ACM Multimedia 2017, pp 84–92

  15. Huiskes MJ, Thomee B, Lew MS (2010) New trends and ideas in visual concept detection: the mir flickr retrieval evaluation initiative. In: Proceedings of the international conference on Multimedia information retrieval, pp 527–536

  16. Krizhevsky A, Hinton G (2009) Learning multiple layers of features from tiny images. Technical report, University of Toronto

  17. Li Q, Sun Z, He R, Tan T (2017) Deep supervised discrete hashing. In: Advances in neural information processing systems, pp 2482–2491

  18. Lin K, Lu J, Chen C, Zhou J (2016) Learning compact binary descriptors with unsupervised deep neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 1183–1192

  19. Liu W, Wang J, Ji R, Jiang YG, Chang S (2012) Supervised hashing with kernels. In: 2012 IEEE conference on Computer vision and pattern recognition (CVPR), pp 2074–2081

  20. Liu W, Wang J, Ji R, Jiang YG, Chang S (2012) Supervised hashing with kernels. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 2074–2081

  21. Liu W, Kumar S, Kumar S, Chang S (2014) Discrete graph hashing. In: Advances in neural information processing systems

  22. Liu H, Wang R, Shan S, Chen X (2017) Learning multifunctional binary codes for both category and attribute oriented retrieval tasks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 3901–3910

  23. Maaten Lvd, Hinton G (2008) Visualizing data using t-sne. J Mach Learn Res 9(Nov):2579–2605

    MATH  Google Scholar 

  24. Shen F, Shen C, Liu W, Shen H (2015) Supervised discrete hashing. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 37–45

  25. Shen F, Zhou X, Yang Y, Song J, Shen HT, Tao D (2016) A fast optimization method for general binary code learning. IEEE Trans Image Process 25 (12):5610–5621

    Article  MathSciNet  Google Scholar 

  26. Shen F, Yang Y, Liu L, Liu W, Tao D, Shen HT (2017) Asymmetric binary coding for image search. IEEE Trans Multimed 19(9):2022–2032

    Article  Google Scholar 

  27. Shen F, Xu Y, Liu L, Yang Y, Huang Z, Shen HT (2018) Unsupervised deep hashing with similarity-adaptive and discrete optimization. IEEE Trans Pattern Anal Mach Intell 40(12):3034–3044

    Article  Google Scholar 

  28. Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. In: International conference on learning representations

  29. Song J, Gao L, Yan Y, Zhang D, Sebe N (2015) Supervised hashing with pseudo labels for scalable multimedia retrieval. In: ACM Multimedia, pp 827–830

  30. Song J, He T, Fan H, Gao L (2017) Deep discrete hashing with self-supervised pairwise labels. In: Joint european conference on machine learning and knowledge discovery in databases, pp 223–238

  31. Song J, Gao L, Liu L, Zhu X, Sebe N (2018) Quantization-based hashing: a general framework for scalable image and video retrieval. Pattern Recogn 75:175–187

    Article  Google Scholar 

  32. Vincent P, Larochelle H, Bengio Y, Manzagol PA (2008) Extracting and composing robust features with denoising autoencoders. In: Proceedings of the 25th international conference on Machine learning, pp 1096–1103

  33. Wang J, Kumar S, Chang S (2012) Semi-supervised hashing for large-scale search. IEEE Trans Pattern Anal Mach Intell 34(12):2393–2406

    Article  Google Scholar 

  34. Weiss Y, Torralba A, Fergus R (2009) Spectral hashing. In: Advances in neural information processing systems, pp 1753–1760

  35. Yan H, Ye Q, Zhang TA, Yu D, Yuan X, Xu Y, Fu L (2018) Least squares twin bounded support vector machines based on l1-norm distance metric for classification. Pattern Recogn 74:434–447

    Article  Google Scholar 

  36. Yang H, Lin K, Chen C (2018) Supervised learning of semantics-preserving hash via deep convolutional neural networks. IEEE Trans Pattern Anal Mach Intell 40 (2):437–451

    Article  Google Scholar 

  37. Yang W, Li J, Zheng H, Da Xu RY (2018) A nuclear norm based matrix regression based projections method for feature extraction. IEEE Access 6:7445–7451

    Article  Google Scholar 

  38. Ye Q, Yang J, Liu F, Zhao C, Ye N, Yin T (2018) L1-norm distance linear discriminant analysis based on an effective iterative algorithm. IEEE Trans Circ Syst Video Technol 28(1):114–129

    Article  Google Scholar 

  39. Zhang H, Liu L, Long Y, Shao L (2018) Unsupervised deep hashing with pseudo labels for scalable image retrieval. IEEE Trans Image Process 27(4):1626–1638

    Article  MathSciNet  Google Scholar 

  40. Zhang H, Long Y, Shao L (2018) Zero-shot hashing with orthogonal projection for image retrieval. Pattern Recognition Letters

  41. Zhang H, Long Y, Guan Y, Shao L (2019) Triple verification network for generalized zero-shot learning. IEEE Trans Image Process 28(1):506–517

    Article  MathSciNet  Google Scholar 

  42. Zhang H, Long Y, Yang W, Shao L (2019) Dual-verification network for zero-shot learning. Inf Sci 470:43–57

    Article  MathSciNet  Google Scholar 

  43. Zheng W (2017) Multichannel eeg-based emotion recognition via group sparse canonical correlation analysis. IEEE Trans Cogn Dev Syst 9(3):281–290

    Article  Google Scholar 

  44. Zuo Z, Yang L, Peng Y, Fei C, Qu Y (2018) Gaze-informed egocentric action recognition for memory aid systems. IEEE Access 6(99):12894–12904

    Article  Google Scholar 

  45. Zuo Z, Wei B, Chao F, Qu Y, Peng Y, Yang L (2019) Enhanced gradient-based local feature descriptors by saliency map for egocentric action recognition. Appl Syst Innov 2(1):7

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Haofeng Zhang.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This work was supported by National Natural Science Foundation of China (No.61872187, No.61871444).

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Gu, Y., Zhang, H., Zhang, Z. et al. Unsupervised deep triplet hashing with pseudo triplets for scalable image retrieval. Multimed Tools Appl 79, 35253–35274 (2020). https://doi.org/10.1007/s11042-019-7687-0

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-019-7687-0

Keywords

Navigation