Abstract
Deep learning based hashing methods have been proven to be effective in the field of image retrieval recently. Among them, most high-performance methods are supervised frameworks, which require annotated labels by humans. Considering the difficulty of labeling large-scale image datasets, unsupervised methods, which just need images themselves for training, are more suitable for practical applications. However, how to improve the discriminative ability of hash codes generated by unsupervised models still remains as a challenging problem. In this paper, we present a novel deep framework called Unsupervised Deep Triplet Hashing (UDTH) for scalable image retrieval. UDTH builds pseudo triplets based on the neighborhood structure in the high-dimensional visual feature space, and then solves two problems through the proposed objective function: 1) Triplet network is utilized to maximize the distance between different classes of binary representation; 2) Autoencoder and Binary quantization are exploited to learn hash codes which maintain the structural information of original samples. Extensive experiments on the datasets of CIFAR-10, NUS-WIDE and MIRFLICKR-25K are conducted, and the results show that our proposed UDTH is superior to the state-of-the-art methods.











Similar content being viewed by others
References
Bengio Y, Lamblin P, Popovici D, Larochelle H (2007) Greedy layer-wise training of deep networks. In: Advances in neural information processing systems, pp 153–160
Cao Z, Long M, Wang J, Yu PS (2017) Hashnet: Deep learning to hash by continuation. In: Proceedings of the IEEE International Conference on Computer Vision, pp 5608–5617
Chua T, Tang J, Hong R, Li H, Luo Z, Zheng Y (2009) Nus-wide: a real-world web image database from national university of singapore. In: ACM Multimedia, pp 48–56
Datar M, Immorlica N, Indyk P, Mirrokni VS (2004) Locality-sensitive hashing scheme based on p-stable distributions. In: Proceedings of the twentieth annual symposium on Computational geometry, pp 253–262
Ding S, Lin L, Wang G, Chao H (2015) Deep feature learning with relative distance comparison for person re-identification. Pattern Recogn 48(10):2993–3003
Do TT, Doan AD, Cheung NM (2016) Learning to hash with binary deep neural network. In: European Conference on Computer Vision, pp 219–234
Do TT, Tan DKL, Hoang T, Cheung NM (2016) Learning to hash with binary deep neural network. In: European Conference on Computer Vision, pp 219–234
Erin Liong V, Lu J, Wang G, Moulin P, Zhou J (2015) Deep hashing for compact binary codes learning. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2475–2483
Gong Y, Lazebnik S, Gordo A, Perronnin F (2013) Iterative quantization: a procrustean approach to learning binary codes for large-scale image retrieval. IEEE Trans Pattern Anal Mach Intell 35(12):2916–2929
Gong M, Zhao J, Liu J, Miao Q, Jiao L (2016) Change detection in synthetic aperture radar images based on deep neural networks. IEEE Trans Neural Netw Learn Syst 27(1):125–138
Heo JP, Lee Y, He J, Chang S, Yoon SE (2012) Spherical hashing. In: 2012 IEEE conference on Computer vision and pattern recognition (CVPR), pp 2957–2964
Hoffer E, Ailon N (2015) Deep metric learning using triplet network. In: International workshop on similarity-based pattern recognition, pp 84–92
Huang S, Xiong Y, Zhang Y, Wang J (2017) Unsupervised triplet hashing for fast image retrieval, pp 84–92
Huang S, Xiong Y, Zhang Y, Wang J (2017) Unsupervised triplet hashing for fast image retrieval. In: Proceedings of the on Thematic Workshops of ACM Multimedia 2017, pp 84–92
Huiskes MJ, Thomee B, Lew MS (2010) New trends and ideas in visual concept detection: the mir flickr retrieval evaluation initiative. In: Proceedings of the international conference on Multimedia information retrieval, pp 527–536
Krizhevsky A, Hinton G (2009) Learning multiple layers of features from tiny images. Technical report, University of Toronto
Li Q, Sun Z, He R, Tan T (2017) Deep supervised discrete hashing. In: Advances in neural information processing systems, pp 2482–2491
Lin K, Lu J, Chen C, Zhou J (2016) Learning compact binary descriptors with unsupervised deep neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 1183–1192
Liu W, Wang J, Ji R, Jiang YG, Chang S (2012) Supervised hashing with kernels. In: 2012 IEEE conference on Computer vision and pattern recognition (CVPR), pp 2074–2081
Liu W, Wang J, Ji R, Jiang YG, Chang S (2012) Supervised hashing with kernels. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 2074–2081
Liu W, Kumar S, Kumar S, Chang S (2014) Discrete graph hashing. In: Advances in neural information processing systems
Liu H, Wang R, Shan S, Chen X (2017) Learning multifunctional binary codes for both category and attribute oriented retrieval tasks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 3901–3910
Maaten Lvd, Hinton G (2008) Visualizing data using t-sne. J Mach Learn Res 9(Nov):2579–2605
Shen F, Shen C, Liu W, Shen H (2015) Supervised discrete hashing. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 37–45
Shen F, Zhou X, Yang Y, Song J, Shen HT, Tao D (2016) A fast optimization method for general binary code learning. IEEE Trans Image Process 25 (12):5610–5621
Shen F, Yang Y, Liu L, Liu W, Tao D, Shen HT (2017) Asymmetric binary coding for image search. IEEE Trans Multimed 19(9):2022–2032
Shen F, Xu Y, Liu L, Yang Y, Huang Z, Shen HT (2018) Unsupervised deep hashing with similarity-adaptive and discrete optimization. IEEE Trans Pattern Anal Mach Intell 40(12):3034–3044
Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. In: International conference on learning representations
Song J, Gao L, Yan Y, Zhang D, Sebe N (2015) Supervised hashing with pseudo labels for scalable multimedia retrieval. In: ACM Multimedia, pp 827–830
Song J, He T, Fan H, Gao L (2017) Deep discrete hashing with self-supervised pairwise labels. In: Joint european conference on machine learning and knowledge discovery in databases, pp 223–238
Song J, Gao L, Liu L, Zhu X, Sebe N (2018) Quantization-based hashing: a general framework for scalable image and video retrieval. Pattern Recogn 75:175–187
Vincent P, Larochelle H, Bengio Y, Manzagol PA (2008) Extracting and composing robust features with denoising autoencoders. In: Proceedings of the 25th international conference on Machine learning, pp 1096–1103
Wang J, Kumar S, Chang S (2012) Semi-supervised hashing for large-scale search. IEEE Trans Pattern Anal Mach Intell 34(12):2393–2406
Weiss Y, Torralba A, Fergus R (2009) Spectral hashing. In: Advances in neural information processing systems, pp 1753–1760
Yan H, Ye Q, Zhang TA, Yu D, Yuan X, Xu Y, Fu L (2018) Least squares twin bounded support vector machines based on l1-norm distance metric for classification. Pattern Recogn 74:434–447
Yang H, Lin K, Chen C (2018) Supervised learning of semantics-preserving hash via deep convolutional neural networks. IEEE Trans Pattern Anal Mach Intell 40 (2):437–451
Yang W, Li J, Zheng H, Da Xu RY (2018) A nuclear norm based matrix regression based projections method for feature extraction. IEEE Access 6:7445–7451
Ye Q, Yang J, Liu F, Zhao C, Ye N, Yin T (2018) L1-norm distance linear discriminant analysis based on an effective iterative algorithm. IEEE Trans Circ Syst Video Technol 28(1):114–129
Zhang H, Liu L, Long Y, Shao L (2018) Unsupervised deep hashing with pseudo labels for scalable image retrieval. IEEE Trans Image Process 27(4):1626–1638
Zhang H, Long Y, Shao L (2018) Zero-shot hashing with orthogonal projection for image retrieval. Pattern Recognition Letters
Zhang H, Long Y, Guan Y, Shao L (2019) Triple verification network for generalized zero-shot learning. IEEE Trans Image Process 28(1):506–517
Zhang H, Long Y, Yang W, Shao L (2019) Dual-verification network for zero-shot learning. Inf Sci 470:43–57
Zheng W (2017) Multichannel eeg-based emotion recognition via group sparse canonical correlation analysis. IEEE Trans Cogn Dev Syst 9(3):281–290
Zuo Z, Yang L, Peng Y, Fei C, Qu Y (2018) Gaze-informed egocentric action recognition for memory aid systems. IEEE Access 6(99):12894–12904
Zuo Z, Wei B, Chao F, Qu Y, Peng Y, Yang L (2019) Enhanced gradient-based local feature descriptors by saliency map for egocentric action recognition. Appl Syst Innov 2(1):7
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This work was supported by National Natural Science Foundation of China (No.61872187, No.61871444).
Rights and permissions
About this article
Cite this article
Gu, Y., Zhang, H., Zhang, Z. et al. Unsupervised deep triplet hashing with pseudo triplets for scalable image retrieval. Multimed Tools Appl 79, 35253–35274 (2020). https://doi.org/10.1007/s11042-019-7687-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-019-7687-0