Abstract
Hashing is a widely used technique for large-scale approximate nearest neighbor searching in multimedia retrieval. Recent works have proved that using deep neural networks is a promising solution for learning both feature representation and hash codes. However, most existing deep hashing methods directly learn hash codes from a convolutional neural network, ignoring the spatial importance distribution of images. The loss of spatial importance negatively affects the performance of hash learning and thus reduces its accuracy. To address this issue, we propose a new deep hashing method with weighted spatial information, which generates hash codes by using discrete spatial importance distribution. In particular, to extract the discrete spatial importance information of images effectively, we propose a method to learn the spatial attention map and hash code simultaneously, which makes the spatial attention map more conductive to hash-based retrieval. The experimental results of three widely used datasets show that the proposed deep weighted hashing method is superior to the state-of-the-art hashing method.
Y. Yin—This work was supported in part by the National Natural Science Foundation of China (61671274, 61876098), National Key R & D Program of China (2018YFC0830100, 2018YFC0830102) and special funds for distinguished professors of Shandong Jianzhu University.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: Neural Information Processing Systems, pp. 1097–1105 (2012)
Liu, H., Wang, R., Shan, S., Chen, X.: Deep supervised hashing for fast image retrieval. In: Computer Vision and Pattern Recognition, pp. 2064–2072 (2016)
Liong, V.E., Lu, J., Tan, Y.P., Zhou, J.: Deep video hashing. IEEE Trans. Multimed. 19, 1209–1219 (2017)
Long, C., Zhang, H., Xiao, J., Nie, L., Chua, T.S.: SCA-CNN: spatial and channel-wise attention in convolutional networks for image captioning. In: Computer Vision and Pattern Recognition, pp. 6298–6306 (2017)
Nie, X., Li, X., Chai, Y., Cui, C., Xi, X., Yin, Y.: Robust image fingerprinting based on feature point relationship mining. IEEE Trans. Inf. Forensics Secur. 13, 1509–1523 (2018)
Nie, X., Jing, W., Cui, C., Zhang, J., Zhu, L., Yin, Y.: Joint multi-view hashing for large-scale near-duplicate video retrieval. IEEE Trans. Knowl. Data Eng. 32, 1951–1965 (2019)
Nie, X., Yin, Y., Sun, J., Liu, J., Cui, C.: Comprehensive feature-based robust video fingerprinting using tensor model. IEEE Trans. Multimed. 19, 785–796 (2016)
Zhou, B., Khosla, A., Lapedriza, A., Oliva, A., Torralba, A.: Learning deep features for discriminative localization. In: Computer Vision and Pattern Recognition, pp. 2921–2929 (2015)
Shelhamer, E., Long, J., Darrell, T.: Fully convolutional networks for semantic segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39, 640–651 (2017)
Oquab, M., Bottou, L., Laptev, I., Sivic, J.: Learning and transferring mid-level image representations using convolutional neural networks. In: Computer Vision and Pattern Recognition, pp. 1717–1724 (2014)
Wang, J., Zhang, T., Song, J., Sebe, N., Shen, H.T.: A survey on learning to hash. IEEE Trans. Pattern Anal. Mach. Intell. 40, 769–790 (2018)
Andoni, A., Indyk, P.: Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions. Commun. ACM 51, 117–122 (2008)
Liuz, Y., Cuiz, J., Huang, Z., Liz, H., Shenx, H.T.: SK-LSH: an efficient index structure for approximate nearest neighbor search. Proc. VLDB Endow. 7, 745–756 (2014)
Wang, J., Liu, W., Kumar, S., Chang, S.F.: Learning to hash for indexing big data - a survey. Proc. IEEE 104, 34–57 (2016)
Weiss, Y., Torralba, A., Fergus, R.: Spectral hashing. In: International Conference on Neural Information Processing Systems, pp. 1753–1760 (2008)
Yunchao, G., Svetlana, L., Albert, G., Florent, P.: Iterative quantization: a procrustean approach to learning binary codes for large-scale image retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 35, 2916–2929 (2013)
Rastegari, M., Farhadi, A., Forsyth, D.: Attribute discovery via predictable discriminative binary codes. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, vol. 7577, pp. 876–889. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-33783-3_63
Norouzi, M., Blei, D.M.: Minimal loss hashing for compact binary codes. In: Proceedings of the 28th International Conference on Machine Learning (ICML-11), pp. 353–360. Citeseer (2011)
Wang, J., Kumar, S., Chang, S.: Semi-supervised hashing for large-scale search. IEEE Trans. Pattern Anal. Mach. Intell. 34, 2393–2406 (2012)
Liu, W., Wang, J., Ji, R., Jiang, Y.G., Chang, S.F.: Supervised hashing with kernels. In: Computer Vision and Pattern Recognition, pp. 2074–2081. IEEE (2012)
Kulis, B., Darrell, T.: Learning to hash with binary reconstructive embeddings. In: Advances in Neural Information Processing Systems, pp. 1042–1050 (2009)
Liong, V.E., Lu, J., Gang, W., Moulin, P., Jie, Z.: Deep hashing for compact binary codes learning. In: Computer Vision and Pattern Recognition, pp. 2475–2483 (2015)
Fang, Z., Huang, Y., Liang, W., Tan, T.: Deep semantic ranking based hashing for multi-label image retrieval. In: Computer Vision and Pattern Recognition, pp. 1556–1564 (2015)
Lai, H., Pan, Y., Liu, Y., Yan, S.: Simultaneous feature learning and hash coding with deep neural networks. In: Computer Vision and Pattern Recognition, pp. 3270–3278 (2015)
Lu, J., Liong, V.E., Zhou, J.: Deep hashing for scalable image search. IEEE Trans. Image Process. 26, 2352–2367 (2017)
Yang, H., Lin, K., Chen, C.: Supervised learning of semantics-preserving hash via deep convolutional neural networks. IEEE Trans. Pattern Anal. Mach. Intell. 40, 437–451 (2018)
Li, W.J., Wang, S., Kang, W.C.: Feature learning based deep supervised hashing with pairwise labels. In: International Joint Conference on Artificial Intelligence, pp. 1711–1717 (2016)
Li, Q., Sun, Z., He, R., Tan, T.: Deep supervised discrete hashing. In: Advances in Neural Information Processing Systems, pp. 2479–2488 (2017)
Cao, Y., Long, M., Liu, B., Wang, J.: Deep Cauchy hashing for hamming space retrieval. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1229–1237 (2018)
Jiang, Q.Y., Cui, X., Li, W.J.: Deep discrete supervised hashing. IEEE Trans. Image Process. 27, 5996–6009 (2018)
Jiang, Q.Y., Li, W.J.: Asymmetric deep supervised hashing. In: Thirty-Second AAAI Conference on Artificial Intelligence, pp. 3342–3349 (2018)
Jin, L., Shu, X., Li, K., Li, Z., Qi, G., Tang, J.: Deep ordinal hashing with spatial attention. IEEE Trans. Image Process. 28, 2173–2186 (2019)
You, Q., Jin, H., Wang, Z., Chen, F., Luo, J.: Image captioning with semantic attention. In: Computer Vision and Pattern Recognition, pp. 4651–4659 (2016)
Xu, K., et al.: Show, attend and tell: neural image caption generation with visual attention. Comput. Sci. 2048–2057 (2015)
Chatfield, K., Simonyan, K., Vedaldi, A., Zisserman, A.: Return of the devil in the details: delving deep into convolutional nets. arXiv preprint arXiv:1405.3531 (2014)
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. Comput. Sci. (2014)
Zhang, P., Zhang, W., Li, W.J., Guo, M.: Supervised hashing with latent factor models. In: International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 173–182 (2014)
Krizhevsky, A., Hinton, G.: Learning multiple layers of features from tiny images. Technical report, Citeseer (2009)
Chua, T.S., Tang, J., Hong, R., Li, H., Luo, Z., Zheng, Y.: NUS-WIDE: a real-world web image database from National University of Singapore. In: Proceedings of the ACM International Conference on Image and Video Retrieval, p. 48. ACM (2009)
Lin, T.-Y., et al.: Microsoft COCO: common objects in context. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8693, pp. 740–755. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1_48
Wang, X.J., Zhang, L., Jing, F., Ma, W.Y.: AnnoSearch: image auto-annotation by search. Comput. Vis. Pattern Recogn. 2, 1483–1490 (2006)
Luo, X., Nie, L., He, X., Wu, Y., Chen, Z.D., Xu, X.S.: Fast scalable supervised hashing. In: International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 735–744. ACM (2018)
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Shi, Y., Nie, X., Zhou, Q., Xi, X., Yin, Y. (2021). Discrete Spatial Importance-Based Deep Weighted Hashing. In: Ishikawa, H., Liu, CL., Pajdla, T., Shi, J. (eds) Computer Vision – ACCV 2020. ACCV 2020. Lecture Notes in Computer Science(), vol 12624. Springer, Cham. https://doi.org/10.1007/978-3-030-69535-4_23
Download citation
DOI: https://doi.org/10.1007/978-3-030-69535-4_23
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-69534-7
Online ISBN: 978-3-030-69535-4
eBook Packages: Computer ScienceComputer Science (R0)