Abstract
Hashing has been widely investigated for large-scale image retrieval due to its search effectiveness and computation efficiency. In this work, we propose a novel Semantic Guided Hashing method coupled with binary matrix factorization to perform more effective nearest neighbor image search by simultaneously exploring the weakly-supervised rich community-contributed information and the underlying data structures. To uncover the underlying semantic information from the weakly-supervised user-provided tags, the binary matrix factorization model is leveraged for learning the binary features of images while the problem of imperfect tags is well addressed. The uncovered semantic information enables to well guide the discrete hash code learning. The underlying data structures are discovered by adaptively learning a discriminative data graph, which makes the learned hash codes preserve the meaningful neighbors. To the best of our knowledge, the proposed method is the first work that incorporates the hash code learning, the semantic information mining and the data structure discovering into one unified framework. Besides, the proposed method is extended to one deep approach for the optimal compatibility of discriminative feature learning and hash code learning. Experiments are conducted on two widely-used social image datasets and the proposed method achieves encouraging performance compared with the state-of-the-art hashing methods.







Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Cao, Yue, Long, Mingsheng, Liu, Bin, & Wang, Jianmin (2018). Deep cauchy hashing for hamming space retrieval. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, (pp. 1229–1237).
Cao, Zhangjie, Long, Mingsheng, Wang, Jianmin, & Yu, Philip S. (2017). Hashnet: Deep learning to hash by continuation. In Proceedings of IEEE International Conference on Computer Vision, (pp. 5609–5618).
Dizaji, Kamran Ghasedi, Zheng, Feng, Sadoughi, Najmeh, Yang, Yanhua, Deng, Cheng, & Huang, Heng (2018). Unsupervised deep generative adversarial hashing network. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, (pp. 3664–3673).
Gong, Yunchao, Ke, Qifa, Isard, Michael, & Lazebnik, Svetlana. (2013). A multi-view embedding space for modeling internet images, tags, and their semantics. International Journal of Computer Vision, 106(2), 210–233.
Gong, Y., Lazebnik, S., Gordo, A., & Perronnin, F. (2013). Iterative quantization: A procrustean approach to learning binary codes for large-scale image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35(12), 2916–2929.
Gordo, Albert, Almazán, Jon, Revaud, Jérôme, & Larlus, Diane. (2017). End-to-end learning of deep visual representations for image retrieval. International Journal of Computer Vision, 124(2), 237–254.
Guan, Ziyu, Xie, Fei, Zhao, Wanqing, Wang, Xiaopeng, Chen, Long, Zhao, Wei, & Peng, Jinye (2018). Tag-based weakly-supervised hashing for image retrieval. In Proceedings of International Joint Conference on Artificial Intelligence, (pp. 3776–3782).
Gui, Jie, & Li, Ping (2018). R2SDH: robust rotated supervised discrete hashing. In Proceedings of ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, (pp. 1485–1493).
Gui, Jie, Liu, Tongliang, Sun, Zhenan, Tao, Dacheng, & Tan, Tieniu. (2018). Fast supervised discrete hashing. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40(2), 490–496.
Heo, Jae-Pil, Lee, Youngwoon, He, Junfeng, Chang, Shih-Fu, & Yoon, Sung eui (2012). Spherical hashing. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, (pp. 2957–2964).
Heo, Jae-Pil, Lee, Youngwoon, He, Junfeng, Chang, Shih-Fu, & Yoon, Sung-Eui. (2015). Spherical hashing: Binary code embedding with hyperspheres. IEEE Transactions on Pattern Analysis and Machine Intelligence, 37(11), 2304–2316.
Hu, Haifeng, Wang, Kun, Lv, Chenggang, Wu, Jiansheng, & Yang, Zhen. (2019). Semi-supervised metric learning-based anchor graph hashing for large-scale image retrieval. IEEE Transactions on Image Processing, 28(2), 739–754.
Huiskes, Mark J., & Lew, Michael S. (2008). The mir flickr retrieval evaluation. In Proceedings of ACM International Conference on Multimedia Information Retrieval, (pp. 39–43).
Indyk, Piotr, & Motwani, Rajeev (1998). Approximate nearest neighbors: Towards removing the curse of dimensionality. In Proceedings of ACM Symposium on Theory of Computing, (pp. 604–613).
Ji, Jianqiu, Li, Jianmin, Yan, Shuicheng, Zhang, Bo, & Tian, Qi (2012). Super-bit locality-sensitive hashing. In Proceedings of Advances in Neural Information Processing Systems, (pp. 108–116).
Jiang, Qing-Yuan, & Li, Wu-Jun (2018). Asymmetric deep supervised hashing. In Proceedings of AAAI Conference on Artificial Intelligence, (pp. 3342–3349).
Jiang, Qing-Yuan, Cui, Xue, & Li, Wu-Jun. (2018). Deep discrete supervised hashing. IEEE Transactions on Image Processing, 27(12), 5996–6009.
Jin, Lu, Li, Kai, Li, Zechao, Xiao, Fu, Qi, Guo-Jun, & Tang, Jinhui. (2019). Deep semantic-preserving ordinal hashing for cross-modal similarity search. IEEE Transactions on Neural Networks and Learning Systems, 30(5), 1429–1440.
Jin, Lu, Shu, Xiangbo, Li, Kai, Li, Zechao, Qi, Guo-Jun, & Tang, Jinhui. (2019). Deep ordinal hashing with spatial attention. IEEE Transactions on Image Processing, 28(5), 2173–2186.
Kan, Meina, Xu, Dong, Shan, Shiguang, & Chen, Xilin. (2014). Semisupervised hashing via kernel hyperplane learning for scalable image search. IEEE Transactions on Circuits and Systems for Video Technology, 24(4), 704–713.
Krizhevsky, Alex, Sutskever, Ilya, & Hinton, Geoffrey E. (2012). Imagenet classification with deep convolutional neural networks. In Proceedings of Advances in Neural Information Processing Systems, (pages 1106–1114).
Kulis, Brian, & Darrell, Trevor (2009). Learning to hash with binary reconstructive embeddings. In Proceedings of Advances in Neural Information Processing Systems, (pp. 1042–1050).
Kulis, Brian, & Grauman, Kristen (2009). Kernelized locality-sensitive hashing for scalable image search. In Proceedings of IEEE International Conference on Computer Vision, (pp. 2130–2137).
Li, Qi, Sun, Zhenan, He, Ran, & Tan, Tieniu (2017). Deep supervised discrete hashing. In Proceedings of Advances in Neural Information Processing Systems, (pp. 2479–2488).
Li, Wu-Jun, Wang, Sheng, & Kang, Wang-Cheng (2016). Feature learning based deep supervised hashing with pairwise labels. In Proceedings of International Joint Conference on Artificial Intelligence, (pp. 1711–1717).
Lin, Kevin, Yang, Huei-Fang, Hsiao, Jen-Hao, & Chen, Chu-Song (2015). Deep learning of binary hash codes for fast image retrieval. In Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, (pp. 4933–4941).
Liong, Venice Erin, Lu, Jiwen, Tan, Yap-Peng, & Zhou, Jie (2017). Cross-modal deep variational hashing. In Proceedings of IEEE International Conference on Computer Vision, (pp. 4097–4105).
Liong, Venice Erin, Lu, Jiwen, Wang, Gang, Moulin, Pierre, & Zhou, Jie (2015). Deep hashing for compact binary codes learning. In Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, (pp. 2475–2483).
Li, Zechao, & Tang, Jinhui. (2015). Weakly supervised deep metric learning for community-contributed image retrieval. IEEE Transactions on Multimedia, 17(11), 1989–1999.
Li, Zechao, & Tang, Jinhui. (2017). Weakly supervised deep matrix factorization for social image understanding. IEEE Transactions on Image Processing, 26(1), 276–288.
Li, Zechao, Tang, Jinhui, & Mei, Tao. (2019). Deep collaborative embedding for social image understanding. IEEE Transactions on Pattern Analysis and Machine Intelligence, 41(9), 2070–2083.
Liu, Haomiao, Wang, Ruiping, Shan, Shiguang, Chen, Xilin (2016). Deep supervised hashing for fast image retrieval. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, (pp. 2064–2072).
Liu, Wei, Wang, Jun, Kumar, Sanjiv, & Chang, Shih-Fu (2011). Hashing with graphs. In Proceedings of International Conference on Machine Learning, (pp. 1–8).
Liu, Xianglong, Deng, Cheng, Lang, Bo, Tao, Dacheng, & Li, Xuelong. (2016). Query-adaptive reciprocal hash tables for nearest neighbor search. IEEE Transactions on Image Processing, 25(2), 907–919.
Liu, Xianglong, He, Junfeng, & Chang, Shih-Fu. (2017). Hash bit selection for nearest neighbor search. IEEE Transactions on Image Processing, 26(11), 5367–5380.
Liu, Xianglong, Li, Zhujin, Deng, Cheng, & Tao, Dacheng. (2017). Distributed adaptive binary quantization for fast nearest neighbor search. IEEE Transactions on Image Processing, 26(11), 5324–5336.
Mandal, Devraj, Chaudhury, Kunal N., & Biswas, Soma. (2019). Generalized semantic preserving hashing for cross-modal retrieval. IEEE Transactions on Image Processing, 28(1), 102–112.
Raginsky, Maxim, & Lazebnik, Svetlana (2009). Locality-sensitive binary codes from shift-invariant kernels. In Proceedings of Advances in Neural Information Processing Systems, (pp. 1509–1517).
Shen, Fumin, Shen, Chunhua, Liu, Wei, & Shen, Heng Tao (2015). Supervised discrete hashing. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, (pp. 37–45).
Shen, Fumin, Shen, Chunhua, Shi, Qinfeng, Anton, van den Hengel, & Tang, Zhenmin (2013). Inductive hashing on manifolds. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, (pp. 1562–1569).
Sohn, Sungryull, Kim, Hyunwoo, & Kim, Junmo. (2017). Uncorrelated component analysis-based hashing. IEEE Transactions on Image Processing, 26(8), 3759–3774.
Tang, Jinhui, & Li, Zechao. (2018). Weakly supervised multimodal hashing for scalable social image retrieval. IEEE Transactions on on Circuits and Systems for Video Technology, 28(10), 2730–2741.
Tang, Jinhui, Lin, Jie, Li, Zechao, & Yang, Jian. (2018). Discriminative deep quantization hashing for face image retrieval. IEEE Transactions on Neural Networks and Learning Systtems, 29(12), 6154–6162.
Tang, Jinhui, Li, Zechao, Wang, Meng, & Zhao, Ruizhen. (2015). Neighborhood discriminant hashing for large-scale image retrieval. IEEE Transactions on Image Processing, 24(9), 2827–2840.
Tang, Jinhui, Li, Zechao, & Zhu, Xiang. (2018). Supervised deep hashing for scalable face image retrieval. Pattern Recognition, 75, 25–32.
Tang, Jinhui, Shu, Xiangbo, Li, Zechao, Jiang, Yu-Gang, & Tian, Qi. (2019). Social anchor-unit graph regularized tensor completion for large-scale image retagging. IEEE Transactions on Pattern Analysis and Machine Intelligence, 41(8), 2027–2034.
Tang, Jinhui, Shu, Xiaongbo, Li, Zechao, Qi, Guo-Jun, & Wang, Jingdong. (2016). Generalized deep transfer networks for knowledge propagation in heterogeneous domains. ACM Transactions on Multimedia Computing Communications and Applications, 12(4s), 1–22.
Tang, Jinhui, Shu, Xiangbo, Qi, Guo-Jun, Li, Zechao, Wang, Meng, Yan, Shuicheng, et al. (2017). Tri-clustered tensor completion for social-aware image tag refinement. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39(8), 1662–1674.
Venkateswara, Hemanth, Eusebio, Jose, Chakraborty, Shayok, & Panchanathan, Sethuraman (2017). Deep hashing network for unsupervised domain adaptation. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, (pp. 5385–5394).
Wang, Daixin, Cui, Peng, Ou, Mingdong, & Zhu, Wenwu (2015). Deep multimodal hashing with orthogonal units. In Proceedings of International Joint Conference on Artificial Intelligence, (pp. 2291–2297).
Wang, Jun, Kumar, Sanjiv, & Chang, Shih-Fu. (2012). Semi-supervised hashing for large scale search. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34(12), 2393–2406.
Wang, Jingdong, Zhang, Ting, Song, Jingkuan, Sebe, Nicu, & Shen, Heng Tao. (2018). A survey on learning to hash. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40(4), 769–790.
Weiss, Yair, Torralba, Antonio, & Fergus, Robert (2008). Spectral hashing. In Proceedings of Advances in Neural Information Processing Systems, (pp. 1753–1760).
Wu, Fei, Yu, Zhou, Yi, Tang, Siliang, Zhang, & Yin, & Zhuang, Yueting., (2014). Sparse multi-modal hashing. IEEE Transactions on Multimedia, 16(2), 424–439.
Xia, Rongkai, Pan, Yan, Lai, Hanjiang, Liu, Cong, & Yan, Shuicheng (2014). Supervised hashing for image retrieval via image representation learning. In Proceedings of AAAI Conference on Artificial Intelligence, (pp. 2156–2162).
Zhai, Deming, Liu, Xianming, Ji, Xiangyang, Zhao, Debin, Satoh, Shin’ichi, & Gao, Wen. (2018). Supervised distributed hashing for large-scale multimedia retrieval. IEEE Transactions on Multimedia, 20(3), 675–686.
Zhang, Dell, Wang, Jun, Cai, Deng, & Lu, Jinsong (2012). Self-taught hashing for fast similarity search. In Proceedings of ACM SIGIR Conference on Research and Development in Information Retrieval, (pp. 18–25).
Zhang, Dongqing, & Li, Wu-Jun (2014). Large-scale supervised multimodal hashing with semantic correlation maximization. In Proceedings of AAAI Conference on Artificial Intelligence, (pp. 143–152).
Zhang, Zhongyuan, Li, Tao, Ding, Chris H. Q., & Zhang, Xiang-Sun (2007). Binary matrix factorization with applications. In Proceedings of IEEE International Conference on Data Mining, (pp. 391–400).
Zhang, Ziming, Chen, Yuting, & Saligrama, Venkatesh (2016). Efficient training of very deep neural networks for supervised hashing. In Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, (pp. 1487–1495).
Zhang, Haofeng, Liu, Li, Long, Yang, & Shao, Ling. (2018). Unsupervised deep hashing with pseudo labels for scalable image retrieval. IEEE Transactions on Image Processing, 27(4), 1626–1638.
Zhu, Han, Long, Mingsheng, Wang, Jianmin, & Cao, Yue (2016). Deep hashing network for efficient similarity retrieval. In Proceedings of AAAI Conference on Artificial Intelligence, (pp. 2415–2421).
Zhu, Xiaofeng, Huang, Zi, Shen, Heng Tao, & Zhao, Xin (2013). Linear cross-modal hashing for efficient multimedia search. In Proceedings of ACM International Conference on Multimedia, (pp. 143–152).
Zhuang, Yueting, Liu, Yang, Wu, Fei, Zhang, Yin, & Shao, Jian (2011). Hypergraph spectral hashing for similarity search of social image. In Proceedings of ACM International Conference on Multimedia, (pp. 1457–1460).
Acknowledgements
This work was partially supported by the National Key R&D Program of China under Grant 2018AAA0102002, the National Natural Science Foundation of China (Grant No. 61772275, 61732007, 61772268 and 61672285) and the Natural Science Foundation of Jiangsu Province (Grant BK20170033).
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Li Liu, Matti Pietikäinen, Jie Qin, Jie Chen, Wanli Ouyang, Luc Van Gool.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Li, Z., Tang, J., Zhang, L. et al. Weakly-supervised Semantic Guided Hashing for Social Image Retrieval. Int J Comput Vis 128, 2265–2278 (2020). https://doi.org/10.1007/s11263-020-01331-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11263-020-01331-0