Hypergraph-Based Discrete Hashing Learning for Cross-Modal Retrieval

Tang, Dianjuan; Cui, Hui; Shi, Dan; Ji, Hua

doi:10.1007/978-3-030-00776-8_71

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 11164)

Included in the following conference series:

Pacific Rim Conference on Multimedia

Abstract

Hashing has drawn increasing attention in cross-modal retrieval due to its high computational efficiency and low storage cost. However, previous cross-modal hashing methods cannot effectively represent the correlations between paired multi-modal instances. In this paper, we propose a novel Hypergraph-based Discrete Hashing (BGDH) method to address this limitation. We formulate a unified unsupervised hashing framework that simultaneously performs hypergraph learning and hash code learning. Hypergraph learning effectively preserves intra-media similarity consistency. Furthermore, we propose an efficient discrete hash optimization method that learns the hash codes directly, avoiding quantization information loss. Extensive experiments on three benchmark datasets demonstrate the superior performance of the proposed approach compared with state-of-the-art cross-modal hashing techniques.
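
The abstract does not say how the hypergraph is constructed. Purely as an illustration of what "hypergraph learning" over one modality can look like, the sketch below assumes a construction that is common in the hypergraph-learning literature: each sample spawns one hyperedge containing itself and its k nearest neighbours, and similarity is then encoded by the normalized hypergraph Laplacian L = I - Dv^(-1/2) H W De^(-1) H^T Dv^(-1/2). The function names, the k-NN hyperedge rule, and the unit hyperedge weights are assumptions made for this example; they are not taken from the paper and may differ from the authors' formulation.

```python
import math


def knn_hypergraph_incidence(points, k):
    """Return an n x n incidence matrix H (assumed k-NN construction).

    Column j is the hyperedge centred on sample j: it contains sample j
    itself plus its k nearest neighbours under Euclidean distance.
    """
    n = len(points)
    H = [[0.0] * n for _ in range(n)]
    for j, centre in enumerate(points):
        # Sort all samples by distance to the centre; ties break by index.
        ranked = sorted((math.dist(centre, p), i) for i, p in enumerate(points))
        for _, i in ranked[:k + 1]:  # the centre (distance 0) plus k neighbours
            H[i][j] = 1.0
    return H


def hypergraph_laplacian(H, edge_weights=None):
    """Normalized hypergraph Laplacian I - Dv^-1/2 H W De^-1 H^T Dv^-1/2.

    Assumes every vertex lies in at least one hyperedge and every
    hyperedge is non-empty, so no degree is zero.
    """
    n, m = len(H), len(H[0])
    w = [1.0] * m if edge_weights is None else list(edge_weights)
    # Hyperedge degree: number of vertices in each hyperedge.
    d_e = [sum(H[i][j] for i in range(n)) for j in range(m)]
    # Vertex degree: total weight of the hyperedges containing each vertex.
    d_v = [sum(w[j] * H[i][j] for j in range(m)) for i in range(n)]
    L = [[0.0] * n for _ in range(n)]
    for a in range(n):
        for b in range(n):
            shared = sum(w[j] * H[a][j] * H[b][j] / d_e[j] for j in range(m))
            identity = 1.0 if a == b else 0.0
            L[a][b] = identity - shared / math.sqrt(d_v[a] * d_v[b])
    return L


# Two well-separated clusters of toy 2-D feature vectors (hypothetical data).
features = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1),
            (5.0, 5.0), (5.1, 5.0), (5.0, 5.1)]
H = knn_hypergraph_incidence(features, k=2)
L = hypergraph_laplacian(H)
```

With this toy data, samples from different clusters share no hyperedge, so the corresponding off-diagonal entries of L are zero. In hypergraph-regularized hashing generally, a term of the form tr(B L B^T) over the code matrix B is one way to encourage samples that share hyperedges to receive similar codes; whether the paper uses exactly this term is not stated in the abstract.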

Author information

Authors and Affiliations

School of Information Science and Engineering, Shandong Normal University, Jinan, 250014, China
Dianjuan Tang, Hui Cui, Dan Shi & Hua Ji

Corresponding author

Correspondence to Hua Ji.

Editor information

Editors and Affiliations

Hefei University of Technology, Hefei, China
Richang Hong
National Chiao Tung University, Hsinchu, Taiwan
Wen-Huang Cheng
University of Tokyo, Tokyo, Japan
Toshihiko Yamasaki
Hefei University of Technology, Hefei, China
Meng Wang
City University of Hong Kong, Hong Kong, Hong Kong
Chong-Wah Ngo

About this paper

Cite this paper

Tang, D., Cui, H., Shi, D., Ji, H. (2018). Hypergraph-Based Discrete Hashing Learning for Cross-Modal Retrieval. In: Hong, R., Cheng, WH., Yamasaki, T., Wang, M., Ngo, CW. (eds) Advances in Multimedia Information Processing – PCM 2018. PCM 2018. Lecture Notes in Computer Science, vol 11164. Springer, Cham. https://doi.org/10.1007/978-3-030-00776-8_71

DOI: https://doi.org/10.1007/978-3-030-00776-8_71
Published: 19 September 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00775-1
Online ISBN: 978-3-030-00776-8
eBook Packages: Computer Science, Computer Science (R0)
