Abstract
Hashing has drawn increasing attention in cross-modal retrieval due to its high computation efficiency and low storage cost. However, there is a certain lack in the previous cross-modal hashing methods that they can not effectively represent the correlations between paired multi-modal instances. In this paper, we propose a novel Hypergraph-based Discrete Hashing (BGDH) to solve the limitation. We formulate a unified unsupervised hashing framework which simultaneously performs hypergraph learning and hash codes learning. Hypergraph learning can effectively preserve the intra-media similarity consistency. Furthermore, we propose an efficient discrete hash optimization method to directly learn the hash codes without quantization information loss. Extensive experiments on three benchmark datasets demonstrate the superior performance of the proposed approach, compared with state-of-the-art cross-modal hashing techniques.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. Arch. 3, 993–1022 (2003)
Cheng, Z., Shen, J., Zhu, L., Kankanhalli, M.S., Nie, L.: Exploiting music play sequence for music recommendation. In: Proceedings of the Joint Conference on Artificial Intelligence (IJCAI), pp. 3654–3660 (2017). https://doi.org/10.24963/ijcai.2017/511
Ding, G., Guo, Y., Zhou, J.: Collective matrix factorization hashing for multimodal data. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2083–2090 (2014). https://doi.org/10.1109/CVPR.2014.267
Gao, Y., Wang, M., Zha, Z., Shen, J., Li, X., Wu, X.: Visual-textual joint relevance learning for tag-based social image search. IEEE Trans. Image Process. 22(1), 363–376 (2013). https://doi.org/10.1109/TIP.2012.2202676
Kumar, S., Udupa, R.: Learning hash functions for cross-view similarity search. In: Proceedings of the Joint Conference on Artificial Intelligence (IJCAI), pp. 1360–1365 (2011). https://doi.org/10.5591/978-1-57735-516-8/IJCAI11-230
Lin, Z., Chen, M., Ma, Y.: The augmented lagrange multiplier method for exact recovery of corrupted low-rank matrices. CoRR 1009.5055 (2010)
Lin, Z., Ding, G., Hu, M., Wang, J.: Semantics-preserving hashing for cross-view retrieval. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3864–3872 (2015). https://doi.org/10.1109/CVPR.2015.7299011
Liong, V.E., Lu, J., Tan, Y.: Cross-modal discrete hashing. Pattern Recognit. 79, 114–129 (2018). https://doi.org/10.1016/j.patcog.2018.02.002
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
Song, J., Yang, Y., Yang, Y., Huang, Z., Shen, H.T.: Inter-media hashing for large-scale retrieval from heterogeneous data sources. In: Proceedings of the ACM International Conference on Management of Data (SIGMOD), pp. 785–796 (2013). https://doi.org/10.1145/2463676.2465274
Xie, L., Shen, J., Han, J., Zhu, L., Shao, L.: Dynamic multi-view hashing for online image retrieval. In: Proceedings of the Joint Conference Artificial Intelligence (IJCAI), pp. 3133–3139 (2017). https://doi.org/10.24963/ijcai.2017/437
Zhen, Y., Yeung, D.: A probabilistic model for multimodal hash function learning. In: Proceedings of the ACM International Conference on Knowledge Discovery and Data Mining (KDD), pp. 940–948 (2012). https://doi.org/10.1145/2339530.2339678
Zhou, J., Ding, G., Guo, Y.: Latent semantic sparse hashing for cross-modal similarity search. In: Proceedings of the ACM International Conference on Information Retrieval (SIGIR), pp. 415–424 (2014). https://doi.org/10.1145/2600428.2609610
Zhu, L., Huang, Z., Chang, X., Song, J., Shen, H.T.: Exploring consistent preferences: discrete hashing with pair-exemplar for scalable landmark search. In: Proceedings of the ACM International Conference on Multimedia (MM), pp. 726–734 (2017). https://doi.org/10.1145/3123266.3123301
Zhu, L., Huang, Z., Li, Z., Xie, L., Shen, H.T.: Exploring auxiliary context: discrete semantic transfer hashing for scalable image retrieval. IEEE Trans. Neural Netw. Learn. Syst. 99, 1–13 (2018). https://doi.org/10.1109/TNNLS.2018.2797248
Zhu, L., Huang, Z., Liu, X., He, X., Sun, J., Zhou, X.: Discrete multimodal hashing with canonical views for robust mobile landmark search. IEEE Trans. Multimed. 19(9), 2066–2079 (2017). https://doi.org/10.1109/TMM.2017.2729025
Zhu, X., Huang, Z., Shen, H.T., Zhao, X.: Linear cross-modal hashing for efficient multimedia search. In: Proceedings of the ACM International Conference on Multimedia (MM), pp. 143–152 (2013). https://doi.org/10.1145/2502081.2502107
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Tang, D., Cui, H., Shi, D., Ji, H. (2018). Hypergraph-Based Discrete Hashing Learning for Cross-Modal Retrieval. In: Hong, R., Cheng, WH., Yamasaki, T., Wang, M., Ngo, CW. (eds) Advances in Multimedia Information Processing – PCM 2018. PCM 2018. Lecture Notes in Computer Science(), vol 11164. Springer, Cham. https://doi.org/10.1007/978-3-030-00776-8_71
Download citation
DOI: https://doi.org/10.1007/978-3-030-00776-8_71
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00775-1
Online ISBN: 978-3-030-00776-8
eBook Packages: Computer ScienceComputer Science (R0)