Skip to main content

Hypergraph-Based Discrete Hashing Learning for Cross-Modal Retrieval

  • Conference paper
  • First Online:
Advances in Multimedia Information Processing – PCM 2018 (PCM 2018)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11164))

Included in the following conference series:

Abstract

Hashing has drawn increasing attention in cross-modal retrieval due to its high computation efficiency and low storage cost. However, there is a certain lack in the previous cross-modal hashing methods that they can not effectively represent the correlations between paired multi-modal instances. In this paper, we propose a novel Hypergraph-based Discrete Hashing (BGDH) to solve the limitation. We formulate a unified unsupervised hashing framework which simultaneously performs hypergraph learning and hash codes learning. Hypergraph learning can effectively preserve the intra-media similarity consistency. Furthermore, we propose an efficient discrete hash optimization method to directly learn the hash codes without quantization information loss. Extensive experiments on three benchmark datasets demonstrate the superior performance of the proposed approach, compared with state-of-the-art cross-modal hashing techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. Arch. 3, 993–1022 (2003)

    MATH  Google Scholar 

  2. Cheng, Z., Shen, J., Zhu, L., Kankanhalli, M.S., Nie, L.: Exploiting music play sequence for music recommendation. In: Proceedings of the Joint Conference on Artificial Intelligence (IJCAI), pp. 3654–3660 (2017). https://doi.org/10.24963/ijcai.2017/511

  3. Ding, G., Guo, Y., Zhou, J.: Collective matrix factorization hashing for multimodal data. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2083–2090 (2014). https://doi.org/10.1109/CVPR.2014.267

  4. Gao, Y., Wang, M., Zha, Z., Shen, J., Li, X., Wu, X.: Visual-textual joint relevance learning for tag-based social image search. IEEE Trans. Image Process. 22(1), 363–376 (2013). https://doi.org/10.1109/TIP.2012.2202676

    Article  MathSciNet  MATH  Google Scholar 

  5. Kumar, S., Udupa, R.: Learning hash functions for cross-view similarity search. In: Proceedings of the Joint Conference on Artificial Intelligence (IJCAI), pp. 1360–1365 (2011). https://doi.org/10.5591/978-1-57735-516-8/IJCAI11-230

  6. Lin, Z., Chen, M., Ma, Y.: The augmented lagrange multiplier method for exact recovery of corrupted low-rank matrices. CoRR 1009.5055 (2010)

    Google Scholar 

  7. Lin, Z., Ding, G., Hu, M., Wang, J.: Semantics-preserving hashing for cross-view retrieval. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3864–3872 (2015). https://doi.org/10.1109/CVPR.2015.7299011

  8. Liong, V.E., Lu, J., Tan, Y.: Cross-modal discrete hashing. Pattern Recognit. 79, 114–129 (2018). https://doi.org/10.1016/j.patcog.2018.02.002

    Article  Google Scholar 

  9. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)

    Article  MathSciNet  Google Scholar 

  10. Song, J., Yang, Y., Yang, Y., Huang, Z., Shen, H.T.: Inter-media hashing for large-scale retrieval from heterogeneous data sources. In: Proceedings of the ACM International Conference on Management of Data (SIGMOD), pp. 785–796 (2013). https://doi.org/10.1145/2463676.2465274

  11. Xie, L., Shen, J., Han, J., Zhu, L., Shao, L.: Dynamic multi-view hashing for online image retrieval. In: Proceedings of the Joint Conference Artificial Intelligence (IJCAI), pp. 3133–3139 (2017). https://doi.org/10.24963/ijcai.2017/437

  12. Zhen, Y., Yeung, D.: A probabilistic model for multimodal hash function learning. In: Proceedings of the ACM International Conference on Knowledge Discovery and Data Mining (KDD), pp. 940–948 (2012). https://doi.org/10.1145/2339530.2339678

  13. Zhou, J., Ding, G., Guo, Y.: Latent semantic sparse hashing for cross-modal similarity search. In: Proceedings of the ACM International Conference on Information Retrieval (SIGIR), pp. 415–424 (2014). https://doi.org/10.1145/2600428.2609610

  14. Zhu, L., Huang, Z., Chang, X., Song, J., Shen, H.T.: Exploring consistent preferences: discrete hashing with pair-exemplar for scalable landmark search. In: Proceedings of the ACM International Conference on Multimedia (MM), pp. 726–734 (2017). https://doi.org/10.1145/3123266.3123301

  15. Zhu, L., Huang, Z., Li, Z., Xie, L., Shen, H.T.: Exploring auxiliary context: discrete semantic transfer hashing for scalable image retrieval. IEEE Trans. Neural Netw. Learn. Syst. 99, 1–13 (2018). https://doi.org/10.1109/TNNLS.2018.2797248

    Article  Google Scholar 

  16. Zhu, L., Huang, Z., Liu, X., He, X., Sun, J., Zhou, X.: Discrete multimodal hashing with canonical views for robust mobile landmark search. IEEE Trans. Multimed. 19(9), 2066–2079 (2017). https://doi.org/10.1109/TMM.2017.2729025

    Article  Google Scholar 

  17. Zhu, X., Huang, Z., Shen, H.T., Zhao, X.: Linear cross-modal hashing for efficient multimedia search. In: Proceedings of the ACM International Conference on Multimedia (MM), pp. 143–152 (2013). https://doi.org/10.1145/2502081.2502107

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hua Ji .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Tang, D., Cui, H., Shi, D., Ji, H. (2018). Hypergraph-Based Discrete Hashing Learning for Cross-Modal Retrieval. In: Hong, R., Cheng, WH., Yamasaki, T., Wang, M., Ngo, CW. (eds) Advances in Multimedia Information Processing – PCM 2018. PCM 2018. Lecture Notes in Computer Science(), vol 11164. Springer, Cham. https://doi.org/10.1007/978-3-030-00776-8_71

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-00776-8_71

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-00775-1

  • Online ISBN: 978-3-030-00776-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics