Skip to main content

Discrete Similarity Preserving Hashing for Cross-modal Retrieval

  • Conference paper
  • First Online:
  • 1692 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNSC,volume 11633))

Abstract

Hashing methods have attracted great attention for cross-modal retrieval due to the low memory requirement and fast computation. Cross-modal hashing methods aim to transform the data from different modalities into a common Hamming space. However, most existing cross-modal hashing methods ignore the restrictions on the Hamming distance between dissimilar instances. Besides, most cross-modal hashing methods relax discrete constraints and then quantize the continuous values to obtain suboptimal solutions as hash codes, which causes quantization error and low retrieval performance. To address above problems, we propose a novel supervised cross-modal hashing method, termed Discrete Similarity Preserving Hashing (DSPH). DSPH simultaneously preserves inter-modality and intra-modality similarity. Specifically, DSPH puts restrictions on both the similar and dissimilar instances to learn more discriminative hash codes. Moreover, we present a discrete gradient descent algorithm to solve the discrete optimization problem. Extensive experiments conducted on Wiki and NUS-WIDE datasets show that DSPH improves retrieval performance compared with several state-of-the-art cross-modal hashing methods.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Chen, Z., Zhong, F., Min, G., Leng, Y., Ying, Y.: Supervised intra- and inter-modality similarity preserving hashing for cross-modal retrieval. IEEE Access 6, 27796–27808 (2018)

    Article  Google Scholar 

  2. Chua, T.S., Tang, J., Hong, R., Li, H., Luo, Z., Zheng, Y.: Nus-wide: a real-world web image database from national university of singapore. In: Proceedings of the ACM International Conference on Image and Video Retrieval, p. 48 (2009)

    Google Scholar 

  3. Ding, G., Guo, Y., Zhou, J.: Collective matrix factorization hashing for multimodal data. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2075–2082 (2014)

    Google Scholar 

  4. Gong, Y., Lazebnik, S., Gordo, A., Perronnin, F.: Iterative quantization: a procrustean approach to learning binary codes for large-scale image retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 35(12), 2916–2929 (2013)

    Article  Google Scholar 

  5. He, K., Wen, F., Sun, J.: K-means hashing: an affinity-preserving quantization method for learning binary compact codes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2938–2945 (2013)

    Google Scholar 

  6. Kang, W., Li, W., Zhou, Z.: Column sampling based discrete supervised hashing. In: Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, pp. 1230–1236 (2016)

    Google Scholar 

  7. Kumar, S., Udupa, R.: Learning hash functions for cross-view similarity search. In: International Joint Conference on Artificial Intelligence, p. 1360 (2011)

    Google Scholar 

  8. Lin, G., Shen, C., van den Hengel, A.: Supervised hashing using graph cuts and boosted decision trees. IEEE Trans. Pattern Anal. Mach. Intell. 37(11), 2317–2331 (2015)

    Article  Google Scholar 

  9. Lin, Z., Ding, G., Hu, M., Wang, J.: Semantics-preserving hashing for cross-view retrieval. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3864–3872 (2015)

    Google Scholar 

  10. Liu, H., Ji, R., Wu, Y., Huang, F., Zhang, B.: Cross-modality binary code learning via fusion similarity hashing. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6345–6353 (2017)

    Google Scholar 

  11. Liu, W., Wang, J., Ji, R., Jiang, Y.G., Chang, S.F.: Supervised hashing with kernels. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2074–2081 (2012)

    Google Scholar 

  12. Liu, W., Wang, J., Kumar, S., Chang, S.F.: Hashing with graphs. In: International Conference on Machine Learning, pp. 1–8 (2011)

    Google Scholar 

  13. Ma, D., Liang, J., Kong, X., He, R.: Frustratingly easy cross-modal hashing. In: Proceedings of the 2016 ACM on Multimedia Conference, pp. 237–241 (2016)

    Google Scholar 

  14. Masci, J., Bronstein, M.M., Bronstein, A.M., Schmidhuber, J.: Multimodal similarity-preserving hashing. IEEE Trans. Pattern Anal. Mach. Intell. 36(4), 824–830 (2014)

    Article  Google Scholar 

  15. Rasiwasia, N., et al.: A new approach to cross-modal multimedia retrieval. In: Proceedings of the 18th ACM International Conference on Multimedia, pp. 251–260 (2010)

    Google Scholar 

  16. Shen, F., Liu, W., Zhang, S., Yang, Y., Tao Shen, H.: Learning binary codes for maximum inner product search. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 4148–4156 (2015)

    Google Scholar 

  17. Shen, F., Shen, C., Liu, W., Tao Shen, H.: Supervised discrete hashing. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 37–45 (2015)

    Google Scholar 

  18. Shen, F., Shen, C., Shi, Q., van den Hengel, A., Tang, Z., Shen, H.T.: Hashing on nonlinear manifolds. IEEE Trans. Image Process. 24(6), 1839–1851 (2015)

    Article  MathSciNet  Google Scholar 

  19. Shen, F., Zhou, X., Yang, Y., Song, J., Shen, H.T., Tao, D.: A fast optimization method for general binary code learning. IEEE Trans. Image Process. 25(12), 5610–5621 (2016)

    Article  MathSciNet  Google Scholar 

  20. Song, J., Yang, Y., Yang, Y., Huang, Z., Shen, H.T.: Inter-media hashing for large-scale retrieval from heterogeneous data sources. In: Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, pp. 785–796 (2013)

    Google Scholar 

  21. Tang, J., Wang, K., Shao, L.: Supervised matrix factorization hashing for cross-modal retrieval. IEEE Trans. Image Process. 25(7), 3157–3166 (2016)

    Article  MathSciNet  Google Scholar 

  22. Wang, Y., Ni, R., Zhao, Y., Xian, M.: Watermark embedding for direct binary searched halftone images by adopting visual cryptography. Comput., Mater. Continua 55(2), 255–265 (2018)

    Article  Google Scholar 

  23. Weiss, Y., Torralba, A., Fergus, R.: Spectral hashing. In: Advances in Neural Information Processing Systems, pp. 1753–1760 (2009)

    Google Scholar 

  24. Yang, Z., Huang, Y., Li, X., Wang, W.: Efficient secure data provenance scheme in multimedia outsourcing and sharing. Comput., Mater. Continua 56(1), 1–17 (2018)

    Google Scholar 

  25. Zhang, L., Zhang, Y., Gu, X., Tang, J., Tian, Q.: Scalable similarity search with topology preserving hashing. IEEE Trans. Image Process. 23(7), 3025–3039 (2014)

    Article  MathSciNet  Google Scholar 

Download references

Acknowledgment

This work was supported by National Natural Science Foundation of China (Grant No. 61772111, 61872170).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xiangwei Kong .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Li, M., Kong, X., Yao, T., Zhang, Y. (2019). Discrete Similarity Preserving Hashing for Cross-modal Retrieval. In: Sun, X., Pan, Z., Bertino, E. (eds) Artificial Intelligence and Security. ICAIS 2019. Lecture Notes in Computer Science(), vol 11633. Springer, Cham. https://doi.org/10.1007/978-3-030-24265-7_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-24265-7_18

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-24264-0

  • Online ISBN: 978-3-030-24265-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics