Abstract
Hashing methods have attracted great attention for cross-modal retrieval due to the low memory requirement and fast computation. Cross-modal hashing methods aim to transform the data from different modalities into a common Hamming space. However, most existing cross-modal hashing methods ignore the restrictions on the Hamming distance between dissimilar instances. Besides, most cross-modal hashing methods relax discrete constraints and then quantize the continuous values to obtain suboptimal solutions as hash codes, which causes quantization error and low retrieval performance. To address above problems, we propose a novel supervised cross-modal hashing method, termed Discrete Similarity Preserving Hashing (DSPH). DSPH simultaneously preserves inter-modality and intra-modality similarity. Specifically, DSPH puts restrictions on both the similar and dissimilar instances to learn more discriminative hash codes. Moreover, we present a discrete gradient descent algorithm to solve the discrete optimization problem. Extensive experiments conducted on Wiki and NUS-WIDE datasets show that DSPH improves retrieval performance compared with several state-of-the-art cross-modal hashing methods.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Chen, Z., Zhong, F., Min, G., Leng, Y., Ying, Y.: Supervised intra- and inter-modality similarity preserving hashing for cross-modal retrieval. IEEE Access 6, 27796–27808 (2018)
Chua, T.S., Tang, J., Hong, R., Li, H., Luo, Z., Zheng, Y.: Nus-wide: a real-world web image database from national university of singapore. In: Proceedings of the ACM International Conference on Image and Video Retrieval, p. 48 (2009)
Ding, G., Guo, Y., Zhou, J.: Collective matrix factorization hashing for multimodal data. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2075–2082 (2014)
Gong, Y., Lazebnik, S., Gordo, A., Perronnin, F.: Iterative quantization: a procrustean approach to learning binary codes for large-scale image retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 35(12), 2916–2929 (2013)
He, K., Wen, F., Sun, J.: K-means hashing: an affinity-preserving quantization method for learning binary compact codes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2938–2945 (2013)
Kang, W., Li, W., Zhou, Z.: Column sampling based discrete supervised hashing. In: Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, pp. 1230–1236 (2016)
Kumar, S., Udupa, R.: Learning hash functions for cross-view similarity search. In: International Joint Conference on Artificial Intelligence, p. 1360 (2011)
Lin, G., Shen, C., van den Hengel, A.: Supervised hashing using graph cuts and boosted decision trees. IEEE Trans. Pattern Anal. Mach. Intell. 37(11), 2317–2331 (2015)
Lin, Z., Ding, G., Hu, M., Wang, J.: Semantics-preserving hashing for cross-view retrieval. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3864–3872 (2015)
Liu, H., Ji, R., Wu, Y., Huang, F., Zhang, B.: Cross-modality binary code learning via fusion similarity hashing. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6345–6353 (2017)
Liu, W., Wang, J., Ji, R., Jiang, Y.G., Chang, S.F.: Supervised hashing with kernels. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2074–2081 (2012)
Liu, W., Wang, J., Kumar, S., Chang, S.F.: Hashing with graphs. In: International Conference on Machine Learning, pp. 1–8 (2011)
Ma, D., Liang, J., Kong, X., He, R.: Frustratingly easy cross-modal hashing. In: Proceedings of the 2016 ACM on Multimedia Conference, pp. 237–241 (2016)
Masci, J., Bronstein, M.M., Bronstein, A.M., Schmidhuber, J.: Multimodal similarity-preserving hashing. IEEE Trans. Pattern Anal. Mach. Intell. 36(4), 824–830 (2014)
Rasiwasia, N., et al.: A new approach to cross-modal multimedia retrieval. In: Proceedings of the 18th ACM International Conference on Multimedia, pp. 251–260 (2010)
Shen, F., Liu, W., Zhang, S., Yang, Y., Tao Shen, H.: Learning binary codes for maximum inner product search. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 4148–4156 (2015)
Shen, F., Shen, C., Liu, W., Tao Shen, H.: Supervised discrete hashing. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 37–45 (2015)
Shen, F., Shen, C., Shi, Q., van den Hengel, A., Tang, Z., Shen, H.T.: Hashing on nonlinear manifolds. IEEE Trans. Image Process. 24(6), 1839–1851 (2015)
Shen, F., Zhou, X., Yang, Y., Song, J., Shen, H.T., Tao, D.: A fast optimization method for general binary code learning. IEEE Trans. Image Process. 25(12), 5610–5621 (2016)
Song, J., Yang, Y., Yang, Y., Huang, Z., Shen, H.T.: Inter-media hashing for large-scale retrieval from heterogeneous data sources. In: Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, pp. 785–796 (2013)
Tang, J., Wang, K., Shao, L.: Supervised matrix factorization hashing for cross-modal retrieval. IEEE Trans. Image Process. 25(7), 3157–3166 (2016)
Wang, Y., Ni, R., Zhao, Y., Xian, M.: Watermark embedding for direct binary searched halftone images by adopting visual cryptography. Comput., Mater. Continua 55(2), 255–265 (2018)
Weiss, Y., Torralba, A., Fergus, R.: Spectral hashing. In: Advances in Neural Information Processing Systems, pp. 1753–1760 (2009)
Yang, Z., Huang, Y., Li, X., Wang, W.: Efficient secure data provenance scheme in multimedia outsourcing and sharing. Comput., Mater. Continua 56(1), 1–17 (2018)
Zhang, L., Zhang, Y., Gu, X., Tang, J., Tian, Q.: Scalable similarity search with topology preserving hashing. IEEE Trans. Image Process. 23(7), 3025–3039 (2014)
Acknowledgment
This work was supported by National Natural Science Foundation of China (Grant No. 61772111, 61872170).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Li, M., Kong, X., Yao, T., Zhang, Y. (2019). Discrete Similarity Preserving Hashing for Cross-modal Retrieval. In: Sun, X., Pan, Z., Bertino, E. (eds) Artificial Intelligence and Security. ICAIS 2019. Lecture Notes in Computer Science(), vol 11633. Springer, Cham. https://doi.org/10.1007/978-3-030-24265-7_18
Download citation
DOI: https://doi.org/10.1007/978-3-030-24265-7_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-24264-0
Online ISBN: 978-3-030-24265-7
eBook Packages: Computer ScienceComputer Science (R0)