GrowBit: Incremental Hashing for Cross-Modal Retrieval

Mandal, Devraj; Annadani, Yashas; Biswas, Soma

doi:10.1007/978-3-030-20870-7_19

Devraj Mandal¹²,
Yashas Annadani¹³ &
Soma Biswas¹²

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11364))

Included in the following conference series:

Asian Conference on Computer Vision

2092 Accesses

Abstract

Cross-modal retrieval using hashing techniques is gaining increasing importance due to its efficient storage, scalability and fast query processing speeds. In this work, we address a related and relatively unexplored problem: given a set of cross-modal data with their already learned hash codes, can we increase the number of bits to better represent the data without relearning everything? This scenario is especially important when the number of tags describing the data increases, necessitating longer hash codes for better representation. To tackle this problem, we propose a novel approach called GrowBit, which incrementally learns the bits in the hash code and thus utilizes all the bits learned so far. We develop a two-stage approach for learning the hash codes and hash functions separately, utilizing a recent formulation which decouples over the bits so that it can incorporate the incremental approach. Experiments on MirFlickr, IAPR-TC-12 and NUS-WIDE datasets show the usefulness of the proposed approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Online Discriminative Semantic-Preserving Hashing for Large-Scale Cross-Modal Retrieval

Category-Level Contrastive Learning for Unsupervised Hashing in Cross-Modal Retrieval

Article Open access 02 April 2024

Learning Hash Subspace from Large-Scale Multi-modal Pre-Training: A CLIP-Based Cross-modal Hashing Framework

References

Bronstein, M.M., Bronstein, A.M., Michel, F., Paragios, N.: Data fusion through cross-modality metric learning using similarity-sensitive hashing. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3594–3601 (2010)
Google Scholar
Cakir, F., He, K., Sclaroff, S.: Hashing with binary matrix pursuit. arXiv preprint arXiv:1808.01990 (2018)
Chua, T.S., Tang, J., Hong, R., Li, H., Luo, Z., Zheng, Y.: NUS-WIDE: a real-world web image database from National University of Singapore. In: 2009 ACM International Conference on Image and Video Retrieval (ACM-CIVR), pp. 48–56 (2009)
Google Scholar
Dai, Q., Li, J., Wang, J., Jiang, Y.G.: Binary optimized hashing. In: 2016 ACM on Multimedia Conference (ACM-MM), pp. 1247–1256 (2016)
Google Scholar
Ding, G., Guo, Y., Zhou, J.: Collective matrix factorization hashing for multimodal data. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2075–2082 (2014)
Google Scholar
Escalante, H.J., et al.: The segmented and annotated IAPR TC-12 benchmark. Comput. Vis. Image Underst. 114, 419–428 (2010)
Article Google Scholar
Fatih, C., He, K., Bargal, S.A., Sclaroff, S.: MIHash: online hashing with mutual information. In: 2017 IEEE International Conference on Computer Vision (ICCV), pp. 437–445 (2017)
Google Scholar
Hardoon, D.R., Szedmak, S., Shawe-Taylor, J.: Canonical correlation analysis: an overview with application to learning methods. Neural Comput. 16, 2639–2664 (2004)
Article Google Scholar
Horn, R.A., Johnson, C.R.: Matrix Analysis. Cambridge University Press, Cambridge (1990)
MATH Google Scholar
Huiskes, M.J., Lew, M.S.: The MIR Flickr retrieval evaluation. In: 2008 ACM International Conference on Multimedia Information Retrieval (ACM-MIR), pp. 39–43 (2008)
Google Scholar
Jiang, Q.Y., Li, W.J.: Deep cross-modal hashing. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3232–3240 (2017)
Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: 2012 Advances in Neural Information Processing Systems (NIPS), pp. 1097–1105 (2012)
Google Scholar
Kumar, S., Udupa, R.: Learning hash functions for cross-view similarity search. In: 2011 International Joint Conference on Artificial Intelligence (IJCAI), pp. 1360–1365 (2011)
Google Scholar
Lin, G., Shen, C., Suter, D., van den Hengel, A.: A general two-step approach to learning-based hashing. In: 2013 IEEE International Conference on Computer Vision (ICCV), pp. 2552–2559 (2013)
Google Scholar
Lin, Z., Ding, G., Han, J., Wang, J.: Cross-view retrieval via probability-based semantics-preserving hashing. IEEE Trans. Cybern. 47, 4342–4355 (2017)
Article Google Scholar
Lin, Z., Ding, G., Hu, M., Wang, J.: Semantics-preserving hashing for cross-view retrieval. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3864–3872 (2015)
Google Scholar
Liu, L., Lin, Z., Shao, L., Shen, F., Ding, G., Han, J.: Sequential discrete hashing for scalable cross-modality similarity retrieval. IEEE Trans. Image Process. 26, 107–118 (2017)
Article MathSciNet Google Scholar
Long, M., Cao, Y., Wang, J., Yu, P.S.: Compositional correlation quantization for large-scale multimodal search. In: 2016 International ACM SIGIR Conference on Research & Development in Information Retrieval (SIGIR), pp. 579–588 (2016)
Google Scholar
Mandal, D., Biswas, S.: Label consistent matrix factorization based hashing for cross-modal retrieval. In: 2017 IEEE International Conference on Image Processing (ICIP), pp. 2901–2905 (2017)
Google Scholar
Mandal, D., Chaudhury, K.N., Biswas, S.: Generalized semantic preserving hashing for n-label cross-modal retrieval. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2633–2641 (2017)
Google Scholar
Paszke, A., et al.: Automatic differentiation in PyTorch. In: 2017 Advances in Neural Information Processing Systems Workshop (NIPS-W) (2017)
Google Scholar
Shen, F., Zhou, X., Yang, Y., Song, J., Shen, H.T., Tao, D.: A fast optimization method for general binary code learning. IEEE Trans. Image Process. 25, 5610–5621 (2017)
Article MathSciNet Google Scholar
Song, J., Yang, Y., Yang, Y., Huang, Z., Shen, H.T.: Inter-media hashing for large-scale retrieval from heterogeneous data sources. In: 2013 ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 785–796 (2013)
Google Scholar
Wang, D., Gao, X., Wang, X., He, L.: Semantic topic multimodal hashing for cross-media retrieval. In: 2015 International Joint Conference on Artificial Intelligence (IJCAI), pp. 3890–3896 (2015)
Google Scholar
Wang, J., Kumar, S., Chang, S.F.: Sequential projection learning for hashing with compact codes. In: 2010 International Conference on Machine Learning (ICML), pp. 1127–1134 (2010)
Google Scholar
Wu, B., Yang, Q., Zheng, W.S., Wang, Y., Wang, J.: Quantized correlation hashing for fast cross-modal search. In: 2015 International Joint Conference on Artificial Intelligence (IJCAI), pp. 3946–3952 (2015)
Google Scholar
Xia, R., Pan, Y., Lai, H., Liu, C., Yan, S.: Supervised hashing for image retrieval via image representation learning. In: 2014 AAAI Conference on Artificial Intelligence (AAAI), pp. 2156–2162 (2014)
Google Scholar
Xie, L., Shen, J., Han, J., Zhu, L., Shao, L.: Dynamic multi-view hashing for online image retrieval. In: 2017 International Joint Conference on Artificial Intelligence (IJCAI), pp. 3133–3139 (2017)
Google Scholar
Xu, X., Shen, F., Yang, Y., Shen, H.T., Li, X.: Learning discriminative binary codes for large-scale cross-modal retrieval. IEEE Trans. Image Process. 26, 2494–2507 (2017)
Article MathSciNet Google Scholar
Yang, E., Deng, C., Liu, W., Liu, X., Tao, D., Gao, X.: Pairwise relationship guided deep hashing for cross-modal retrieval. In: 2017 AAAI Conference on Artificial Intelligence (AAAI), pp. 1618–1625 (2017)
Google Scholar
Zhang, D., Li, W.J.: Large-scale supervised multimodal hashing with semantic correlation maximization. In: 2014 AAAI Conference on Artificial Intelligence (AAAI), pp. 2177–2183 (2014)
Google Scholar
Zhang, J., Peng, Y., Yuan, M.: Unsupervised generative adversarial cross-modal hashing. In: 2018 AAAI Conference on Artificial Intelligence (AAAI), pp. 539–546 (2018)
Google Scholar
Zhang, R., Lin, L., Zhang, R., Zuo, W., Zhang, L.: Bit-scalable deep hashing with regularized similarity learning for image retrieval and person re-identification. IEEE Trans. Image Process. 24, 4766–4779 (2015)
Article MathSciNet Google Scholar
Zhang, T., Wang, J.: Collaborative quantization for cross-modal similarity search. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2036–2045 (2016)
Google Scholar
Zhou, J., Ding, G., Guo, Y.: Latent semantic sparse hashing for cross-modal similarity search. In: 2014 International ACM SIGIR Conference on Research & Development in Information Retrieval (SIGIR), pp. 415–424 (2014)
Google Scholar
Zhou, J., Ding, G., Guo, Y., Liu, Q., Dong, X.: Kernel-based supervised hashing for cross-view similarity search. In: 2014 IEEE International Conference on Multimedia and Expo (ICME), pp. 1–6 (2014)
Google Scholar
Zhu, H., Long, M., Wang, J., Cao, Y.: Deep hashing network for efficient similarity retrieval. In: 2016 AAAI Conference on Artificial Intelligence (AAAI), pp. 2415–2421 (2016)
Google Scholar
Zhuang, B., Lin, G., Shen, C., Reid, I.: Fast training of triplet-based deep binary embedding networks. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5955–5964 (2016)
Google Scholar

Download references

Author information

Authors and Affiliations

Indian Institute of Science, Bangalore, India
Devraj Mandal & Soma Biswas
ETH, Zurich, Switzerland
Yashas Annadani

Authors

Devraj Mandal
View author publications
You can also search for this author in PubMed Google Scholar
Yashas Annadani
View author publications
You can also search for this author in PubMed Google Scholar
Soma Biswas
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Devraj Mandal .

Editor information

Editors and Affiliations

IIIT Hyderabad, Hyderabad, India
C.V. Jawahar
ANU, Canberra, ACT, Australia
Hongdong Li
Simon Fraser University, Burnaby, BC, Canada
Greg Mori
ETH Zurich, Zurich, Zürich, Switzerland
Konrad Schindler

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mandal, D., Annadani, Y., Biswas, S. (2019). GrowBit: Incremental Hashing for Cross-Modal Retrieval. In: Jawahar, C., Li, H., Mori, G., Schindler, K. (eds) Computer Vision – ACCV 2018. ACCV 2018. Lecture Notes in Computer Science(), vol 11364. Springer, Cham. https://doi.org/10.1007/978-3-030-20870-7_19

Download citation

DOI: https://doi.org/10.1007/978-3-030-20870-7_19
Published: 25 May 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-20869-1
Online ISBN: 978-3-030-20870-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics