skip to main content
10.1145/3581783.3611977acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Hashing One With All

Published: 27 October 2023 Publication History

Abstract

The recent trend in unsupervised hashing requires not only a discrete representation space but also the ability to mine the similarities between data points. Determining and maintaining the relations between each datum and all the others maximally utilize the semantic diversity of the training set, but can we explore this on the holistic dataset using a single network end-to-end? In this paper, we take a step towards this vision by proposing Overview Hashing (OH). OH unifies the two ultimate goals of unsupervised hashing, i.e., (1) encoding compact features and (2) learning data similarities on a large scale end-to-end, into one model. In particular, we split the top of an encoder into a binary hash head and a continuous one. For an arbitrary datum, its similarities to all the others in the dataset are reflected in the Hamming distances of their hash heads. The distances then act as the weights to aggregate the continuous heads, shaping the final representation of this datum for loss computation. Hence, training with this representation simultaneously tunes the similarities of this datum to the whole dataset. In the context of a contrastive learning framework, we theoretically endorse our design by linking it to knowledge distillation and the attention mechanisms. Our experiments on the benchmarked datasets show the superiority of OH over the state-of-the-art hashing methods. Code is available at \hrefhttps://github.com/RosieYuu/OH \textcolorred https://github.com/RosieYuu/OH.

Supplemental Material

MP4 File
This is a video to introduce our work 'Hashing One with All', our work proposed an end-to-end unsupervised deep hashing model that explores semantic relations on a large scale. We introduced the concept of overview that describes data relevance on the whole dataset using pair-wise hash code similarities. A contrastive learning objective was employed, of which the input was the aggregated representation of each datum according to its overview. We theoretically linked our training strategy and model structure with knowledge distillation and the attention mechanism. Extensive experiments revealed that the proposed method remarkably boosted the state-of-the-art unsupervised hashing schemes in image retrieval.

References

[1]
Mart237;n Abadi, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Greg S Corrado, Andy Davis, Jeffrey Dean, Matthieu Devin, et al. 2016. Tensorflow: Large-scale machine learning on heterogeneous distributed systems. arXiv:1603.04467 (2016).
[2]
Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2015. Neural machine translation by jointly learning to align and translate. In ICLR.
[3]
Yoshua Bengio, Nicholas Léonard, and Aaron Courville. 2013. Estimating or propagating gradients through stochastic neurons for conditional computation. arXiv preprint arXiv:1308.3432 (2013).
[4]
Yue Cao, Mingsheng Long, Jianmin Wang, Han Zhu, and Qingfu Wen. 2016. Deep Quantization Network for Efficient Image Retrieval. In AAAI.
[5]
Zhangjie Cao, Mingsheng Long, Jianmin Wang, and Philip S Yu. 2017a. HashNet: Deep Learning to Hash by Continuation. In ICCV.
[6]
Zhangjie Cao, Mingsheng Long, Jianmin Wang, and Philip S. Yu. 2017b. HashNet: Deep Learning to Hash by Continuation. In The ICCV.
[7]
Mathilde Caron, Ishan Misra, Julien Mairal, Priya Goyal, Piotr Bojanowski, and Armand Joulin. 2020. Unsupervised learning of visual features by contrasting cluster assignments. In NeurIPS.
[8]
Miguel A Carreira-Perpinán and Ramin Raziperchikolaei. 2015. Hashing with binary autoencoders. In CVPR.
[9]
Moses S Charikar. 2002. Similarity estimation techniques from rounding algorithms. In STOC.
[10]
Ting Chen, Simon Kornblith, Mohammad Norouzi, and Geoffrey Hinton. 2020b. A simple framework for contrastive learning of visual representations. In ICML.
[11]
Xinlei Chen, Haoqi Fan, Ross Girshick, and Kaiming He. 2020a. Improved baselines with momentum contrastive learning. arXiv preprint arXiv:2003.04297 (2020).
[12]
Tat-Seng Chua, Jinhui Tang, Richang Hong, Haojie Li, Zhiping Luo, and Yantao Zheng. 2009. NUS-WIDE: a real-world web image database from National University of Singapore. In CIVR.
[13]
Bo Dai, Ruiqi Guo, Sanjiv Kumar, Niao He, and Le Song. 2017. Stochastic Generative Hashing. In ICML.
[14]
Thanh-Toan Do, Anh-Dzung Doan, and Ngai-Man Cheung. 2016. Learning to hash with binary deep neural network. In ECCV.
[15]
Khoa D. Doan, Peng Yang, and Ping Li. 2022. One Loss for Quantization: Deep Hashing with Discrete Wasserstein Distributional Matching. In CVPR. 9437--9447.
[16]
Venice Erin Liong, Jiwen Lu, Gang Wang, Pierre Moulin, and Jie Zhou. 2015. Deep hashing for compact binary codes learning. In CVPR.
[17]
Kamran Ghasedi Dizaji, Feng Zheng, Najmeh Sadoughi, Yanhua Yang, Cheng Deng, and Heng Huang. 2018. Unsupervised Deep Generative Adversarial Hashing Network. In CVPR.
[18]
Y. Gong, S. Lazebnik, A. Gordo, and F. Perronnin. 2013. Iterative Quantization: A Procrustean Approach to Learning Binary Codes for Large-Scale Image Retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 35, 12 (2013), 2916--2929.
[19]
Kaiming He, Haoqi Fan, Yuxin Wu, Saining Xie, and Ross Girshick. 2020. Momentum contrast for unsupervised visual representation learning. In CVPR.
[20]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In CVPR.
[21]
Geoffrey Hinton, Oriol Vinyals, and Jeff Dean. 2015. Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531 (2015).
[22]
Diederik Kingma and Jimmy Ba. 2015. Adam: A method for STOChastic optimization. In ICLR.
[23]
Nikita Kitaev, Łukasz Kaiser, and Anselm Levskaya. 2020. Reformer: The efficient transformer. In ICLR.
[24]
Alex Krizhevsky and Geoffrey Hinton. 2009. Learning multiple layers of features from tiny images. Technical Report, University of Toronto (2009).
[25]
Brian Kulis and Trevor Darrell. 2009. Learning to hash with binary reconstructive embeddings. In NeurIPS.
[26]
Brian Kulis and Kristen Grauman. 2009. Kernelized locality-sensitive hashing for scalable image search. In ICCV.
[27]
Yann LeCun, Yoshua Bengio, et al. 1995. Convolutional networks for images, speech, and time series. The handbook of brain theory and neural networks, Vol. 3361, 10 (1995), 1995.
[28]
Juho Lee, Yoonho Lee, Jungtaek Kim, Adam Kosiorek, Seungjin Choi, and Yee Whye Teh. 2019. Set transformer: A framework for attention-based permutation-invariant neural networks. In ICML.
[29]
Junnan Li, Pan Zhou, Caiming Xiong, Richard Socher, and Steven CH Hoi. 2020. Prototypical contrastive learning of unsupervised representations. arXiv preprint arXiv:2005.04966 (2020).
[30]
Yunqiang Li and Jan C Van Gemert. 2021. Deep Unsupervised Image Hashing by Maximizing Bit Entropy. In AAAI.
[31]
Kevin Lin, Jiwen Lu, Chu-Song Chen, and Jie Zhou. 2016. Learning Compact Binary Descriptors With Unsupervised Deep Neural Networks. In CVPR.
[32]
Qinghong Lin, Xiaojun Chen, Qin Zhang, Shaotian Cai, Wenzhe Zhao, and Hongfa Wang. 2022. Deep Unsupervised Hashing with Latent Semantic Components. In AAAI.
[33]
Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C Lawrence Zitnick. 2014. Microsoft coco: Common objects in context. In ECCV.
[34]
Li Liu, Mengyang Yu, and Ling Shao. 2017. Latent structure preserving hashing. International Journal of Computer Vision, Vol. 122, 3 (2017), 439--457.
[35]
Wei Liu, Cun Mu, Sanjiv Kumar, and Shih-Fu Chang. 2014. Discrete graph hashing. In NeurIPS.
[36]
Wei Liu, Jun Wang, Sanjiv Kumar, and Shih-Fu Chang. 2011. Hashing with graphs. In ICML.
[37]
Xiao Luo, Haixin Wang, Daqing Wu, Chong Chen, Minghua Deng, Jianqiang Huang, and Xian-Sheng Hua. 2023. A Survey on Deep Hashing Methods. ACM Transactions on Knowledge Discovery from Data, Vol. 17, 1 (2023), No.15:1--50.
[38]
Xiao Luo, Daqing Wu, Zeyu Ma, Chong Chen, Minghua Deng, Jianqiang Huang, and Xian-Sheng Hua. 2021a. A Statistical Approach to Mining Semantic Similarity for Deep Unsupervised Hashing. In MM.
[39]
Xiao Luo, Daqing Wu, Zeyu Ma, Chong Chen, Huasong Zhong, Minghua Deng, Jianqiang Huang, and Xian-sheng Hua. 2021b. Cimon: Towards high-quality hash codes. In IJCAI.
[40]
Zeyu Ma, Wei Ju, Xiao Luo, Chong Chen, Xian-Sheng Hua, and Guangming Lu. 2022a. Improved Deep Unsupervised Hashing via Prototypical Learning. In ACM MM. 659--667.
[41]
Zeyu Ma, Xiao Luo, Yingjie Chen, Mixiao Hou, Jinxing Li, Minghua Deng, and Guangming Lu. 2022b. Improved Deep Unsupervised Hashing with Fine-grained Semantic Similarity Mining for Multi-Label Image Retrieval. In IJCAI. 1254--1260.
[42]
Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of Machine Learning Research, Vol. 9, Nov (2008), 2579--2605.
[43]
Aaron van den Oord, Yazhe Li, and Oriol Vinyals. 2018. Representation learning with contrastive predictive coding. arXiv preprint arXiv:1807.03748 (2018).
[44]
Zexuan Qiu, Qinliang Su, Zijing Ou, Jianxing Yu, and Changyou Chen. 2021. Unsupervised Hashing with Contrastive Information Bottleneck. In IJCAI.
[45]
Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, et al. 2015. Imagenet large scale visual recognition challenge. International Journal of Computer Vision, Vol. 115, 3 (2015), 211--252.
[46]
Ruslan Salakhutdinov and Geoffrey Hinton. 2009. Semantic Hashing. International Journal of Approximate Reasoning, Vol. 50, 7 (2009).
[47]
Fumin Shen, Chunhua Shen, Wei Liu, and Heng Tao Shen. 2015. Supervised discrete hashing. In CVPR.
[48]
Fumin Shen, Yan Xu, Li Liu, Yang Yang, Zi Huang, and Heng Tao Shen. 2018. Unsupervised deep hashing with similarity-adaptive and discrete optimization. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 40, 12 (2018), 3034--3044.
[49]
Yuming Shen, Li Liu, and Ling Shao. 2019. Unsupervised binary representation learning with deep variational networks. International Journal of Computer Vision, Vol. 127, 11--12 (2019), 1614--1628.
[50]
Yuming Shen, Jie Qin, Jiaxin Chen, Mengyang Yu, Li Liu, Fan Zhu, Fumin Shen, and Ling Shao. 2020. Auto-encoding twin-bottleneck hashing. In CVPR.
[51]
Karen Simonyan and Andrew Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. In ICLR.
[52]
Jingkuan Song, Tao He, Lianli Gao, Xing Xu, Alan Hanjalic, and Heng Tao Shen. 2018. Binary generative adversarial networks for image retrieval. In AAAI.
[53]
Shupeng Su, Chao Zhang, Kai Han, and Yonghong Tian. 2018. Greedy Hash: Towards Fast Optimization for Accurate Hash Coding in CNN. In NeurIPS.
[54]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In NeurIPS.
[55]
Jinpeng Wang, Ziyun Zeng, Bin Chen, Tao Dai, and Shu-Tao Xia. 2022. Contrastive Quantization with Code Memory for Unsupervised Image Retrieval. In AAAI.
[56]
Weiwei Wang, Yuming Shen, Haofeng Zhang, Yahou Yao, and Li Liu. 2020. Set and Rebase: Determining the Semantic Graph Connectivity for Unsupervised Cross-Modal Hashing. In IJCAI.
[57]
Yair Weiss, Antonio Torralba, and Rob Fergus. 2009. Spectral hashing. In NeurIPS.
[58]
Zhirong Wu, Yuanjun Xiong, Stella X Yu, and Dahua Lin. 2018. Unsupervised feature learning via non-parametric instance discrimination. In CVPR.
[59]
Rongkai Xia, Yan Pan, Hanjiang Lai, Cong Liu, and Shuicheng Yan. 2014. Supervised Hashing for Image Retrieval via Image Representation Learning. In AAAI.
[60]
Erkun Yang, Tongliang Liu, Cheng Deng, Wei Liu, and Dacheng Tao. 2019. DistillHash: Unsupervised Deep Hashing by Distilling Data Pairs. In CVPR.
[61]
Jiaguo Yu, Yuming Shen, Menghan Wang, Haofeng Zhang, and Philip HS Torr. 2022. Learning to Hash Naturally Sorts. In IJCAI.
[62]
Haofeng Zhang, Yifan Gu, Yazhou Yao, Zheng Zhang, Li Liu, Jian Zhang, and Ling Shao. 2021. Deep Unsupervised Self-evolutionary Hashing for Image Retrieval. IEEE Transactions on Multimedia, Vol. 23 (2021), 3400--3413.
[63]
Haofeng Zhang, Li Liu, Yang Long, and Ling Shao. 2017. Unsupervised deep hashing with pseudo labels for scalable image retrieval. IEEE Transactions on Image Processing, Vol. 27, 4 (2017), 1626--1638.
[64]
Wanqian Zhang, Dayan Wu, Chule Yang, Bo Li, and Weiping Wang. 2022. Clustering and Separating Similarities for Deep Unsupervised Hashing. In ICASSP. 1655--1659.
[65]
Han Zhu, Mingsheng Long, Jianmin Wang, and Yue Cao. 2016. Deep Hashing Network for Efficient Similarity Retrieval. In AAAI.
[66]
Maciej Zieba, Piotr Semberecki, Tarek El-Gaaly, and Tomasz Trzcinski. 2018. Bingan: Learning compact binary descriptors with a regularized gan. In NeurIPS.

Cited By

View all
  • (2024)Contrastive transformer masked image hashing for degraded image retrievalProceedings of the Thirty-Third International Joint Conference on Artificial Intelligence10.24963/ijcai.2024/135(1218-1226)Online publication date: 3-Aug-2024
  • (2024)Self-Quantization with Adaptive Codebooks for Unsupervised Image RetrievalPattern Recognition and Computer Vision10.1007/978-981-97-8792-0_38(546-560)Online publication date: 9-Nov-2024

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '23: Proceedings of the 31st ACM International Conference on Multimedia
October 2023
9913 pages
ISBN:9798400701085
DOI:10.1145/3581783
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 October 2023

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. contrastive learning
  2. image retrieval
  3. memory bank
  4. unsupervised deep hashing

Qualifiers

  • Research-article

Funding Sources

  • Natural Science Foundation of Jiangsu Province
  • "111" Program
  • National Natural Science Foundation of China

Conference

MM '23
Sponsor:
MM '23: The 31st ACM International Conference on Multimedia
October 29 - November 3, 2023
Ottawa ON, Canada

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)56
  • Downloads (Last 6 weeks)1
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Contrastive transformer masked image hashing for degraded image retrievalProceedings of the Thirty-Third International Joint Conference on Artificial Intelligence10.24963/ijcai.2024/135(1218-1226)Online publication date: 3-Aug-2024
  • (2024)Self-Quantization with Adaptive Codebooks for Unsupervised Image RetrievalPattern Recognition and Computer Vision10.1007/978-981-97-8792-0_38(546-560)Online publication date: 9-Nov-2024

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media