skip to main content
research-article

Probability Ordinal-Preserving Semantic Hashing for Large-Scale Image Retrieval

Authors Info & Claims
Published:21 April 2021Publication History
Skip Abstract Section

Abstract

Semantic hashing enables computation and memory-efficient image retrieval through learning similarity-preserving binary representations. Most existing hashing methods mainly focus on preserving the piecewise class information or pairwise correlations of samples into the learned binary codes while failing to capture the mutual triplet-level ordinal structure in similarity preservation. In this article, we propose a novel Probability Ordinal-preserving Semantic Hashing (POSH) framework, which for the first time defines the ordinal-preserving hashing concept under a non-parametric Bayesian theory. Specifically, we derive the whole learning framework of the ordinal similarity-preserving hashing based on the maximum posteriori estimation, where the probabilistic ordinal similarity preservation, probabilistic quantization function, and probabilistic semantic-preserving function are jointly considered into one unified learning framework. In particular, the proposed triplet-ordering correlation preservation scheme can effectively improve the interpretation of the learned hash codes under an economical anchor-induced asymmetric graph learning model. Moreover, the sparsity-guided selective quantization function is designed to minimize the loss of space transformation, and the regressive semantic function is explored to promote the flexibility of the formulated semantics in hash code learning. The final joint learning objective is formulated to concurrently preserve the ordinal locality of original data and explore potentials of semantics for producing discriminative hash codes. Importantly, an efficient alternating optimization algorithm with the strictly proof convergence guarantee is developed to solve the resulting objective problem. Extensive experiments on several large-scale datasets validate the superiority of the proposed method against state-of-the-art hashing-based retrieval methods.

References

  1. Alexandr Andoni, Piotr Indyk, Thijs Laarhoven, Ilya Razenshteyn, and Ludwig Schmidt. 2015. Practical and optimal LSH for angular distance. In Proceedings of the Neural Information Processing Systems. 1225–1233.Google ScholarGoogle Scholar
  2. Yue Cao, Mingsheng Long, Jianmin Wang, Han Zhu, and Qingfu Wen. 2016. Deep quantization network for efficient image retrieval. In Proceedings of the 13th AAAI Conference on Artificial Intelligence. 3457–3463.Google ScholarGoogle Scholar
  3. Yaxiong Chen and Xiaoqiang Lu. 2020. Deep category-level and regularized hashing with global semantic similarity learning. IEEE Transactions on Cybernetics. DOI:10.1109/TCYB.2020.2964993Google ScholarGoogle Scholar
  4. Tat-Seng Chua, Jinhui Tang, Richang Hong, Haojie Li, Zhiping Luo, and Yantao Zheng. 2009. NUS-WIDE: A real-world web image database from National University of Singapore. In Proceedings of the ACM International Conference on Image and Video Retrieval. ACM, 48.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Aristides Gionis, Piotr Indyk, Rajeev Motwani. 1999. Similarity search in high dimensions via hashing. In Proceedings of the 25th International Conference on Very Large Data Bases, Vol. 99. 518–529.Google ScholarGoogle Scholar
  6. Yunchao Gong, Svetlana Lazebnik, Albert Gordo, and Florent Perronnin. 2013. Iterative quantization: A procrustean approach to learning binary codes for large-scale image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence 35, 12 (2013), 2916–2929.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Jie Gui and Ping Li. 2018. RSDH: Robust rotated supervised discrete hashing. In Proceedings of the ACM Conference on Knowledge Discovery and Data Mining. ACM, 1485–1493.Google ScholarGoogle Scholar
  8. Jie Gui, Tongliang Liu, Zhenan Sun, and Tieniu Tan. 2018. Fast supervised discrete hashing. IEEE Transactions on Pattern Analysis and Machine Intelligence 40, 2 (2018), 490–496.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Jie Gui, Tongliang Liu, Zhenan Sun, Dacheng Tao, and Tieniu Tan. 2018. Supervised discrete hashing with relaxation. IEEE Transactions on Neural Networks and Learning Systems 29, 3 (2018), 608–617.Google ScholarGoogle ScholarCross RefCross Ref
  10. J. Guo, H. Chang, and W. Zhu. 2020. Preserving ordinal consensus: Towards feature selection for unlabeled data. In Proceedings of the AAAI Conference on Artificial Intelligence. 1–8.Google ScholarGoogle Scholar
  11. Ran He, Wei-Shi Zheng, Tieniu Tan, and Zhenan Sun. 2013. Half-quadratic-based iterative minimization for robust sparse representation. IEEE Transactions on Pattern Analysis and Machine Intelligence 36, 2 (2013), 261–275.Google ScholarGoogle Scholar
  12. Zihao Hu, Junxuan Chen, Hongtao Lu, and Tongzhen Zhang. 2017. Bayesian supervised hashing. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 6348–6355.Google ScholarGoogle ScholarCross RefCross Ref
  13. Qing-Yuan Jiang and Wu-Jun Li. 2015. Scalable graph hashing with feature transformation. In Proceedings of the International Joint Conference on Artificial Intelligence. 2248–2254.Google ScholarGoogle Scholar
  14. Zhongming Jin, Cheng Li, Yue Lin, and Deng Cai. 2014. Density sensitive hashing. IEEE Transactions on Cybernetics 44, 8 (2014), 1362–1371.Google ScholarGoogle ScholarCross RefCross Ref
  15. Wang-Cheng Kang, Wu-Jun Li, and Zhi-Hua Zhou. 2016. Column sampling based discrete supervised hashing. In Proceedings of the 13th AAAI Conference on Artificial Intelligence. 1230–1236.Google ScholarGoogle ScholarCross RefCross Ref
  16. Svebor Karaman, Xudong Lin, Xuefeng Hu, and Shih-Fu Chang. 2019. Unsupervised rank-preserving hashing for large-scale image retrieval. In Proceedings of the 2019 International Conference on Multimedia Retrieval. 192–196.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Benjamin Klein and Lior Wolf. 2019. End-to-end supervised product quantization for image search and retrieval. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 5041–5050.Google ScholarGoogle ScholarCross RefCross Ref
  18. Alex Krizhevsky and Geoffrey Hinton. 2009. Learning Multiple Layers of Features from Tiny Images. Technical Report. University of Toronto.Google ScholarGoogle Scholar
  19. Zhihui Lai, Yudong Chen, Jian Wu, Wai Keung Wong, and Fumin Shen. 2018. Jointly sparse hashing for image retrieval. IEEE Transactions on Image Processing 27, 12 (2018), 6147–6158.Google ScholarGoogle ScholarCross RefCross Ref
  20. Yann LeCun, Léon Bottou, Yoshua Bengio, and Patrick Haffner. 1998. Gradient-based learning applied to document recognition. Proceedings of IEEE 86, 11 (1998), 2278–2324.Google ScholarGoogle ScholarCross RefCross Ref
  21. Qi Li, Zhenan Sun, Ran He, and Tieniu Tan. 2017. Deep supervised discrete hashing. In Proceedings of the 31st Conference on Neural Information Processing Systems. 2482–2491.Google ScholarGoogle Scholar
  22. Wu-Jun Li, Sheng Wang, and Wang-Cheng Kang. 2016. Feature learning based deep supervised hashing with pairwise labels. In Proceedings of the International Joint Conference on Artificial Intelligence. 1711–1717.Google ScholarGoogle Scholar
  23. Guosheng Lin, Chunhua Shen, Qinfeng Shi, Anton Van den Hengel, and David Suter. 2014. Fast supervised hashing with decision trees for high-dimensional data. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 1963–1970.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Bin Liu, Yue Cao, Mingsheng Long, Jianmin Wang, and Jingdong Wang. 2018. Deep triplet quantization. In Proceedings of the 26th ACM International Conference on Multimedia. 755–763.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Hong Liu, Rongrong Ji, Jingdong Wang, and Chunhua Shen. 2017. Ordinal constraint binary coding for approximate nearest neighbor search. In Proceedings of the 31st AAAI Conference on Artificial Intelligence. ACM, 2238–2244.Google ScholarGoogle Scholar
  26. Tie-Yan Liu. 2009. Learning to rank for information retrieval. Foundations and Trends® in Information Retrieval 3, 3 (2009), 225–331.Google ScholarGoogle Scholar
  27. Wei Liu, Jun Wang, Rongrong Ji, Yu-Gang Jiang, and Shih-Fu Chang. 2012. Supervised hashing with kernels. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE, 2074–2081.Google ScholarGoogle Scholar
  28. Wei Liu, Jun Wang, Sanjiv Kumar, and Shih-Fu Chang. 2011. Hashing with graphs. In Proceedings of the 28th International Conference on Machine Learning. 1–8.Google ScholarGoogle Scholar
  29. Xingbo Liu, Xiushan Nie, Haoliang Sun, Chaoran Cui, and Yilong Yin. 2019. Supervised short-length hashing. In Proceedings of the 28th International Joint Conference on Artificial Intelligence. 3031–3037.Google ScholarGoogle ScholarCross RefCross Ref
  30. Xu Lu, Lei Zhu, Jingjing Li, Huaxiang Zhang, and Heng Tao Shen. 2019. Efficient supervised discrete multi-view hashing for large-scale multimedia search. IEEE Transactions on Multimedia 22, 8 (2020), 2048–2060. DOI:10.1109/TMM.2019.2947358Google ScholarGoogle ScholarCross RefCross Ref
  31. Zhi Lu, Yang Hu, Yunchao Jiang, Yan Chen, and Bing Zeng. 2019. Learning binary code for personalized fashion recommendation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 10562–10570.Google ScholarGoogle ScholarCross RefCross Ref
  32. Xin Luo, Liqiang Nie, Xiangnan He, Ye Wu, Zhen-Duo Chen, and Xin-Shun Xu. 2018. Fast scalable supervised hashing. In Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval. 735–744.Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Xin Luo, Peng-Fei Zhang, Zi Huang, Liqiang Nie, and Xin-Shun Xu. 2019. Discrete hashing with multiple supervision. IEEE Transactions on Image Processing 28, 6 (2019), 2962–2975.Google ScholarGoogle ScholarCross RefCross Ref
  34. Chao Ma, Chen Gong, Xiang Li, Xiaolin Huang, Wei Liu, and Jie Yang. 2020. Toward making unsupervised graph hashing discriminative. IEEE Transactions on Cybernetics 22, 3 (2020), 760–774.Google ScholarGoogle Scholar
  35. Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of Machine Learning Research 9, Nov (2008), 2579–2605.Google ScholarGoogle Scholar
  36. Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schütze. 2008. Introduction to Information Retrieval. Cambridge University Press.Google ScholarGoogle Scholar
  37. Feiping Nie, Heng Huang, Xiao Cai, and Chris H Ding. 2010. Efficient and robust feature selection via joint -norms minimization. In Proceedings of the 23rd International Conference on Neural Information Processing Systems. 1813–1821.Google ScholarGoogle Scholar
  38. Mila Nikolova and Raymond H. Chan. 2007. The equivalence of half-quadratic minimization and the gradient linearization iteration. IEEE Transactions on Image Processing 16, 6 (2007), 1623–1627.Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Fumin Shen, Yadong Mu, Yang Yang, Wei Liu, Li Liu, Jingkuan Song, and Heng Tao Shen Shen. 2017. Classification by retrieval: Binarizing data and classifier. In Proceedings of the ACM SIGIR Conference on Research and Development in Information Retrieval. 595–604.Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Fumin Shen, Chunhua Shen, Wei Liu, and Heng Tao Shen. 2015. Supervised discrete hashing. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 37–45.Google ScholarGoogle ScholarCross RefCross Ref
  41. Fumin Shen, Chunhua Shen, Qinfeng Shi, Anton van den Hengel, Zhenmin Tang, and Heng Tao Shen. 2015. Hashing on nonlinear manifolds. IEEE Transactions on Image Processing 24, 6 (2015), 1839–1851.Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Dan Shi, Lei Zhu, Yikun Li, Jingjing Li, and Xiushan Nie. 2020. Robust structured graph clustering. IEEE Transactions on Neural Networks and Learning Systems 31, 11 (2020), 4424–4436.Google ScholarGoogle ScholarCross RefCross Ref
  43. Jingkuan Song, Tao He, Lianli Gao, Xing Xu, Alan Hanjalic, and Heng Tao Shen. 2020. Unified binary generative adversarial network for image retrieval and compression. International Journal of Computer Vision 128 (2020), 2243–2264.Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. Jingdong Wang, Ting Zhang, Nicu Sebe, and Heng Tao Shen. 2018. A survey on learning to hash. IEEE Transactions on Pattern Analysis and Machine Intelligence 40, 4 (2018), 769–790.Google ScholarGoogle ScholarCross RefCross Ref
  45. Ke Wang and Xin Geng. 2019. Discrete binary coding based label distribution learning. In Proceedings of the 28th International Joint Conference on Artificial Intelligence. 3733–3739.Google ScholarGoogle ScholarCross RefCross Ref
  46. Yair Weiss, Antonio Torralba, and Rob Fergus. 2009. Spectral hashing. In Proceedings of the Neural Information Processing Systems. 1753–1760.Google ScholarGoogle Scholar
  47. Peichao Zhang, Wei Zhang, Wu-Jun Li, and Minyi Guo. 2014. Supervised hashing with latent factor models. In Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 173–182.Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. Zheng Zhang, Zhihui Lai, Zi Huang, Wai Keung Wong, Guo-Sen Xie, Li Liu, and Ling Shao. 2019. Scalable supervised asymmetric hashing with semantic and latent factor embedding. IEEE Transactions on Image Processing 28, 10 (2019), 4803–4818.Google ScholarGoogle ScholarCross RefCross Ref
  49. Zheng Zhang, Luyao Liu, Yadan Luo, Zi Huang, Fumin Shen, Heng Tao Shen, and Guangming Lu. 2020. Inductive structure consistent hashing via flexible semantic calibration. IEEE Transactions on Neural Networks and Learning Systems. DOI:10.1109/TNNLS.2020.3018790Google ScholarGoogle Scholar
  50. Zheng Zhang, Li Liu, Fumin Shen, Heng Tao Shen, and Ling Shao. 2019. Binary multi-view clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence 41, 7 (2019), 1774–1782.Google ScholarGoogle ScholarCross RefCross Ref
  51. Zheng Zhang, Guo-sen Xie, Yang Li, Sheng Li, and Zi Huang. 2019. SADIH: Semantic-aware discrete hashing. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 5853–5860.Google ScholarGoogle ScholarCross RefCross Ref
  52. Han Zhu, Mingsheng Long, Jianmin Wang, and Yue Cao. 2016. Deep hashing network for efficient similarity retrieval. In Proceedings of the 30th AAAI Conference on Artificial Intelligence. 2415–2421.Google ScholarGoogle Scholar
  53. Xiatian Zhu, Botong Wu, Dongcheng Huang, and Wei-Shi Zheng. 2018. Fast open-world person re-identification. IEEE Transactions on Image Processing 27, 5 (2018), 2286–2300.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Probability Ordinal-Preserving Semantic Hashing for Large-Scale Image Retrieval

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image ACM Transactions on Knowledge Discovery from Data
          ACM Transactions on Knowledge Discovery from Data  Volume 15, Issue 3
          June 2021
          533 pages
          ISSN:1556-4681
          EISSN:1556-472X
          DOI:10.1145/3454120
          Issue’s Table of Contents

          Copyright © 2021 Association for Computing Machinery.

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 21 April 2021
          • Accepted: 1 December 2020
          • Received: 1 August 2020
          Published in tkdd Volume 15, Issue 3

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Refereed

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        HTML Format

        View this article in HTML Format .

        View HTML Format