Skip to main content

Fusing Semantic Prior Based Deep Hashing Method for Fuzzy Image Retrieval

  • Conference paper
  • First Online:
PRICAI 2018: Trends in Artificial Intelligence (PRICAI 2018)

Abstract

Fuzzy image retrieval is a novel visual application about designing a multi-modal retrieval system that supports querying across image modalities, e.g., a fuzzy type image searches for some similar images. However, most existing deep hashing methods are not suitable for obtaining a robust image hash code on multi-modal retrieval task. In this paper, we propose Fusing Semantic Prior based Deep Hashing (FSPDH) method, which is the first attempt to integrate unsupervised semantic prior into end-to-end deep architecture for fuzzy image retrieval task. The major contribution in this work is extracting the prior information from images and incorporating it effectively into hash learning process. In addition, our strategy can be usefully used in single-modal retrieval task. Extensive experiments show that our FSPDH approach yields state-of-the-art results in both multi-modal and single-modal image retrieval tasks on our image datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.taobao.com/.

  2. 2.

    https://www.amazon.com/.

  3. 3.

    In this paper, “data point” represents an image-image pair and “sample” represents an image for one modality.

  4. 4.

    http://www.cs.toronto.edu/~kriz/cifar.html.

  5. 5.

    http://lms.comp.nus.edu.sg/research/NUS-WIDE.htm.

References

  1. Bu, S., Wang, L., Han, P., Liu, Z., Li, K.: 3D shape recognition and retrieval based on multi-modality deep learning. Neurocomputing 259, 183–193 (2017)

    Article  Google Scholar 

  2. Cao, Z., Long, M., Wang, J., Yang, Q.: Transitive hashing network for heterogeneous multimedia retrieval, pp. 81–87 (2017)

    Google Scholar 

  3. Castrejon, L., Aytar, Y., Vondrick, C., Pirsiavash, H., Torralba, A.: Learning aligned cross-modal representations from weakly aligned data, pp. 2940–2949 (2016)

    Google Scholar 

  4. Ding, G., Guo, Y., Zhou, J., Gao, Y.: Large-scale cross-modality search via collective matrix factorization hashing. IEEE Trans. Image Process. 25(11), 5427–5440 (2016)

    Article  MathSciNet  Google Scholar 

  5. Dorfer, M., Schlüter, J., Vall, A., Korzeniowski, F., Widmer, G.: End-to-end cross-modality retrieval with CCA projections and pairwise ranking loss. arXiv preprint arXiv:1705.06979 (2017)

  6. Drai-Zerbib, V., Baccino, T.: The effect of expertise in music reading: cross-modal competence. J. Eye Mov. Res. 6(5) (2014)

    Google Scholar 

  7. Jiang, Q.Y., Li, W.J.: Deep cross-modal hashing. arXiv preprint arXiv:1602.02255 (2016)

  8. Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks, pp. 1097–1105 (2012)

    Google Scholar 

  9. Lai, H., Pan, Y., Liu, Y., Yan, S.: Simultaneous feature learning and hash coding with deep neural networks, pp. 3270–3278 (2015)

    Google Scholar 

  10. Li, Y., Wang, R., Huang, Z., Shan, S., Chen, X.: Face video retrieval with image query via hashing across euclidean space and riemannian manifold, pp. 4758–4767 (2015)

    Google Scholar 

  11. Lin, K., Yang, H.F., Hsiao, J.H., Chen, C.S.: Deep learning of binary hash codes for fast image retrieval, pp. 27–35 (2015)

    Google Scholar 

  12. Liu, H., Wang, R., Shan, S., Chen, X.: Deep supervised hashing for fast image retrieval, pp. 2064–2072 (2016)

    Google Scholar 

  13. Liu, H., Ji, R., Wu, Y., Hua, G.: Supervised matrix factorization for cross-modality hashing (2016)

    Google Scholar 

  14. Masci, J., Bronstein, M.M., Bronstein, A.M., Schmidhuber, J.: Multimodal similarity-preserving hashing. IEEE Trans. Pattern Anal. Mach. Intell. 36(4), 824–830 (2014)

    Article  Google Scholar 

  15. Qu, W., Wang, D., Feng, S., Zhang, Y., Yu, G.: A novel cross-modal hashing algorithm based on multimodal deep learning. Sci. China Inf. Sci. 60(9), 092104 (2017)

    Article  Google Scholar 

  16. Song, J., Yang, Y., Yang, Y., Huang, Z., Shen, H.T.: Inter-media hashing for large-scale retrieval from heterogeneous data sources, pp. 785–796 (2013)

    Google Scholar 

  17. Wang, D., Gao, X., Wang, X., He, L., Yuan, B.: Multimodal discriminative binary embedding for large-scale cross-modal retrieval. IEEE Trans. Image Process. 25(10), 4540–4554 (2016)

    Article  MathSciNet  Google Scholar 

  18. Wang, W., Ooi, B.C., Yang, X., Zhang, D., Zhuang, Y.: Effective multi-modal retrieval based on stacked auto-encoders. Proc. VLDB Endow. 7(8), 649–660 (2014)

    Article  Google Scholar 

  19. Wang, W., Yang, X., Ooi, B.C., Zhang, D., Zhuang, Y.: Effective deep learning-based multi-modal retrieval. VLDB J. 25(1), 79–101 (2016)

    Article  Google Scholar 

  20. Weiss, Y., Torralba, A., Fergus, R.: Spectral hashing, pp. 1753–1760 (2009)

    Google Scholar 

  21. Wu, B., Yang, Q., Zheng, W.S., Wang, Y., Wang, J.: Quantized correlation hashing for fast cross-modal search, pp. 3946–3952 (2015)

    Google Scholar 

  22. Wu, P., Hoi, S.C., Zhao, P., Miao, C., Liu, Z.Y.: Online multi-modal distance metric learning with application to image retrieval. IEEE Trans. Knowl. Data Eng. 28(2), 454–467 (2016)

    Article  Google Scholar 

  23. Xia, R., Pan, Y., Lai, H., Liu, C., Yan, S.: Supervised hashing for image retrieval via image representation learning, vol. 1, pp. 2156–2162 (2014)

    Google Scholar 

  24. Xia, S., Li, T., Ge, S., Dong, Z.: Efficient web video classification via cross-modality knowledge transferring, pp. 211–216 (2016)

    Google Scholar 

  25. Zhao, F., Huang, Y., Wang, L., Tan, T.: Deep semantic ranking based hashing for multi-label image retrieval, pp. 1556–1564 (2015)

    Google Scholar 

  26. Zhu, X., Huang, Z., Shen, H.T., Zhao, X.: Linear cross-modal hashing for efficient multimedia search, pp. 143–152 (2013)

    Google Scholar 

Download references

Acknowledgments

This work was partially supported by Shanghai Municipal Commission of Economy and Informatization (No. 201701052). We thank Zhongyi Zhou from Shanghai Jiao Tong University for his useful discussions and feedback.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Linpeng Huang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Gong, X., Huang, L., Wang, F. (2018). Fusing Semantic Prior Based Deep Hashing Method for Fuzzy Image Retrieval. In: Geng, X., Kang, BH. (eds) PRICAI 2018: Trends in Artificial Intelligence. PRICAI 2018. Lecture Notes in Computer Science(), vol 11012. Springer, Cham. https://doi.org/10.1007/978-3-319-97304-3_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-97304-3_31

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-97303-6

  • Online ISBN: 978-3-319-97304-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics