Fusing Semantic Prior Based Deep Hashing Method for Fuzzy Image Retrieval

Gong, Xiaolong; Huang, Linpeng; Wang, Fuwei

doi:10.1007/978-3-319-97304-3_31

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11012))

Included in the following conference series:

Pacific Rim International Conference on Artificial Intelligence

3284 Accesses

Abstract

Fuzzy image retrieval is a novel visual application about designing a multi-modal retrieval system that supports querying across image modalities, e.g., a fuzzy type image searches for some similar images. However, most existing deep hashing methods are not suitable for obtaining a robust image hash code on multi-modal retrieval task. In this paper, we propose Fusing Semantic Prior based Deep Hashing (FSPDH) method, which is the first attempt to integrate unsupervised semantic prior into end-to-end deep architecture for fuzzy image retrieval task. The major contribution in this work is extracting the prior information from images and incorporating it effectively into hash learning process. In addition, our strategy can be usefully used in single-modal retrieval task. Extensive experiments show that our FSPDH approach yields state-of-the-art results in both multi-modal and single-modal image retrieval tasks on our image datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
https://www.taobao.com/.
2.
https://www.amazon.com/.
3.
In this paper, “data point” represents an image-image pair and “sample” represents an image for one modality.
4.
http://www.cs.toronto.edu/~kriz/cifar.html.
5.
http://lms.comp.nus.edu.sg/research/NUS-WIDE.htm.

References

Bu, S., Wang, L., Han, P., Liu, Z., Li, K.: 3D shape recognition and retrieval based on multi-modality deep learning. Neurocomputing 259, 183–193 (2017)
Article Google Scholar
Cao, Z., Long, M., Wang, J., Yang, Q.: Transitive hashing network for heterogeneous multimedia retrieval, pp. 81–87 (2017)
Google Scholar
Castrejon, L., Aytar, Y., Vondrick, C., Pirsiavash, H., Torralba, A.: Learning aligned cross-modal representations from weakly aligned data, pp. 2940–2949 (2016)
Google Scholar
Ding, G., Guo, Y., Zhou, J., Gao, Y.: Large-scale cross-modality search via collective matrix factorization hashing. IEEE Trans. Image Process. 25(11), 5427–5440 (2016)
Article MathSciNet Google Scholar
Dorfer, M., Schlüter, J., Vall, A., Korzeniowski, F., Widmer, G.: End-to-end cross-modality retrieval with CCA projections and pairwise ranking loss. arXiv preprint arXiv:1705.06979 (2017)
Drai-Zerbib, V., Baccino, T.: The effect of expertise in music reading: cross-modal competence. J. Eye Mov. Res. 6(5) (2014)
Google Scholar
Jiang, Q.Y., Li, W.J.: Deep cross-modal hashing. arXiv preprint arXiv:1602.02255 (2016)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks, pp. 1097–1105 (2012)
Google Scholar
Lai, H., Pan, Y., Liu, Y., Yan, S.: Simultaneous feature learning and hash coding with deep neural networks, pp. 3270–3278 (2015)
Google Scholar
Li, Y., Wang, R., Huang, Z., Shan, S., Chen, X.: Face video retrieval with image query via hashing across euclidean space and riemannian manifold, pp. 4758–4767 (2015)
Google Scholar
Lin, K., Yang, H.F., Hsiao, J.H., Chen, C.S.: Deep learning of binary hash codes for fast image retrieval, pp. 27–35 (2015)
Google Scholar
Liu, H., Wang, R., Shan, S., Chen, X.: Deep supervised hashing for fast image retrieval, pp. 2064–2072 (2016)
Google Scholar
Liu, H., Ji, R., Wu, Y., Hua, G.: Supervised matrix factorization for cross-modality hashing (2016)
Google Scholar
Masci, J., Bronstein, M.M., Bronstein, A.M., Schmidhuber, J.: Multimodal similarity-preserving hashing. IEEE Trans. Pattern Anal. Mach. Intell. 36(4), 824–830 (2014)
Article Google Scholar
Qu, W., Wang, D., Feng, S., Zhang, Y., Yu, G.: A novel cross-modal hashing algorithm based on multimodal deep learning. Sci. China Inf. Sci. 60(9), 092104 (2017)
Article Google Scholar
Song, J., Yang, Y., Yang, Y., Huang, Z., Shen, H.T.: Inter-media hashing for large-scale retrieval from heterogeneous data sources, pp. 785–796 (2013)
Google Scholar
Wang, D., Gao, X., Wang, X., He, L., Yuan, B.: Multimodal discriminative binary embedding for large-scale cross-modal retrieval. IEEE Trans. Image Process. 25(10), 4540–4554 (2016)
Article MathSciNet Google Scholar
Wang, W., Ooi, B.C., Yang, X., Zhang, D., Zhuang, Y.: Effective multi-modal retrieval based on stacked auto-encoders. Proc. VLDB Endow. 7(8), 649–660 (2014)
Article Google Scholar
Wang, W., Yang, X., Ooi, B.C., Zhang, D., Zhuang, Y.: Effective deep learning-based multi-modal retrieval. VLDB J. 25(1), 79–101 (2016)
Article Google Scholar
Weiss, Y., Torralba, A., Fergus, R.: Spectral hashing, pp. 1753–1760 (2009)
Google Scholar
Wu, B., Yang, Q., Zheng, W.S., Wang, Y., Wang, J.: Quantized correlation hashing for fast cross-modal search, pp. 3946–3952 (2015)
Google Scholar
Wu, P., Hoi, S.C., Zhao, P., Miao, C., Liu, Z.Y.: Online multi-modal distance metric learning with application to image retrieval. IEEE Trans. Knowl. Data Eng. 28(2), 454–467 (2016)
Article Google Scholar
Xia, R., Pan, Y., Lai, H., Liu, C., Yan, S.: Supervised hashing for image retrieval via image representation learning, vol. 1, pp. 2156–2162 (2014)
Google Scholar
Xia, S., Li, T., Ge, S., Dong, Z.: Efficient web video classification via cross-modality knowledge transferring, pp. 211–216 (2016)
Google Scholar
Zhao, F., Huang, Y., Wang, L., Tan, T.: Deep semantic ranking based hashing for multi-label image retrieval, pp. 1556–1564 (2015)
Google Scholar
Zhu, X., Huang, Z., Shen, H.T., Zhao, X.: Linear cross-modal hashing for efficient multimedia search, pp. 143–152 (2013)
Google Scholar

Download references

Acknowledgments

This work was partially supported by Shanghai Municipal Commission of Economy and Informatization (No. 201701052). We thank Zhongyi Zhou from Shanghai Jiao Tong University for his useful discussions and feedback.

Author information

Authors and Affiliations

Department of Computer Science, Shanghai Jiao Tong University, Shanghai, China
Xiaolong Gong, Linpeng Huang & Fuwei Wang

Authors

Xiaolong Gong
View author publications
You can also search for this author in PubMed Google Scholar
Linpeng Huang
View author publications
You can also search for this author in PubMed Google Scholar
Fuwei Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Linpeng Huang .

Editor information

Editors and Affiliations

Southeast University, Nanjing, China
Xin Geng
University of Tasmania, Hobart, Tasmania, Australia
Byeong-Ho Kang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gong, X., Huang, L., Wang, F. (2018). Fusing Semantic Prior Based Deep Hashing Method for Fuzzy Image Retrieval. In: Geng, X., Kang, BH. (eds) PRICAI 2018: Trends in Artificial Intelligence. PRICAI 2018. Lecture Notes in Computer Science(), vol 11012. Springer, Cham. https://doi.org/10.1007/978-3-319-97304-3_31

Download citation

DOI: https://doi.org/10.1007/978-3-319-97304-3_31
Published: 27 July 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-97303-6
Online ISBN: 978-3-319-97304-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics