Multi-semantic region weighting and multi-scale flatness weighting based image retrieval

Gu, Guanghua; Li, Zhuoyi; Feng, Linjing; Liu, Jiangtao; Lu, Huibin; Zhao, Yao

doi:10.1007/s00500-020-05565-5

Multi-semantic region weighting and multi-scale flatness weighting based image retrieval

Methodologies and Application
Published: 27 February 2021

Volume 25, pages 5699–5708, (2021)
Cite this article

Soft Computing Aims and scope Submit manuscript

Guanghua Gu^1,2,
Zhuoyi Li^1,2,
Linjing Feng^1,2,
Jiangtao Liu^1,2,
Huibin Lu^1,2 &
…
Yao Zhao³

214 Accesses
Explore all metrics

Abstract

The feature representation of images is the key to bridge the semantic gap and make computer understand images in the tasks of image retrieval. In image recognition, semantic regions and salient targets are irreplaceable parts of image cognition. The purpose of segmenting semantic regions is to analyze the connections between image targets, while some distinctive targets are able to highlight the important semantics. However, the semantic regions and salient targets were often ignored in previous retrieval methods. Considering the two aspects, this paper proposes an unsupervised image retrieval method based on multi-semantic region weighting and multi-scale flatness weighting. Firstly, we divide the semantic regions by using the Fully Convolutional Network and calculate the multi-Semantic weight map (S-mask) to obtain the global features. Secondly, we introduce a flatness-weighted strategy to weight feature maps and aggregate the multi-scale features to obtain the local features. Finally, we cascade the global features and the local features to construct the final image representation. There are two main contributions in this paper. One is that the S-mask assigns different weights to the semantic regions. It distinguishes the importance of the semantic regions and balances the weight within the semantic region. The other is that the flatness-weighted strategy suppresses the background and highlights the target region. Experimental results demonstrate that the proposed method achieves the state-of-the-art performance on Paris and Oxford databases.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Unsupervised semantic-based convolutional features aggregation for image retrieval

Article 28 November 2018

A multi-scale multi-level deep descriptor with saliency for image retrieval

Article 26 August 2022

Image Retrieval Using Multilayer Feature Aggregation Histogram

Article 27 August 2024

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

References

Arandjelovic R, Gronat P, Torii A, Pajdla T, Sivic J (2016) NetVLAD: CNN architecture for weakly supervised place recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5297–5307. https://doi.org/10.1109/TPAMI.2017.2711011
Azizpour H, Sharif Razavian A, Sullivan J, Maki A, Carlsson S (2015) From generic to specific deep representations for visual recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 36–45. https://doi.org/10.1109/CVPRW.2015.7301270
Babenko A, Slesarev A, Chigorin A, Lempitsky V (2014) Neural codes for image retrieval. In: European conference on computer vision. Springer, pp 584–599. https://doi.org/10.1007/978-3-319-10590-1_38
Chaudhuri B, Demir B, Bruzzone L, Chaudhuri S (2017) Multi-label remote sensing image retrieval using a semi-supervised graph-theoretic method. IEEE Trans Geosci Rem Sens 99(1):1. https://doi.org/10.1109/TGRS.2017.2760909
Article Google Scholar
Chum O, Philbin J, Sivic J, Isard M, Zisserman A (2007) Total recall: Automatic query expansion with a generative feature model for object retrieval. In: IEEE 11th international conference on computer vision. IEEE, pp 1–8. https://doi.org/10.1109/ICCV.2007.4408891
Fu J, Zheng H, Mei T (2017) Look closer to see better: Recurrent attention convolutional neural network for fine-grained image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4438–4446. https://doi.org/10.1109/CVPR.2017.476
Gong Y, Wang L, Guo R, Lazebnik S (2014) Multi-scale orderless pooling of deep convolutional activation features. In: European conference on computer vision. Springer, pp 392–407. https://doi.org/10.1007/978-3-319-10584-0_26
Iscen A, Tolias G, Avrithis Y, Furon T, Chum O (2017) Efficient diffusion on region manifolds: Recovering small objects with compact cnn representations. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2077–2086. https://doi.org/10.1109/CVPR.2017.105
Jégou H, Chum O (2012) Negative evidences and co-occurences in image retrieval: the benefit of PCA and whitening. European Conference on Computer Vision. https://doi.org/10.1007/978-3-642-33709-3_55
Jimenez A, Alvarez JM, Giro-i Nieto X (2017) Class-weighted convolutional features for visual instance search. https://doi.org/10.5244/C.31.144
Kalantidis Y, Mellina C, Osindero S (2016) Cross-dimensional weighting for aggregated deep convolutional features. In: European conference on computer vision. Springer, pp 685–701. https://doi.org/10.1007/978-3-319-46604-0_48
Krizhevsky A, Sutskever I, Hinton G (2012) ImageNet classification with deep convolutional neural networks. Adv Neural Inf Process Syst 25(2):1. https://doi.org/10.1145/3065386
Article Google Scholar
Li X, Yang J, Ma J (2020) Large scale category-structured image retrieval for object identification through supervised learning of CNN and SURF-based matching. IEEE Access 8:57796. https://doi.org/10.1109/ACCESS.2020.2982560
Article Google Scholar
Long J, Shelhamer E, Darrell T (2014) Fully convolutional networks for semantic segmentation. IEEE Trans Pattern Anal Mach Intell 39(4):640. https://doi.org/10.1109/TPAMI.2016.2572683
Article Google Scholar
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91. https://doi.org/10.1023/b:visi.0000029664.99615.94
Article Google Scholar
MacTavish K, Paton M, Barfoot TD (2017) Visual triage: a bag-of-words experience selector for long-term visual route following. In: IEEE international conference on robotics and automation (ICRA) (IEEE, 2017), pp 2065–2072. https://doi.org/10.1109/ICRA.2017.7989238
Manipoonchelvi P, Muneeswaran K (2014) Significant region-based image retrieval. Signal Image Video Process 9(8):1. https://doi.org/10.1007/s11760-014-0657-0
Article MATH Google Scholar
Mohedano E, McGuinness K, O’Connor NE, Salvador A, Marques F, Giro-i Nieto X (2016) Bags of local convolutional features for scalable instance search. In: Proceedings of the 2016 ACM on international conference on multimedia retrieval (ACM), pp 327–331. https://doi.org/10.1145/2911996.2912061
Ning Q, Zhu J, Zhong Z, Hoi SCH, Chen C (2017) Scalable Image Retrieval by Sparse Product Quantization. IEEE Trans Multimedia 19(3):586. https://doi.org/10.1109/TMM.2016.2625260
Article Google Scholar
Noh H, Araujo A, Sim J, Weyand T, Han B (2017) Large-scale image retrieval with attentive deep local features. In: Proceedings of the IEEE international conference on computer vision, pp 3456–3465. https://doi.org/10.1109/ICCV.2017.374
Philbin J, Chum O, Isard M, Sivic J, Zisserman A (2007) Object retrieval with large vocabularies and fast spatial matching. In: IEEE conference on computer vision and pattern recognition. IEEE, pp 1–8. https://doi.org/10.1109/CVPR.2007.383172
Philbin J, Chum O, Isard M, Sivic J, Zisserman A (2008) Lost in quantization: Improving particular object retrieval in large scale image databases. In: 2008 IEEE conference on computer vision and pattern recognition. IEEE, pp 1–8. https://doi.org/10.1109/CVPR.2008.4587635
Portaz M, Kohl M, Quénot G, Chevallet JP (2018) Fully convolutional network and region proposal for instance identification with egocentric vision. In: IEEE international conference on computer vision workshop. https://doi.org/10.1109/ICCVW.2017.281
Radenović F, Iscen A, Tolias G, Avrithis Y, Chum O (2018) Revisiting oxford and paris: Large-scale image retrieval benchmarking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5706–5715. https://doi.org/10.1109/CVPR.2018.00598
Radenović F, Tolias G, Chum O, CNN image retrieval learns from BoW: unsupervised fine-tuning with hard examples. In: European conference on computer vision. Springer, pp 3–20. https://doi.org/10.1007/978-3-319-46448-0_1
Razavian AS, Sullivan J, Carlsson S, Maki A (2016) Visual instance retrieval with deep convolutional networks. ITE Trans Media Technol Appl 4(3):251
Article Google Scholar
Shao Z, Zhou W, Deng X, Zhang M, Cheng Q (2020) Multilabel remote sensing image retrieval based on fully convolutional network. IEEE J Sel Top Appl Earth Observ Rem Sens 13(1):318. https://doi.org/10.1109/JSTARS.2019.2961634
Article Google Scholar
Sharif Razavian A, Azizpour H, Sullivan J, Carlsson S (2014) CNN features off-the-shelf: an astounding baseline for recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 806–813. https://doi.org/10.1109/CVPRW.2014.131
Spyromitros-Xioufis E, Papadopoulos S, Kompatsiaris IY, Tsoumakas G, Vlahavas I (2014) A comprehensive study over VLAD and product quantization in large-scale image retrieval. IEEE Trans Multimed 16(6):1713. https://doi.org/10.1109/tmm.2014.2329648
Article Google Scholar
Sundararajan SK, Sankaragomathi B, Priya DS (2019) Deep belief CNN feature representation based content based image retrieval for medical images. J Med Syst 43(6):1. https://doi.org/10.1007/s10916-019-1305-6
Article Google Scholar
Tolias G, Jégou H (2014) Visual query expansion with or without geometry: refining local descriptors by feature aggregation. Pattern Recogn 47(10):3466. https://doi.org/10.1016/j.patcog.2014.04.007
Article Google Scholar
Tolias G, Sicre R, Jégou H (2015) Particular object retrieval with integral max-pooling of CNN activations. arXiv preprint arXiv:1511.05879
Wang Q, Feng G, Wang Y, Duan LY (2016) Adaptive weighted matching of deep convolutional features for painting retrieval. In: IEEE 2nd international conference on multimedia big data (BigMM). https://doi.org/10.1109/BigMM.2016.69
Wu HC, Luk RWP, Wong KF, Kwok KL (2008) Interpreting TF-IDF term weights as making relevance decisions. ACM Trans Inf Syst (TOIS) 26(3):13. https://doi.org/10.1145/1361684.1361686
Xie L, Hong R, Bo Z, Qi T (2015) Image classification and retrieval are one. In the 5th ACM. https://doi.org/10.1145/2671188.2749289
Yandex AB, Lempitsky V (2015) Aggregating local deep features for image retrieval. In: IEEE international conference on computer vision (ICCV). https://doi.org/10.1109/ICCV.2015.150
Zhang G, Zeng Z, Zhang S, Zhang Y, Wu W (2017) SIFT matching with CNN evidences for particular object retrieval. Neurocomputing 238(238):399. https://doi.org/10.1016/j.neucom.2017.01.081
Article Google Scholar
Zhong Z, Zhu J, Hoi SC (2015) Fast object retrieval using direct spatial matching. IEEE Trans Multimedia 17(8):1391. https://doi.org/10.1109/TMM.2015.2446201
Article Google Scholar
Zhou B, Khosla A, Lapedriza A, Oliva A, Torralba A (2016) Learning deep features for discriminative localization. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2921–2929. https://doi.org/10.1109/CVPR.2016.319
Zongmin L, Xiuxiu L, Yujie L, Hua L (2019) Sketch-based image retrieval based on fine-grained feature and deep convolutional neural network. J Image Gr

Download references

Acknowledgements

The authors would like to thank the anonymous reviewers for valuable comments. This work was partly supported by National Natural Science Foundation of China (No.62072394), Natural Science Foundation of Hebei province (F2017203169) and Key Scientific Research Projects of Colleges and Universities in Hebei province (ZD2017080).

Author information

Authors and Affiliations

School of Information Science and Engineering, Yanshan University, Qinhuangdao City, China
Guanghua Gu, Zhuoyi Li, Linjing Feng, Jiangtao Liu & Huibin Lu
Hebei Key Laboratory of Information Transmission and Signal Processing, Hebei Province, China
Guanghua Gu, Zhuoyi Li, Linjing Feng, Jiangtao Liu & Huibin Lu
Institute of Information Science, Beijing Jiaotong University, Beijing, China
Yao Zhao

Authors

Guanghua Gu
View author publications
You can also search for this author inPubMed Google Scholar
Zhuoyi Li
View author publications
You can also search for this author inPubMed Google Scholar
Linjing Feng
View author publications
You can also search for this author inPubMed Google Scholar
Jiangtao Liu
View author publications
You can also search for this author inPubMed Google Scholar
Huibin Lu
View author publications
You can also search for this author inPubMed Google Scholar
Yao Zhao
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Huibin Lu.

Ethics declarations

Conflict of interest

We declare that we have no conflict of interest.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gu, G., Li, Z., Feng, L. et al. Multi-semantic region weighting and multi-scale flatness weighting based image retrieval. Soft Comput 25, 5699–5708 (2021). https://doi.org/10.1007/s00500-020-05565-5

Download citation

Published: 27 February 2021
Issue Date: April 2021
DOI: https://doi.org/10.1007/s00500-020-05565-5

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multi-semantic region weighting and multi-scale flatness weighting based image retrieval

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Unsupervised semantic-based convolutional features aggregation for image retrieval

A multi-scale multi-level deep descriptor with saliency for image retrieval

Image Retrieval Using Multilayer Feature Aggregation Histogram

Explore related subjects

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now