An Improved Graph Convolution Network for Robust Image Retrieval

Du, Xinwei; Wan, Lin; Shen, Gang

doi:10.1007/s11063-022-11083-2

An Improved Graph Convolution Network for Robust Image Retrieval

Published: 03 November 2022

Volume 55, pages 5121–5141, (2023)
Cite this article

Neural Processing Letters Aims and scope Submit manuscript

249 Accesses
Explore all metrics

Abstract

Image retrieval is one of the most critical foundations for many content-based search applications. However, the image retrieval methods have to balance demands on both training accuracy and generalization effectiveness. In this paper, we propose a graph convolution network (GCN) to improve retrieval robustness by integrating the constructs of normalized residual network (NRN) model and feature dropout (FD) operations. The normalized residual networks use skip connection and normalize vectors in each layer to enhance the learning and strengthen the generalization ability. The feature dropout step randomly discards a portion of features in the network to prevent the model from overfitting. We tested our proposed model on several benchmark datasets and the experiment results showed an improvement of 1–3 mAP in comparison with the state-of-the-art Guided Similarity Separation (GSS) algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Exploring geometric information in CNN for image retrieval

Article 23 July 2018

End-to-End Large-Scale Image Retrieval Network with Convolution and Vision Transformers

LDGC-Net: learnable descriptor graph convolutional network for image retrieval

Article 27 December 2022

Notes

http://cmp.felk.cvut.cz/revisitop/data/features/.

References

He K, Lu Y, Sclaroff S (2018). Local descriptors optimized for average precision. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 596–605
Noh H, Araujo A, Sim J, Weyand T, Han B (2017). Large-scale image retrieval with attentive deep local features. In: Proceedings of the IEEE international conference on computer vision, pp. 3456–3465
Ono Y, Trulls E, Fua P, Yi K M (2018) Lf-net: learning local features from images. In: NeurIPS
Yang Fan, Hinami Ryota, Matsui Yusuke, Ly Steven, Satoh Shin’ichi (2019) Efficient image retrieval via decoupling diffusion into online and offline processing. In: Proceedings of the AAAI conference on artificial intelligence vol 33, pp. 9087–9094
Chen W, Chen J, Zou F, Li Y-F, Lu P, Zhao W (2019) Robustiq: a robust ann search method for billion-scale similarity search on gpus. In: Proceedings of the 2019 on international conference on multimedia retrieval, pp. 132–140
Kipf T N, Welling M (2017). Semi-supervised classification with graph convolutional networks. In: International conference on learning representations (ICLR)
Liu C, Yu G, Volkovs M, Chang C, Rai H, Ma J, Gorti S K (2019) Guided similarity separation for image retrieval. In: NeurIPS
Li Q, Han Z, Wu X-M (2018) Deeper insights into graph convolutional networks for semi-supervised learning. In: Thirty-second AAAI conference on artificial intelligence
Kalantidis Y, Mellina C, Osindero S (2016) Cross-dimensional weighting for aggregated deep convolutional features. In: European conference on computer vision, pp. 685–701. Springer
Gong Y, Wang L, Guo R, Lazebnik S (2014) Multi-scale orderless pooling of deep convolutional activation features. In: European conference on computer vision, pp. 392–407. Springer
Radenović F, Tolias G, Chum Ondřej (2018) Fine-tuning cnn image retrieval with no human annotation. IEEE Trans Pattern Anal Mach Intell 41(7):1655–1668
Article Google Scholar
Tolias G, Sicre R, Jégou H (2015) Particular object retrieval with integral max-pooling of cnn activations. arXiv preprint arXiv:1511.05879
Kong X, Yang F, Wang Q, Ma H, Xiaodong W, Mao Gang (2020) A high generalizable feature extraction method using ensemble learning and deep auto-encoders for operational reliability assessment of bearings. Neural Process Lett 51(1):383–406
Article Google Scholar
Gordo A, Almazán J, Revaud J, Larlus D (2016) Deep image retrieval: learning global representations for image search. In: European conference on computer vision, pp. 241–257. Springer
Gordo A, Almazan J, Revaud J, Larlus Diane (2017) End-to-end learning of deep visual representations for image retrieval. Int J Comput Vision 124(2):237–254
Article MathSciNet Google Scholar
Mukundan A, Tolias G, Chum O (2019) Explicit spatial encoding for deep local descriptors. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 9394–9403
Revaud J, Weinzaepfel P, de Souza C R, Humenberger M (2019) R2D2: repeatable and reliable detector and descriptor. In: NeurIPS
Jegou H, Douze M, Schmid Cordelia (2010) Product quantization for nearest neighbor search. IEEE Trans Pattern Anal Mach Intell 33(1):117–128
Article Google Scholar
Chum O, Philbin J, Sivic J, Isard M, Zisserman A (2007) Total recall: automatic query expansion with a generative feature model for object retrieval. In: 2007 IEEE 11th international conference on computer vision, pp. 1–8. IEEE
Chum O, Mikulik A, Perdoch M, Matas J (2011) Total recall ii: query expansion revisited. In: CVPR 2011, pp. 889–896. IEEE
Chum O, Matas J, Kittler J (2003) Locally optimized ransac. In: Joint pattern recognition symposium, pp. 236–243. Springer
Iscen A, Tolias G, Avrithis Y, Furon T, Chum O (2017) Efficient diffusion on region manifolds: recovering small objects with compact cnn representations. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2077–2086
Veličković P, Cucurull G, Casanova A, Romero A, Liò P, Bengio Y (2018) Graph attention networks. In: International conference on learning representations, accepted as poster
Schlichtkrull M, Kipf T N, Bloem P, Den Berg R Van, Titov I, Welling M (2018) Modeling relational data with graph convolutional networks. In: European semantic web conference, pp. 593–607. Springer
Xu K, Li C, Tian Y, Sonobe T, Kawarabayashi K-I, Jegelka S (2018) Representation learning on graphs with jumping knowledge networks. In: International conference on machine learning
Gilmer J, Schoenholz S S, Riley P F, Vinyals O, Dahl G E (2017) Neural message passing for quantum chemistry. In: Proceedings of the 34th international conference on machine learning (ICML), pp. 1263–1272
Li Y, Tarlow D, Brockschmidt M, Zemel R (2015) Gated graph sequence neural networks. In: International conference on learning representations (ICLR)
Hamilton W, Ying Z, Leskovec J (2017) Inductive representation learning on large graphs. Adv Neural Inf Process Syst 1024–1034
Chen J, Ma T, Xiao C (2018) FastGCN: fast learning with graph convolutional networks via importance sampling. In: International conference on learning representations
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770–778
Hinton G E, Srivastava N, Krizhevsky A, Sutskever I, Salakhutdinov R R (2012) Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580
Philbin J, Chum O, Isard M, Sivic J, Zisserman A (2007) Object retrieval with large vocabularies and fast spatial matching. In: 2007 IEEE conference on computer vision and pattern recognition, pp. 1–8. IEEE
Philbin J, Chum O, Isard M, Sivic J, Zisserman A (2008) Lost in quantization: improving particular object retrieval in large scale image databases. In: 2008 IEEE conference on computer vision and pattern recognition, pp. 1–8. IEEE
Radenović F, Iscen A, Tolias G, Avrithis Y, Chum O (2018) Revisiting oxford and paris: large-scale image retrieval benchmarking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 5706–5715
Tolias G, Avrithis Y, Jégou Hervé (2016) Image search with selective match kernels: aggregation across single and multiple images. Int J Comput Vision 116(3):247–261
Article MathSciNet Google Scholar
Deng J, Guo J, Xue N, Zafeiriou S (2019) Arcface: additive angular margin loss for deep face recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4690–4699
Cao B, Araujo A, Sim J (2020) Unifying deep local and global features for image search. In: Proceedings of the ECCV
Weyand T, Araujo A, Cao B, Sim J (2020) Google landmarks dataset v2-a large-scale benchmark for instance-level recognition and retrieval. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2575–2584

Download references

Acknowledgements

We would like to thank the anonymous reviewers for their helpful remarks. This research was partially supported by the National Natural Science Foundation of China (NSFC) under grant No.61927801.

Author information

Authors and Affiliations

School of Software, Huazhong University of Science and Technology, Wuhan, 430074, China
Xinwei Du, Lin Wan & Gang Shen

Authors

Xinwei Du
View author publications
You can also search for this author in PubMed Google Scholar
Lin Wan
View author publications
You can also search for this author in PubMed Google Scholar
Gang Shen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Lin Wan.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Du, X., Wan, L. & Shen, G. An Improved Graph Convolution Network for Robust Image Retrieval. Neural Process Lett 55, 5121–5141 (2023). https://doi.org/10.1007/s11063-022-11083-2

Download citation

Accepted: 21 October 2022
Published: 03 November 2022
Issue Date: August 2023
DOI: https://doi.org/10.1007/s11063-022-11083-2

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An Improved Graph Convolution Network for Robust Image Retrieval

Abstract

Access this article

Similar content being viewed by others

Exploring geometric information in CNN for image retrieval

End-to-End Large-Scale Image Retrieval Network with Convolution and Vision Transformers

LDGC-Net: learnable descriptor graph convolutional network for image retrieval

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

An Improved Graph Convolution Network for Robust Image Retrieval

Abstract

Access this article

Similar content being viewed by others

Exploring geometric information in CNN for image retrieval

End-to-End Large-Scale Image Retrieval Network with Convolution and Vision Transformers

LDGC-Net: learnable descriptor graph convolutional network for image retrieval

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation