Abstract
Image retrieval is one of the most critical foundations for many content-based search applications. However, the image retrieval methods have to balance demands on both training accuracy and generalization effectiveness. In this paper, we propose a graph convolution network (GCN) to improve retrieval robustness by integrating the constructs of normalized residual network (NRN) model and feature dropout (FD) operations. The normalized residual networks use skip connection and normalize vectors in each layer to enhance the learning and strengthen the generalization ability. The feature dropout step randomly discards a portion of features in the network to prevent the model from overfitting. We tested our proposed model on several benchmark datasets and the experiment results showed an improvement of 1–3 mAP in comparison with the state-of-the-art Guided Similarity Separation (GSS) algorithm.
Similar content being viewed by others
References
He K, Lu Y, Sclaroff S (2018). Local descriptors optimized for average precision. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 596–605
Noh H, Araujo A, Sim J, Weyand T, Han B (2017). Large-scale image retrieval with attentive deep local features. In: Proceedings of the IEEE international conference on computer vision, pp. 3456–3465
Ono Y, Trulls E, Fua P, Yi K M (2018) Lf-net: learning local features from images. In: NeurIPS
Yang Fan, Hinami Ryota, Matsui Yusuke, Ly Steven, Satoh Shin’ichi (2019) Efficient image retrieval via decoupling diffusion into online and offline processing. In: Proceedings of the AAAI conference on artificial intelligence vol 33, pp. 9087–9094
Chen W, Chen J, Zou F, Li Y-F, Lu P, Zhao W (2019) Robustiq: a robust ann search method for billion-scale similarity search on gpus. In: Proceedings of the 2019 on international conference on multimedia retrieval, pp. 132–140
Kipf T N, Welling M (2017). Semi-supervised classification with graph convolutional networks. In: International conference on learning representations (ICLR)
Liu C, Yu G, Volkovs M, Chang C, Rai H, Ma J, Gorti S K (2019) Guided similarity separation for image retrieval. In: NeurIPS
Li Q, Han Z, Wu X-M (2018) Deeper insights into graph convolutional networks for semi-supervised learning. In: Thirty-second AAAI conference on artificial intelligence
Kalantidis Y, Mellina C, Osindero S (2016) Cross-dimensional weighting for aggregated deep convolutional features. In: European conference on computer vision, pp. 685–701. Springer
Gong Y, Wang L, Guo R, Lazebnik S (2014) Multi-scale orderless pooling of deep convolutional activation features. In: European conference on computer vision, pp. 392–407. Springer
Radenović F, Tolias G, Chum Ondřej (2018) Fine-tuning cnn image retrieval with no human annotation. IEEE Trans Pattern Anal Mach Intell 41(7):1655–1668
Tolias G, Sicre R, Jégou H (2015) Particular object retrieval with integral max-pooling of cnn activations. arXiv preprint arXiv:1511.05879
Kong X, Yang F, Wang Q, Ma H, Xiaodong W, Mao Gang (2020) A high generalizable feature extraction method using ensemble learning and deep auto-encoders for operational reliability assessment of bearings. Neural Process Lett 51(1):383–406
Gordo A, Almazán J, Revaud J, Larlus D (2016) Deep image retrieval: learning global representations for image search. In: European conference on computer vision, pp. 241–257. Springer
Gordo A, Almazan J, Revaud J, Larlus Diane (2017) End-to-end learning of deep visual representations for image retrieval. Int J Comput Vision 124(2):237–254
Mukundan A, Tolias G, Chum O (2019) Explicit spatial encoding for deep local descriptors. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 9394–9403
Revaud J, Weinzaepfel P, de Souza C R, Humenberger M (2019) R2D2: repeatable and reliable detector and descriptor. In: NeurIPS
Jegou H, Douze M, Schmid Cordelia (2010) Product quantization for nearest neighbor search. IEEE Trans Pattern Anal Mach Intell 33(1):117–128
Chum O, Philbin J, Sivic J, Isard M, Zisserman A (2007) Total recall: automatic query expansion with a generative feature model for object retrieval. In: 2007 IEEE 11th international conference on computer vision, pp. 1–8. IEEE
Chum O, Mikulik A, Perdoch M, Matas J (2011) Total recall ii: query expansion revisited. In: CVPR 2011, pp. 889–896. IEEE
Chum O, Matas J, Kittler J (2003) Locally optimized ransac. In: Joint pattern recognition symposium, pp. 236–243. Springer
Iscen A, Tolias G, Avrithis Y, Furon T, Chum O (2017) Efficient diffusion on region manifolds: recovering small objects with compact cnn representations. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2077–2086
Veličković P, Cucurull G, Casanova A, Romero A, Liò P, Bengio Y (2018) Graph attention networks. In: International conference on learning representations, accepted as poster
Schlichtkrull M, Kipf T N, Bloem P, Den Berg R Van, Titov I, Welling M (2018) Modeling relational data with graph convolutional networks. In: European semantic web conference, pp. 593–607. Springer
Xu K, Li C, Tian Y, Sonobe T, Kawarabayashi K-I, Jegelka S (2018) Representation learning on graphs with jumping knowledge networks. In: International conference on machine learning
Gilmer J, Schoenholz S S, Riley P F, Vinyals O, Dahl G E (2017) Neural message passing for quantum chemistry. In: Proceedings of the 34th international conference on machine learning (ICML), pp. 1263–1272
Li Y, Tarlow D, Brockschmidt M, Zemel R (2015) Gated graph sequence neural networks. In: International conference on learning representations (ICLR)
Hamilton W, Ying Z, Leskovec J (2017) Inductive representation learning on large graphs. Adv Neural Inf Process Syst 1024–1034
Chen J, Ma T, Xiao C (2018) FastGCN: fast learning with graph convolutional networks via importance sampling. In: International conference on learning representations
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770–778
Hinton G E, Srivastava N, Krizhevsky A, Sutskever I, Salakhutdinov R R (2012) Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580
Philbin J, Chum O, Isard M, Sivic J, Zisserman A (2007) Object retrieval with large vocabularies and fast spatial matching. In: 2007 IEEE conference on computer vision and pattern recognition, pp. 1–8. IEEE
Philbin J, Chum O, Isard M, Sivic J, Zisserman A (2008) Lost in quantization: improving particular object retrieval in large scale image databases. In: 2008 IEEE conference on computer vision and pattern recognition, pp. 1–8. IEEE
Radenović F, Iscen A, Tolias G, Avrithis Y, Chum O (2018) Revisiting oxford and paris: large-scale image retrieval benchmarking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 5706–5715
Tolias G, Avrithis Y, Jégou Hervé (2016) Image search with selective match kernels: aggregation across single and multiple images. Int J Comput Vision 116(3):247–261
Deng J, Guo J, Xue N, Zafeiriou S (2019) Arcface: additive angular margin loss for deep face recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4690–4699
Cao B, Araujo A, Sim J (2020) Unifying deep local and global features for image search. In: Proceedings of the ECCV
Weyand T, Araujo A, Cao B, Sim J (2020) Google landmarks dataset v2-a large-scale benchmark for instance-level recognition and retrieval. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2575–2584
Acknowledgements
We would like to thank the anonymous reviewers for their helpful remarks. This research was partially supported by the National Natural Science Foundation of China (NSFC) under grant No.61927801.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Du, X., Wan, L. & Shen, G. An Improved Graph Convolution Network for Robust Image Retrieval. Neural Process Lett 55, 5121–5141 (2023). https://doi.org/10.1007/s11063-022-11083-2
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11063-022-11083-2