Skip to main content
Log in

An Improved Graph Convolution Network for Robust Image Retrieval

  • Published:
Neural Processing Letters Aims and scope Submit manuscript

Abstract

Image retrieval is one of the most critical foundations for many content-based search applications. However, the image retrieval methods have to balance demands on both training accuracy and generalization effectiveness. In this paper, we propose a graph convolution network (GCN) to improve retrieval robustness by integrating the constructs of normalized residual network (NRN) model and feature dropout (FD) operations. The normalized residual networks use skip connection and normalize vectors in each layer to enhance the learning and strengthen the generalization ability. The feature dropout step randomly discards a portion of features in the network to prevent the model from overfitting. We tested our proposed model on several benchmark datasets and the experiment results showed an improvement of 1–3 mAP in comparison with the state-of-the-art Guided Similarity Separation (GSS) algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

Notes

  1. http://cmp.felk.cvut.cz/revisitop/data/features/.

References

  1. He K, Lu Y, Sclaroff S (2018). Local descriptors optimized for average precision. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 596–605

  2. Noh H, Araujo A, Sim J, Weyand T, Han B (2017). Large-scale image retrieval with attentive deep local features. In: Proceedings of the IEEE international conference on computer vision, pp. 3456–3465

  3. Ono Y, Trulls E, Fua P, Yi K M (2018) Lf-net: learning local features from images. In: NeurIPS

  4. Yang Fan, Hinami Ryota, Matsui Yusuke, Ly Steven, Satoh Shin’ichi (2019) Efficient image retrieval via decoupling diffusion into online and offline processing. In: Proceedings of the AAAI conference on artificial intelligence vol 33, pp. 9087–9094

  5. Chen W, Chen J, Zou F, Li Y-F, Lu P, Zhao W (2019) Robustiq: a robust ann search method for billion-scale similarity search on gpus. In: Proceedings of the 2019 on international conference on multimedia retrieval, pp. 132–140

  6. Kipf T N, Welling M (2017). Semi-supervised classification with graph convolutional networks. In: International conference on learning representations (ICLR)

  7. Liu C, Yu G, Volkovs M, Chang C, Rai H, Ma J, Gorti S K (2019) Guided similarity separation for image retrieval. In: NeurIPS

  8. Li Q, Han Z, Wu X-M (2018) Deeper insights into graph convolutional networks for semi-supervised learning. In: Thirty-second AAAI conference on artificial intelligence

  9. Kalantidis Y, Mellina C, Osindero S (2016) Cross-dimensional weighting for aggregated deep convolutional features. In: European conference on computer vision, pp. 685–701. Springer

  10. Gong Y, Wang L, Guo R, Lazebnik S (2014) Multi-scale orderless pooling of deep convolutional activation features. In: European conference on computer vision, pp. 392–407. Springer

  11. Radenović F, Tolias G, Chum Ondřej (2018) Fine-tuning cnn image retrieval with no human annotation. IEEE Trans Pattern Anal Mach Intell 41(7):1655–1668

    Article  Google Scholar 

  12. Tolias G, Sicre R, Jégou H (2015) Particular object retrieval with integral max-pooling of cnn activations. arXiv preprint arXiv:1511.05879

  13. Kong X, Yang F, Wang Q, Ma H, Xiaodong W, Mao Gang (2020) A high generalizable feature extraction method using ensemble learning and deep auto-encoders for operational reliability assessment of bearings. Neural Process Lett 51(1):383–406

    Article  Google Scholar 

  14. Gordo A, Almazán J, Revaud J, Larlus D (2016) Deep image retrieval: learning global representations for image search. In: European conference on computer vision, pp. 241–257. Springer

  15. Gordo A, Almazan J, Revaud J, Larlus Diane (2017) End-to-end learning of deep visual representations for image retrieval. Int J Comput Vision 124(2):237–254

    Article  MathSciNet  Google Scholar 

  16. Mukundan A, Tolias G, Chum O (2019) Explicit spatial encoding for deep local descriptors. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 9394–9403

  17. Revaud J, Weinzaepfel P, de Souza C R, Humenberger M (2019) R2D2: repeatable and reliable detector and descriptor. In: NeurIPS

  18. Jegou H, Douze M, Schmid Cordelia (2010) Product quantization for nearest neighbor search. IEEE Trans Pattern Anal Mach Intell 33(1):117–128

    Article  Google Scholar 

  19. Chum O, Philbin J, Sivic J, Isard M, Zisserman A (2007) Total recall: automatic query expansion with a generative feature model for object retrieval. In: 2007 IEEE 11th international conference on computer vision, pp. 1–8. IEEE

  20. Chum O, Mikulik A, Perdoch M, Matas J (2011) Total recall ii: query expansion revisited. In: CVPR 2011, pp. 889–896. IEEE

  21. Chum O, Matas J, Kittler J (2003) Locally optimized ransac. In: Joint pattern recognition symposium, pp. 236–243. Springer

  22. Iscen A, Tolias G, Avrithis Y, Furon T, Chum O (2017) Efficient diffusion on region manifolds: recovering small objects with compact cnn representations. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2077–2086

  23. Veličković P, Cucurull G, Casanova A, Romero A, Liò P, Bengio Y (2018) Graph attention networks. In: International conference on learning representations, accepted as poster

  24. Schlichtkrull M, Kipf T N, Bloem P, Den Berg R Van, Titov I, Welling M (2018) Modeling relational data with graph convolutional networks. In: European semantic web conference, pp. 593–607. Springer

  25. Xu K, Li C, Tian Y, Sonobe T, Kawarabayashi K-I, Jegelka S (2018) Representation learning on graphs with jumping knowledge networks. In: International conference on machine learning

  26. Gilmer J, Schoenholz S S, Riley P F, Vinyals O, Dahl G E (2017) Neural message passing for quantum chemistry. In: Proceedings of the 34th international conference on machine learning (ICML), pp. 1263–1272

  27. Li Y, Tarlow D, Brockschmidt M, Zemel R (2015) Gated graph sequence neural networks. In: International conference on learning representations (ICLR)

  28. Hamilton W, Ying Z, Leskovec J (2017) Inductive representation learning on large graphs. Adv Neural Inf Process Syst 1024–1034

  29. Chen J, Ma T, Xiao C (2018) FastGCN: fast learning with graph convolutional networks via importance sampling. In: International conference on learning representations

  30. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770–778

  31. Hinton G E, Srivastava N, Krizhevsky A, Sutskever I, Salakhutdinov R R (2012) Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580

  32. Philbin J, Chum O, Isard M, Sivic J, Zisserman A (2007) Object retrieval with large vocabularies and fast spatial matching. In: 2007 IEEE conference on computer vision and pattern recognition, pp. 1–8. IEEE

  33. Philbin J, Chum O, Isard M, Sivic J, Zisserman A (2008) Lost in quantization: improving particular object retrieval in large scale image databases. In: 2008 IEEE conference on computer vision and pattern recognition, pp. 1–8. IEEE

  34. Radenović F, Iscen A, Tolias G, Avrithis Y, Chum O (2018) Revisiting oxford and paris: large-scale image retrieval benchmarking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 5706–5715

  35. Tolias G, Avrithis Y, Jégou Hervé (2016) Image search with selective match kernels: aggregation across single and multiple images. Int J Comput Vision 116(3):247–261

    Article  MathSciNet  Google Scholar 

  36. Deng J, Guo J, Xue N, Zafeiriou S (2019) Arcface: additive angular margin loss for deep face recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4690–4699

  37. Cao B, Araujo A, Sim J (2020) Unifying deep local and global features for image search. In: Proceedings of the ECCV

  38. Weyand T, Araujo A, Cao B, Sim J (2020) Google landmarks dataset v2-a large-scale benchmark for instance-level recognition and retrieval. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2575–2584

Download references

Acknowledgements

We would like to thank the anonymous reviewers for their helpful remarks. This research was partially supported by the National Natural Science Foundation of China (NSFC) under grant No.61927801.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lin Wan.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Du, X., Wan, L. & Shen, G. An Improved Graph Convolution Network for Robust Image Retrieval. Neural Process Lett 55, 5121–5141 (2023). https://doi.org/10.1007/s11063-022-11083-2

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11063-022-11083-2

Keywords

Navigation