Abstract
Few-shot instance segmentation aims to segment unseen objects in a so-called query image, given only one close-up illustration named support image. Recently, many few-shot instance segmentation methods are based on Mask R-CNN, and do not take advantage of the information embedding in support images. In this work, we propose Triple Parts Network (TPN) based on Mask R-CNN for few-shot instance segmentation consisting of three key modules, i.e., Attention Fusion Module (AFM), Cosine Similarity based Classifier with Circle Loss (CSC), and Cross-Local Module (CLM). AFM generates proposals targeting at the object in the support image. CSC uses cosine similarity with circle loss to classify objects. CLM is applied to the mask branch to establish a full relation between support feature maps and query feature maps. The experiments are conducted on the Microsoft COCO and PASCAL VOC benchmarks, and the results show that our TPN has better performance than other state-of-the-art methods.
Similar content being viewed by others
References
Bai M, Urtasun R (2017) Deep Watershed Transform for Instance Segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, 5221–5229
Bolya D, Zhou C, Xiao F, Lee Y J (2019) YOLACT: Real-time instance segmentation. In: IEEE International Conference on Computer Vision, 9157–9166
Bolya D, Zhou C, Xiao F, Lee YJ (2020) YOLACT++: Better Real-time Instance Segmentation. IEEE Trans Pattern Anal Mach Intell 44(2):1108–1121
De Brabandere B, Neven D, Van Gool L (2017). Semantic Instance Segmentation with a Discriminative Loss Function. arXiv:1708.02551
Chen K, Wang J, Pang J, Cao Y, Xiong Y, Li X, Sun S, Feng W, Liu Z, Xu J (2019). MMDetection: Open MMLab Detection Toolbox and Benchmark. arXiv:1906.07155
Chen H, Wang Y, Wang G, Qiao Y (2018) Lstd: A Low-Shot Transfer Detector for Object Detection. In: Association for the Advancement of Artificial Intelligence, 2836–2843
Choi D, Ye-Bin M, Kim J, Oh T-H (2022) FoxInst: A Frustratingly Simple Baseline for Weakly Few-shot Instance Segmentation. In: International Conference on Learning Representations
Doersch C, Gupta A, Zisserman A (2020). CrossTransformers: spatially-aware few-shot transfer. In: Neural Information Processing Systems, 21981–21993
Dong N, Xing E P (2018) Few-Shot Semantic Segmentation with Prototype Learning. In: British Machine Vision Conference
Dong X, Zheng L, Ma F, Yang Y, Meng D (2018) Few-Example Object Detection with Model Communication. IEEE Trans Pattern Anal Mach Intell 41(7):1641–1654
Everingham M, Van Gool L, Williams CK, Winn J, Zisserman A (2010) The Pascal Visual Object Classes (VOC) Challenge. Int J Comput Vision 88(2):303–338
Fan Z, Yu J-G, Liang Z, Ou J, Gao C, Xia G-S, Li Y (2020) FGN: Fully Guided Network for Few-Shot Instance Segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, 9172–9181
Fan Q, Zhuo W, Tang C-K, Tai Y-W (2020) Few-Shot Object Detection with Attention-RPN and Multi-Relation Detector. In: IEEE Conference on Computer Vision and Pattern Recognition, 4013–4022
Finn C, Abbeel P, Levine S (2017) Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks. In: International Conference on Machine Learning, 1126–1135
Ganea D A, Boom B, Poppe R (2021) Incremental Few-Shot Instance Segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, 1185–1194
Gidaris S, Komodakis N (2019) Generating Classification Weights with GNN Denoising Autoencoders for Few-Shot Learning. In: IEEE Conference on Computer Vision and Pattern Recognition, 21–30
Gu K, Liu H, Xia Z, Qiao J, Lin W, Thalmann D (2021) PM2.5 Monitoring: Use Information Abundance Measurement and Wide and Deep Learning. IEEE Trans Neural Networks Learn Syst 32(10):4278–4290
Gu K, Xia Z, Qiao J (2020) Stacked Selective Ensemble for PM2.5 Forecast. IEEE Trans Instrum Meas 69(3):660–671
He K, Gkioxari G, Dollár P, Girshick R (2017) Mask R-CNN. In: IEEE International Conference on Computer Vision, 2961–2969
He K, Zhang X, Ren S, Sun J (2016) Deep Residual Learning for Image Recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, 770–778
Kang B, Liu Z, Wang X, Yu F, Feng J, Darrell T (2019) Few-shot Object Detection via Feature Reweighting. In: IEEE International Conference on Computer Vision, 8420–8429
Karlinsky L, Shtok J, Harary S, Schwartz E, Aides A, Feris R, Giryes R, Bronstein A M (2019) RepMet: Representative-based metric learning for classification and few-shot object detection. In: IEEE Conference on Computer Vision and Pattern Recognition, 5197–5206
Kim J, Kim T, Kim S, Yoo C D (2019) Edge-labeling Graph Neural Network for Few-shot Learning. In: IEEE Conference on Computer Vision and Pattern Recognition, 11–20
Koch G, Zemel R, Salakhutdinov R (2015) Siamese Neural Networks for One-shot Image Recognition. In: International Conference on Machine Learning Deep Learning Workshop
Lin T-Y, Dollár P, Girshick R, He K, Hariharan B, Belongie S (2017) Feature Pyramid Networks for Object Detection. In: IEEE Conference on Computer Vision and Pattern Recognition, 2117–2125
Lin T-Y, Maire M, Belongie S, Hays J, Perona P, Ramanan D, Dollár P, Zitnick C L (2014) Microsoft COCO: Common Objects in Context. In: European Conference on Computer Vision, 740–755
Michaelis C, Ustyuzhaninov I, Bethge M, Ecker A S (2018). One-Shot Instance Segmentation. arXiv:1811.11507
Nguyen K, Todorovic S (2021) FAPIS: A Few-shot Anchor-free Part-based Instance Segmenter. In: IEEE Conference on Computer Vision and Pattern Recognition, 11099–11108
Rakelly K, Shelhamer E, Darrell T, Efros A, Levine S (2018). Conditional Networks for Few-Shot Semantic Segmentation. In: International Conference on Learning Representations Workshop
Ravi S, Larochelle H (2016). Optimization as a Model for Few-Shot Learning. In: International Conference on Learning Representations
Redmon J, Divvala S, Girshick R, Farhadi A (2016) You Only Look once: Unified, Real-Time Object Detection. In: IEEE Conference on Computer Vision and Pattern Recognition, 779–788
Ren S, He K, Girshick R, Sun J (2015). Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. In: Neural Information Processing Systems, 91–99
Santoro A, Bartunov S, Botvinick M, Wierstra D, Lillicrap T (2016) Meta-Learning with Memory-Augmented Neural Networks. In: International Conference on Machine Learning, 1842–1850
Satorras V G, Estrach J B (2018) Few-Shot Learning with Graph Neural Networks. In: International Conference on Learning Representations
Shaban A, Bansal S, Liu Z, Essa I, Boots B (2017) One-Shot Learning for Semantic Segmentation. In: British Machine Vision Conference
Snell J, Swersky K, Zemel R (2017). Prototypical Networks for Few-shot Learning. In: Neural Information Processing Systems, 4077–4087
Sun Y, Cheng C, Zhang Y, Zhang C, Zheng L, Wang Z, Wei Y (2020) Circle Loss: A Unified Perspective of Pair Similarity Optimization. In: IEEE Conference on Computer Vision and Pattern Recognition, 6398–6407
Sung F, Yang Y, Zhang L, Xiang T, Torr P H, Hospedales T M (2018) Learning to Compare: Relation Network for Few-Shot Learning. In: IEEE Conference on Computer Vision and Pattern Recognition, 1199–1208
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez A N, Kaiser Ł, Polosukhin I (2017) Attention is All You Need. In: Neural Information Processing Systems, 5998–6008
Vinyals O, Blundell C, Lillicrap T, Wierstra D (2016). Matching Networks for One Shot Learning. In: Neural Information Processing Systems, 3630–3638
Wang X, Girshick R, Gupta A, He K (2018) Non-local Neural Networks. In: IEEE Conference on Computer Vision and Pattern Recognition, 7794–7803
Wang X, Kong T, Shen C, Jiang Y, Li L (2020) SOLO: Segmenting Objects by Locations. In: European Conference on Computer Vision, 649–665
Wang X, Zhang R, Kong T, Li L, Shen C (2020). SOLOv2: Dynamic and Fast Instance Segmentation. In: Neural Information Processing Systems, 17721–17732
Xie E, Sun P, Song X, Wang W, Liu X, Liang D, Shen C, Luo P (2020) PolarMask: Single Shot Instance Segmentation with Polar Representation. In: IEEE Conference on Computer Vision and Pattern Recognition, 12193–12202
Yan X, Chen Z, Xu A, Wang X, Liang X, Lin L (2019) Meta R-CNN: Towards General Solver for Instance-level Low-shot Learning. In: IEEE International Conference on Computer Vision, 9577–9586
Ye H-J, Hu H, Zhan D-C, Sha F (2020) Few-shot Learning via Embedding Adaptation with Set-to-Set Functions. In: IEEE Conference on Computer Vision and Pattern Recognition, 8808–8817
Zhang C, Cai Y, Lin G, Shen C (2020) DeepEMD: Few-Shot Image Classification With Differentiable Earth Mover's Distance and Structured Classifiers. In: IEEE Conference on Computer Vision and Pattern Recognition, 12203–12213
Zhang C, Lin G, Liu F, Guo J, Wu Q, Yao R (2019) Pyramid Graph Networks with Connection Attentions for Region-Based One-Shot Semantic Segmentation. In: IEEE International Conference on Computer Vision, 9587–9595
Zhang X, Wei Y, Yang Y, Huang TS (2020) SG-One: Similarity Guidance Network for One-Shot Semantic Segmentation. IEEE Trans Cybern 50(9):3855–3865
Funding
This work was supported in part by the National Natural Science Foundation of China (Grant nos. 61971421 and 61976217).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Wang, H., Zhou, S., Xu, X. et al. TPN: Triple parts network for few-shot instance segmentation. Multimed Tools Appl 82, 46439–46455 (2023). https://doi.org/10.1007/s11042-023-15624-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-15624-2