Skip to main content

Few-Shot Segmentation via Complementary Prototype Learning and Cascaded Refinement

  • Conference paper
  • First Online:
Pattern Recognition and Computer Vision (PRCV 2021)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 13022))

Included in the following conference series:

  • 1821 Accesses

Abstract

Prototype learning has been widely explored for few-shot segmentation. Existing methods typically learn the prototype from the foreground features of all support images, which rarely consider the background similarities between the query images and the support images. This unbalanced prototype learning strategy limits its capability to mutually correct the segmentation errors between the foreground and background. In this paper, we propose a Complementary Prototype Learning and Cascaded Refinement (CPLCR) network for few-shot segmentation. Firstly, both the foreground and background features of the support images are used to learn our complementary prototypes. Then, the foreground and background similarity maps are jointly derived between the query image feature and our complementary prototypes, which capture more comprehensive prior information. Finally, we fuse the query image feature, foreground prototype and the foreground/background similarity maps together, and feed them to a cascaded refinement module, which recursively reuses the output of previous iteration to refine the segmentation result. Extensive experimental results show that the proposed CPLCR model outperforms many state-of-the-art methods for 1-shot and 5-shot segmentation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Badrinarayanan, V., Kendall, A., Cipolla, R.: SegNet: a deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39(12), 2481–2495 (2017)

    Article  Google Scholar 

  2. Chen, L.C., Papandreou, G., Schroff, F., Adam, H.: Rethinking atrous convolution for semantic image segmentation. arXiv preprint arXiv:1706.05587 (2017)

  3. Dong, N., Xing, E.P.: Few-shot semantic segmentation with prototype learning. In: BMVC, vol. 3 (2018)

    Google Scholar 

  4. Everingham, M., Gool, L., Williams, C.K.I., Winn, J., Zisserman, A.: The pascal visual object classes (VOC) challenge. Int. J. Comput. Vis. 88, 303–338 (2009)

    Article  Google Scholar 

  5. Everingham, M., Van Gool, L., Williams, C.K., Winn, J., Zisserman, A.: The pascal visual object classes (VOC) challenge. Int. J. Comput. Vis. 88(2), 303–338 (2010)

    Article  Google Scholar 

  6. Hariharan, B., Arbeláez, P., Bourdev, L.D., Maji, S., Malik, J.: Semantic contours from inverse detectors. In: 2011 International Conference on Computer Vision, pp. 991–998 (2011)

    Google Scholar 

  7. Liu, Y., Zhang, X., Zhang, S., He, X.: Part-aware prototype network for few-shot semantic segmentation. ArXiv abs/2007.06309 (2020)

    Google Scholar 

  8. Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431–3440 (2015)

    Google Scholar 

  9. Luo, K., Meng, F., Wu, Q., Li, H.: Weakly supervised semantic segmentation by multiple group cosegmentation. In: 2018 IEEE Visual Communications and Image Processing (VCIP), pp. 1–4 (2018)

    Google Scholar 

  10. Nguyen, K.D.M., Todorovic, S.: Feature weighting and boosting for few-shot segmentation. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 622–631 (2019)

    Google Scholar 

  11. Rakelly, K., Shelhamer, E., Darrell, T., Efros, A.A., Levine, S.: Conditional networks for few-shot semantic segmentation. In: ICLR (2018)

    Google Scholar 

  12. Shaban, A., Bansal, S., Liu, Z., Essa, I., Boots, B.: One-shot learning for semantic segmentation. ArXiv abs/1709.03410 (2017). arXiv:1709.03410

  13. Siam, M., Oreshkin, B.N., Jagersand, M.: AMP: adaptive masked proxies for few-shot segmentation. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 5249–5258 (2019)

    Google Scholar 

  14. Wang, K., Liew, J., Zou, Y., Zhou, D., Feng, J.: PANet: few-shot image semantic segmentation with prototype alignment. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 9196–9205 (2019)

    Google Scholar 

  15. Wang, K., Liew, J.H., Zou, Y., Zhou, D., Feng, J.: PANet: few-shot image semantic segmentation with prototype alignment. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9197–9206 (2019)

    Google Scholar 

  16. Xu, X., Meng, F., liang Li, H., Wu, Q., Ngan, K.N., Chen, S.: A new bounding box based pseudo annotation generation method for semantic segmentation. In: 2020 IEEE International Conference on Visual Communications and Image Processing (VCIP), pp. 100–103 (2020)

    Google Scholar 

  17. Yang, B., Liu, C., Li, B., Jiao, J., Ye, Q.: Prototype mixture models for few-shot semantic segmentation. ArXiv abs/2008.03898 (2020)

    Google Scholar 

  18. Yang, M., Yu, K., Zhang, C., Li, Z., Yang, K.: DenseASPP for semantic segmentation in street scenes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3684–3692 (2018)

    Google Scholar 

  19. Yang, Y., Meng, F., Li, H., Ngan, K., Wu, Q.: A new few-shot segmentation network based on class representation. In: 2019 IEEE Visual Communications and Image Processing (VCIP), pp. 1–4 (2019)

    Google Scholar 

  20. Yang, Y., Meng, F., Li, H., Wu, Q., Xu, X., Chen, S.: A new local transformation module for few-shot segmentation. ArXiv abs/1910.05886 (2020)

    Google Scholar 

  21. Zhang, C., Lin, G., Liu, F., Guo, J., Wu, Q., Yao, R.: Pyramid graph networks with connection attentions for region-based one-shot semantic segmentation. 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 9586–9594 (2019)

    Google Scholar 

  22. Zhang, C., Lin, G., Liu, F., Yao, R., Shen, C.: CANet: class-agnostic segmentation networks with iterative refinement and attentive few-shot learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 5217–5226 (2019)

    Google Scholar 

  23. Zhang, X., Wei, Y., Yang, Y., Huang, T.: SG-One: similarity guidance network for one-shot semantic segmentation. IEEE Trans. Cybern. 50(9), 3855–3865 (2020)

    Article  Google Scholar 

  24. Zhao, H., Shi, J., Qi, X., Wang, X., Jia, J.: Pyramid scene parsing network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2881–2890 (2017)

    Google Scholar 

Download references

Acknowledgement

This work was partially supported by National Natural Science Foundation of China (No. 61971095, 61871078, 61831005, and 61871087).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Qingbo Wu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Luo, H. et al. (2021). Few-Shot Segmentation via Complementary Prototype Learning and Cascaded Refinement. In: Ma, H., et al. Pattern Recognition and Computer Vision. PRCV 2021. Lecture Notes in Computer Science(), vol 13022. Springer, Cham. https://doi.org/10.1007/978-3-030-88013-2_40

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-88013-2_40

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-88012-5

  • Online ISBN: 978-3-030-88013-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics