Self-attention Based Cross-Level Fusion Network for Camouflaged Object Detection

Zhan, Chunlan; He, Linyan; Liu, Yun; Xu, Baolei; Wang, Anzhi

doi:10.1007/978-3-031-20233-9_59

Chunlan Zhan¹⁵,
Linyan He¹⁵,
Yun Liu¹⁶,
Baolei Xu¹⁷ &
…
Anzhi Wang¹⁵

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13628))

Included in the following conference series:

Chinese Conference on Biometric Recognition

1123 Accesses

Abstract

Although CNN-based camouflaged object detection(COD) has made great progress in recent years, their prediction maps usually contain incomplete detail information due to the similarity between the camouflaged object and the background. To alleviate this, a CNN-based framework named SACF-Net is designed for COD via cross-level fusion that facilitates the detection of camouflaged object detail. On the one hand, the low-level features contain abundant edge detail information to distinguish the camouflaged object from the background. On the other hand, the Polarized Self-Attention(PSA) mechanism is introduced to refine high-level features that contain extensive semantic information to enhance inner details and performance. Finally, cross-level complementarity fusion is performed progressively to generate prediction maps in a top-down manner. Extensive experiments on four COD datasets exhibit that the proposed method is better than the state-of-the-art methods.

A. Wang—This work is supported by the National Natural Science Foundation of China under Grant(62162013), the National Undergraduate on Innovation and Entrepreneurship Training Program of Guizhou Province(S202110663028, S202110663029), and the Key Laboratory of Exploitation and Study of Distinctive Plants in Education Department of Sichuan Province(TSZW2109).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Sun, Y., Chen, G., Zhou, T., Zhang, Y., Liu, N.: Context-aware cross-level fusion network for camouflaged object detection. arXiv:2105.12555 (2021)
Mei, H., Ji, G.P., Wei, Z., Yang, X., Wei, X., Fan, D.P.: Camouflaged object segmentation with distraction mining. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8772–8781 (2021)
Google Scholar
Wu, R., Feng, M., Guan, W., Wang, D., Lu, H., Ding, E.: A mutual learning method for salient object detection with intertwined multi-supervision. Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 8150–8159 (2019)
Google Scholar
Wu, Z., Su, L., Huang, Q.: Cascaded partial decoder for fast and accurate salient object detection. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition pp, 3907–3916 (2019)
Google Scholar
Luo, Z., Mishra, A., Achkar, A., Eichel, J., Li, S., Jodoin, P. M.: Non-local deep features for salient object detection. Proceedings of the IEEE Conference on computer vision and pattern recognition, pp. 6609–6617 (2017)
Google Scholar
Liu, J.J., Hou, Q., Cheng, M.M., Feng, J., Jiang, J.: A simple pooling-based design for real-time salient object detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 3917–3926 (2019)
Google Scholar
Wang, W., Zhao, S., Shen, J., Hoi, S. C., Borji, A.: Salient object detection with pyramid attention and salient edges. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1448–1457 (2019)
Google Scholar
Ji, G.P., Zhu, L., Zhuge, M., Fu, K.: Fast camouflaged object detection via edge-based reversible re-calibration network. Pattern Recogn. 123, 108414 (2022)
Article Google Scholar
Vaswani, A., et al.: Attention is all you need. In: Advances in neural information processing systems 30 (2017)
Google Scholar
Hui, B., Zhu, P., Hu, Q., Wang, Q.: Self-attention relation network for few-shot learning. In 2019 IEEE international conference on Multimedia and Expo Workshops, pp. 198–203. IEEE (2019)
Google Scholar
Liu, H., Liu, F., Fan, X., Huang, D.: Polarized self-attention: towards high-quality pixel-wise regression. arXiv:2107.00782 (2021)
Cao, J., et al.: DO-Conv: depthwise over-parameterized convolutional layer. arXiv:2006.12030 (2020)
Pan, X., Zhan, X., Shi, J., Tang, X., Luo, P.: Switchable whitening for deep representation learning. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 1863–1871 (2019)
Google Scholar
Biswas, K., Kumar, S., Banerjee, S., Pandey, A. K.: SMU: smooth activation function for deep networks using smoothing maximum technique. arXiv:2111.04682 (2021)
Le, T.N., Nguyen, T.V., Nie, Z., Tran, M.T., Sugimoto, A.: Anabranch network for camouflaged object segmentation. Comput. Vis. Image Underst. 184, 45–56 (2019)
Article Google Scholar
Fan, D.P., Ji, G.P., Sun, G., Cheng, M. M., Shen, J., Shao, L.: Camouflaged object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2777–2787 (2020)
Google Scholar
Lv, Y., et al.: Simultaneously localize, segment and rank the camouflaged objects. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11591–11601 (2021)
Google Scholar
Liu, N., Han, J., Yang, M. H.: PiCANet: learning pixel-wise contextual attention for saliency detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 3089–3098 (2018)
Google Scholar
Zhao, J.X., Liu, J.J., Fan, D.P., Cao, Y., Yang, J., Cheng, M.M.: EGNet: edge guidance network for salient object detection. In: Proceedings of the IEEE/CVF international conference on computer vision, pp. 8779–8788 (2019)
Google Scholar
Wu, Z., Su, L., Huang, Q.: Stacked cross refinement network for edge-aware salient object detection. In: Proceedings of the IEEE/CVF international conference on computer vision, pp. 7264–7273 (2019)
Google Scholar
Wei, J., Wang, S., Huang, Q.: FNet: fusion, feedback and focus for salient object detection. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 12321–12328 (2020)
Google Scholar
Zhou, H., Xie, X., Lai, J. H., Chen, Z., Yang, L.: Interactive two-stream decoder for accurate and fast saliency detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 9141–9150 (2020)
Google Scholar
Lv, Y., et al.: Simultaneously localize, segment and rank the camouflaged objects. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11591–11601 (2021)
Google Scholar
Zhu, J., Zhang, X., Zhang, S., Liu, J.: Inferring camouflaged objects by texture-aware interactive guidance network. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 3599–3607 (2021)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Big Data and Computer Science, Guizhou Normal University, Guiyang, 550025, Guizhou, China
Chunlan Zhan, Linyan He & Anzhi Wang
College of Artificial Intelligence, Southwest University, Chongqing, 400715, China
Yun Liu
Sichuan University of Arts and Sciences, Dazhou, Sichuan, China
Baolei Xu

Authors

Chunlan Zhan
View author publications
You can also search for this author in PubMed Google Scholar
Linyan He
View author publications
You can also search for this author in PubMed Google Scholar
Yun Liu
View author publications
You can also search for this author in PubMed Google Scholar
Baolei Xu
View author publications
You can also search for this author in PubMed Google Scholar
Anzhi Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Anzhi Wang .

Editor information

Editors and Affiliations

Beijing University of Posts and Telecommunications, Beijing, China
Weihong Deng
Tsinghua University, Beijing, China
Jianjiang Feng
Beihang University, Beijing, China
Di Huang
Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
Meina Kan
Institute of Automation, Chinese Academy of Sciences, Beijing, China
Zhenan Sun
Tsinghua University, Beijing, China
Fang Zheng
China Electronics Standardization Institute, Beijing, China
Wenfeng Wang
Institute of Automation, Chinese Academy of Sciences, Beijing, China
Zhaofeng He

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhan, C., He, L., Liu, Y., Xu, B., Wang, A. (2022). Self-attention Based Cross-Level Fusion Network for Camouflaged Object Detection. In: Deng, W., et al. Biometric Recognition. CCBR 2022. Lecture Notes in Computer Science, vol 13628. Springer, Cham. https://doi.org/10.1007/978-3-031-20233-9_59

Download citation

DOI: https://doi.org/10.1007/978-3-031-20233-9_59
Published: 03 November 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-20232-2
Online ISBN: 978-3-031-20233-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Self-attention Based Cross-Level Fusion Network for Camouflaged Object Detection