Abstract
Although CNN-based camouflaged object detection(COD) has made great progress in recent years, their prediction maps usually contain incomplete detail information due to the similarity between the camouflaged object and the background. To alleviate this, a CNN-based framework named SACF-Net is designed for COD via cross-level fusion that facilitates the detection of camouflaged object detail. On the one hand, the low-level features contain abundant edge detail information to distinguish the camouflaged object from the background. On the other hand, the Polarized Self-Attention(PSA) mechanism is introduced to refine high-level features that contain extensive semantic information to enhance inner details and performance. Finally, cross-level complementarity fusion is performed progressively to generate prediction maps in a top-down manner. Extensive experiments on four COD datasets exhibit that the proposed method is better than the state-of-the-art methods.
A. Wang—This work is supported by the National Natural Science Foundation of China under Grant(62162013), the National Undergraduate on Innovation and Entrepreneurship Training Program of Guizhou Province(S202110663028, S202110663029), and the Key Laboratory of Exploitation and Study of Distinctive Plants in Education Department of Sichuan Province(TSZW2109).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Sun, Y., Chen, G., Zhou, T., Zhang, Y., Liu, N.: Context-aware cross-level fusion network for camouflaged object detection. arXiv:2105.12555 (2021)
Mei, H., Ji, G.P., Wei, Z., Yang, X., Wei, X., Fan, D.P.: Camouflaged object segmentation with distraction mining. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8772–8781 (2021)
Wu, R., Feng, M., Guan, W., Wang, D., Lu, H., Ding, E.: A mutual learning method for salient object detection with intertwined multi-supervision. Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 8150–8159 (2019)
Wu, Z., Su, L., Huang, Q.: Cascaded partial decoder for fast and accurate salient object detection. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition pp, 3907–3916 (2019)
Luo, Z., Mishra, A., Achkar, A., Eichel, J., Li, S., Jodoin, P. M.: Non-local deep features for salient object detection. Proceedings of the IEEE Conference on computer vision and pattern recognition, pp. 6609–6617 (2017)
Liu, J.J., Hou, Q., Cheng, M.M., Feng, J., Jiang, J.: A simple pooling-based design for real-time salient object detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 3917–3926 (2019)
Wang, W., Zhao, S., Shen, J., Hoi, S. C., Borji, A.: Salient object detection with pyramid attention and salient edges. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1448–1457 (2019)
Ji, G.P., Zhu, L., Zhuge, M., Fu, K.: Fast camouflaged object detection via edge-based reversible re-calibration network. Pattern Recogn. 123, 108414 (2022)
Vaswani, A., et al.: Attention is all you need. In: Advances in neural information processing systems 30 (2017)
Hui, B., Zhu, P., Hu, Q., Wang, Q.: Self-attention relation network for few-shot learning. In 2019 IEEE international conference on Multimedia and Expo Workshops, pp. 198–203. IEEE (2019)
Liu, H., Liu, F., Fan, X., Huang, D.: Polarized self-attention: towards high-quality pixel-wise regression. arXiv:2107.00782 (2021)
Cao, J., et al.: DO-Conv: depthwise over-parameterized convolutional layer. arXiv:2006.12030 (2020)
Pan, X., Zhan, X., Shi, J., Tang, X., Luo, P.: Switchable whitening for deep representation learning. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 1863–1871 (2019)
Biswas, K., Kumar, S., Banerjee, S., Pandey, A. K.: SMU: smooth activation function for deep networks using smoothing maximum technique. arXiv:2111.04682 (2021)
Le, T.N., Nguyen, T.V., Nie, Z., Tran, M.T., Sugimoto, A.: Anabranch network for camouflaged object segmentation. Comput. Vis. Image Underst. 184, 45–56 (2019)
Fan, D.P., Ji, G.P., Sun, G., Cheng, M. M., Shen, J., Shao, L.: Camouflaged object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2777–2787 (2020)
Lv, Y., et al.: Simultaneously localize, segment and rank the camouflaged objects. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11591–11601 (2021)
Liu, N., Han, J., Yang, M. H.: PiCANet: learning pixel-wise contextual attention for saliency detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 3089–3098 (2018)
Zhao, J.X., Liu, J.J., Fan, D.P., Cao, Y., Yang, J., Cheng, M.M.: EGNet: edge guidance network for salient object detection. In: Proceedings of the IEEE/CVF international conference on computer vision, pp. 8779–8788 (2019)
Wu, Z., Su, L., Huang, Q.: Stacked cross refinement network for edge-aware salient object detection. In: Proceedings of the IEEE/CVF international conference on computer vision, pp. 7264–7273 (2019)
Wei, J., Wang, S., Huang, Q.: FNet: fusion, feedback and focus for salient object detection. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 12321–12328 (2020)
Zhou, H., Xie, X., Lai, J. H., Chen, Z., Yang, L.: Interactive two-stream decoder for accurate and fast saliency detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 9141–9150 (2020)
Lv, Y., et al.: Simultaneously localize, segment and rank the camouflaged objects. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11591–11601 (2021)
Zhu, J., Zhang, X., Zhang, S., Liu, J.: Inferring camouflaged objects by texture-aware interactive guidance network. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 3599–3607 (2021)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Zhan, C., He, L., Liu, Y., Xu, B., Wang, A. (2022). Self-attention Based Cross-Level Fusion Network for Camouflaged Object Detection. In: Deng, W., et al. Biometric Recognition. CCBR 2022. Lecture Notes in Computer Science, vol 13628. Springer, Cham. https://doi.org/10.1007/978-3-031-20233-9_59
Download citation
DOI: https://doi.org/10.1007/978-3-031-20233-9_59
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-20232-2
Online ISBN: 978-3-031-20233-9
eBook Packages: Computer ScienceComputer Science (R0)