skip to main content
10.1145/3664647.3680645acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Semi-supervised Camouflaged Object Detection from Noisy Data

Published: 28 October 2024 Publication History

Abstract

Most of previous camouflaged object detection methods heavily lean upon large-scale manually-labeled training samples, which are notoriously difficult to obtain. Even worse, the reliability of labels is compromised by the inherent challenges in accurately annotating concealed targets that exhibit high similarities with their surroundings. To overcome these shortcomings, this paper develops the first semi-supervised camouflaged object detection framework, which requires merely a small amount of samples even having noisy/incorrect annotations. Specifically, on the one hand, we introduce an innovative pixel-level loss re-weighting technique to reduce possible negative impacts from imperfect labels, through a window-based voting strategy. On the other hand, we take advantages of ensemble learning to explore robust features against noises/outliers, thereby generating relatively reliable pseudo labels for unlabelled images. Extensive experimental results on four benchmark datasets have been conducted.

References

[1]
Yunhao Bai, Duowen Chen, Qingli Li, Wei Shen, and Yan Wang. 2023. Bidirectional copy-paste for semi-supervised medical image segmentation. In CVPR. 11514--11524.
[2]
Hritam Basak and Zhaozheng Yin. 2023. Pseudo-label guided contrastive learning for semi-supervised medical image segmentation. In CVPR. 19786--19797.
[3]
David Berthelot, Nicholas Carlini, Ekin D. Cubuk, Alex Kurakin, Kihyuk Sohn, Han Zhang, and Colin Raffel. 2020. Remixmatch: semi-supervised learning with distribution matching and augmentation anchoring. In ICLR.
[4]
Hao-Wei Chen, Yu-Syuan Xu, Min-Fong Hong, Yi-Min Tsai, Hsien-Kai Kuo, and Chun-Yi Lee. 2023. Cascaded local implicit transformer for arbitrary-scale super-resolution. In CVPR. 18257--18267.
[5]
Yuhao Chen, Xin Tan, Borui Zhao, Zhaowei Chen, Renjie Song, Jiajun Liang, and Xuequan Lu. 2023. Boosting semi-supervised learning by exploiting all unlabeled data. In CVPR. 7548--7557.
[6]
Zhongxi Chen, Ke Sun, and Xianming Lin. 2024. Camodiffusion: camouflaged object detection via conditional diffusion models. In AAAI. 1272--1280.
[7]
Ming-Ming Cheng and Deng-Ping Fan. 2021. Structure-measure: a new way to evaluate foreground maps. IJCV, Vol. 129, 9 (2021), 2622--2638.
[8]
Yae Jee Cho, Andre Manoel, Gauri Joshi, Robert Sim, and Dimitrios Dimitriadis. 2022. Heterogeneous ensemble knowledge transfer for training large models in federated learning. In IJCAI. 2881--2887. https://doi.org/10.24963/ijcai.2022/399
[9]
Hung-Kuo Chu, Wei-Hsin Hsu, Niloy J. Mitra, Daniel Cohen-Or, Tien-Tsin Wong, and Tong-Yee Lee. 2010. Camouflage images. ACM Trans. Graph., Vol. 29, 4 (2010), 51:1--51:8.
[10]
Runmin Cong, Mengyao Sun, Sanyi Zhang, Xiaofei Zhou, Wei Zhang, and Yao Zhao. 2023. Frequency perception network for camouflaged object detection. In ACM MM. 1179--1189.
[11]
Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. 2009. Imagenet: a large-scale hierarchical image database. In CVPR. 248--255.
[12]
Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, Jakob Uszkoreit, and Neil Houlsby. 2021. An image is worth 16x16 words: transformers for image recognition at scale. In ICLR.
[13]
Deng-Ping Fan, Cheng Gong, Yang Cao, Bo Ren, Ming-Ming Cheng, and Ali Borji. 2018. Enhanced-alignment measure for binary foreground map evaluation. In IJCAI. 698--704.
[14]
Deng-Ping Fan, Ge-Peng Ji, Ming-Ming Cheng, and Ling Shao. 2022. Concealed object detection. TPAMI, Vol. 44, 10 (2022), 6024--6042.
[15]
Deng-Ping Fan, Ge-Peng Ji, Guolei Sun, Ming-Ming Cheng, Jianbing Shen, and Ling Shao. 2020. Camouflaged object detection. In CVPR. 2774--2784.
[16]
Deng-Ping Fan, Ge-Peng Ji, Tao Zhou, Geng Chen, Huazhu Fu, Jianbing Shen, and Ling Shao. 2020. Pranet: parallel reverse attention network for polyp segmentation. In MICCAI, Vol. 12266. Springer, 263--273.
[17]
Deng-Ping Fan, Tao Zhou, Ge-Peng Ji, Yi Zhou, Geng Chen, Huazhu Fu, Jianbing Shen, and Ling Shao. 2020. Inf-net: automatic covid-19 lung infection segmentation from ct Images. IEEE Trans. Medical Imaging, Vol. 39, 8 (2020), 2626--2637.
[18]
Meirav Galun, Eitan Sharon, Ronen Basri, and Achi Brandt. 2003. Texture segmentation by multiscale aggregation of filter responses and shape elements. In ICCV. 716--723.
[19]
Yves Grandvalet and Yoshua Bengio. 2005. Semi-supervised learning by entropy minimization. In CAP. 281--296.
[20]
Ali Hassani, Steven Walton, Jiachen Li, Shen Li, and Humphrey Shi. 2023. Neighborhood attention transformer. In CVPR. 6185--6194.
[21]
Chunming He, Kai Li, Yachao Zhang, Longxiang Tang, Yulun Zhang, Zhenhua Guo, and Xiu Li. 2023. Camouflaged object detection with feature decomposition and edge reconstruction. In CVPR. 22046--22055.
[22]
Kaiming He, Xinlei Chen, Saining Xie, Yanghao Li, Piotr Dollár, and Ross B. Girshick. 2022. Masked autoencoders are scalable vision learners. In CVPR. 15979--15988.
[23]
Kaiming He, Haoqi Fan, Yuxin Wu, Saining Xie, and Ross B. Girshick. 2020. Momentum contrast for unsupervised visual representation learning. In CVPR. 9726--9735.
[24]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In CVPR. 770--778.
[25]
Ruozhen He, Qihua Dong, Jiaying Lin, and Rynson W. H. Lau. 2023. Weakly-supervised camouflaged object detection with scribble annotations. In AAAI. 781--789.
[26]
Lena Heidemann, Adrian Schwaiger, and Karsten Roscher. 2021. Measuring ensemble diversity and its effects on model robustness. In IJCAI.
[27]
Xiaobin Hu, Shuo Wang, Xuebin Qin, Hang Dai, Wenqi Ren, Donghao Luo, Ying Tai, and Ling Shao. 2023. High-resolution iterative feedback network for camouflaged object detection. In AAAI. 881--889.
[28]
Wei Hua, Dingkang Liang, Jingyu Li, Xiaolong Liu, Zhikang Zou, Xiaoqing Ye, and Xiang Bai. 2023. Sood: towards semi-supervised oriented object detection. In CVPR. 15558--15567.
[29]
Huimin Huang, Shiao Xie, Lanfen Lin, Ruofeng Tong, Yen-Wei Chen, Yuexiang Li, Hong Wang, Yawen Huang, and Yefeng Zheng. 2023. Semicvt: semi-supervised convolutional vision transformer for semantic segmentation. In CVPR. 11340--11349.
[30]
Zhou Huang, Hang Dai, Tian-Zhu Xiang, Shuo Wang, Huai-Xin Chen, Jie Qin, and Huan Xiong. 2023. Feature shrinkage pyramid for camouflaged object detection with transformers. In CVPR. 5557--5566.
[31]
Qi Jia, Shuilian Yao, Yu Liu, Xin Fan, Risheng Liu, and Zhongxuan Luo. 2022. Segment, magnify and reiterate: detecting camouflaged objects the hard way. In CVPR. 4703--4712.
[32]
Shiyi Lan, Xitong Yang, Zhiding Yu, Zuxuan Wu, José M. Álvarez, and Anima Anandkumar. 2023. Vision transformers are good mask auto-labelers. In CVPR. 23745--23755.
[33]
Trung-Nghia Le, Tam V. Nguyen, Zhongliang Nie, Minh-Triet Tran, and Akihiro Sugimoto. 2019. Anabranch network for camouflaged object segmentation. Comput. Vis. Image Underst., Vol. 184 (2019), 45--56. https://doi.org/10.1016/J.CVIU.2019.04.006
[34]
Trung-Nghia Le, Tam V. Nguyen, Zhongliang Nie, Minh-Triet Tran, and Akihiro Sugimoto. 2019. Anabranch network for camouflaged object segmentation. CVIU, Vol. 184 (2019), 45--56.
[35]
Aixuan Li, Jing Zhang, Yunqiu Lv, Bowen Liu, Tong Zhang, and Yuchao Dai. 2021. Uncertainty-aware joint salient object and camouflaged object detection. In CVPR. 10071--10081.
[36]
Muyang Li, Runze Wu, Haoyu Liu, Jun Yu, Xun Yang, Bo Han, and Tongliang Liu. 2023. Instant: semi-supervised learning with instance-dependent thresholds. In NeurIPS.
[37]
Peng Li, Xuefeng Yan, Hongwei Zhu, Mingqiang Wei, Xiao-Ping Zhang, and Jing Qin. 2022. Findnet: can you find me? boundary-and-texture enhancement network for camouflaged object detection. TIP, Vol. 31 (2022), 6396--6411.
[38]
Xiaofei Li, Jiaxin Yang, Shuohao Li, Jun Lei, Jun Zhang, and Dong Chen. 2023. Locate, refine and restore: a progressive enhancement network for camouflaged object detection. In IJCAI. 1116--1124.
[39]
Wei Lin and Antoni B. Chan. 2023. Optimal transport minimization: crowd localization on density maps for semi-supervised counting. In CVPR. 21663--21673.
[40]
Jinming Liu, Heming Sun, and Jiro Katto. 2023. Learned image compression with mixed transformer-cnn architectures. In CVPR. 14388--14397.
[41]
Yu Liu, Haihang Li, Juan Cheng, and Xun Chen. 2023. Mscaf-net: a general framework for camouflaged object detection via learning multi-scale context-aware features. TCSVT, Vol. 33, 9 (2023), 4934--4947.
[42]
Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, and Baining Guo. 2021. Swin transformer: hierarchical vision transformer using shifted windows. In ICCV. 9992--10002.
[43]
Yangdi Lu and Wenbo He. 2022. Selc: self-ensemble label correction improves learning with noisy labels. In IJCAI. 3278--3284. https://doi.org/10.24963/ijcai.2022/455
[44]
Naisong Luo, Yuwen Pan, Rui Sun, Tianzhu Zhang, Zhiwei Xiong, and Feng Wu. 2023. Camouflaged instance segmentation via explicit de-camouflaging. In CVPR. 17918--17927.
[45]
Yunqiu Lv, Jing Zhang, Yuchao Dai, Aixuan Li, Bowen Liu, Nick Barnes, and Deng-Ping Fan. 2021. Simultaneously localize, segment and rank the camouflaged objects. In CVPR. 11591--11601.
[46]
Yunqiu Lv, Jing Zhang, Yuchao Dai, Aixuan Li, Bowen Liu, Nick Barnes, and Deng-Ping Fan. 2021. Simultaneously localize, segment and rank the camouflaged objects. In CVPR. 11591--11601.
[47]
Ran Margolin, Lihi Zelnik-Manor, and Ayellet Tal. 2014. How to evaluate foreground maps. In CVPR. 248--255.
[48]
Haiyang Mei, Ge-Peng Ji, Ziqi Wei, Xin Yang, Xiaopeng Wei, and Deng-Ping Fan. 2021. Camouflaged object segmentation with distraction mining. In CVPR. 8772--8781.
[49]
Takeru Miyato, Shin-ichi Maeda, Masanori Koyama, and Shin Ishii. 2019. Virtual adversarial training: a regularization method for supervised and semi-supervised learning. TPAMI, Vol. 41, 8 (2019), 1979--1993.
[50]
Ajoy Mondal, Susmita Ghosh, and Ashish Ghosh. 2017. Partially camouflaged object tracking using modified probabilistic neural network and fuzzy energy based active contour. IJCV, Vol. 122, 1 (2017), 116--148.
[51]
Youwei Pang, Xiaoqi Zhao, Tian-Zhu Xiang, Lihe Zhang, and Huchuan Lu. 2022. Zoom in and out: a mixed-scale triplet network for camouflaged object detection. In CVPR. 2150--2160.
[52]
Jialun Pei, Tianyang Cheng, Deng-Ping Fan, He Tang, Chuanbo Chen, and Luc Van Gool. 2022. Osformer: one-stage camouflaged instance segmentation with transformers. In ECCV, Vol. 13678. 19--37.
[53]
Tomer Ronen, Omer Levy, and Avram Golbert. 2023. Vision transformers with mixed-resolution tokenization. In CVPR. 4613--4622.
[54]
Sujit K Singh, Chitra A Dhawale, and Sanjay Misra. [n.,d.]. Survey of object detection methods in camouflaged image. In EECS.
[55]
Przemysław Skurowski, Hassan Abdulameer, J Błaszczyk, Tomasz Depta, Adam Kornacki, and P Kozieł. 2017. Animal camouflage analysis: chameleon database. http://kgwisc.aei.polsl.pl/index.php/pl/dataset/63-animal-camouflage-analysis (2017).
[56]
Kihyuk Sohn, David Berthelot, Nicholas Carlini, Zizhao Zhang, Han Zhang, Colin Raffel, Ekin Dogus Cubuk, Alexey Kurakin, and Chun-Liang Li. 2020. Fixmatch: simplifying semi-supervised learning with consistency and confidence. In NeurIPS.
[57]
Yujia Sun, Geng Chen, Tao Zhou, Yi Zhang, and Nian Liu. 2021. Context-aware cross-level fusion network for camouflaged object detection. In IJCAI, Zhi-Hua Zhou (Ed.). 1025--1031.
[58]
Yujia Sun, Shuo Wang, Chenglizhao Chen, and Tian-Zhu Xiang. 2022. Boundary-guided camouflaged object detection. In IJCAI. 1335--1341.
[59]
Qingwei Wang, Jinyu Yang, Xiaosheng Yu, Fangyi Wang, Peng Chen, and Feng Zheng. 2023. Depth-aided camouflaged object detection. In ACM MM. 3297--3306.
[60]
Xiaolong Wang, Ross B. Girshick, Abhinav Gupta, and Kaiming He. 2018. Non-local neural networks. In CVPR. 7794--7803.
[61]
Yuchao Wang, Haochen Wang, Yujun Shen, Jingjing Fei, Wei Li, Guoqiang Jin, Liwei Wu, Rui Zhao, and Xinyi Le. 2022. Semi-supervised semantic segmentation using unreliable pseudo-labels. In CVPR. 4238--4247.
[62]
Chenxi Xie, Changqun Xia, Mingcan Ma, Zhirui Zhao, Xiaowu Chen, and Jia Li. 2022. Pyramid grafting network for one-stage high resolution saliency detection. In CVPR. 11707--11716.
[63]
Chenxi Xie, Changqun Xia, Tianshu Yu, and Jia Li. 2023. Frequency representation integration for camouflaged object detection. In ACM MM. 1789--1797.
[64]
Qizhe Xie, Zihang Dai, Eduard H. Hovy, Thang Luong, and Quoc Le. 2020. Unsupervised data augmentation for consistency training. In NeurIPS.
[65]
Fan Yang, Qiang Zhai, Xin Li, Rui Huang, Ao Luo, Hong Cheng, and Deng-Ping Fan. 2021. Uncertainty-guided transformer reasoning for camouflaged object detection. In ICCV. 4126--4135.
[66]
Fan Yang, Qiang Zhai, Xin Li, Rui Huang, Ao Luo, Hong Cheng, and Deng-Ping Fan. 2021. Uncertainty-Guided Transformer Reasoning for Camouflaged Object Detection. In ICCV. IEEE, 4126--4135.
[67]
Lihe Yang, Wei Zhuo, Lei Qi, Yinghuan Shi, and Yang Gao. 2022. St: make self-training work better for semi-supervised semantic segmentation. In CVPR. 4258--4267.
[68]
Sangdoo Yun, Dongyoon Han, Sanghyuk Chun, Seong Joon Oh, Youngjoon Yoo, and Junsuk Choe. 2019. Cutmix: regularization strategy to train strong classifiers with localizable features. In ICCV. 6022--6031.
[69]
Qiang Zhai, Xin Li, Fan Yang, Zhicheng Jiao, Ping Luo, Hong Cheng, and Zicheng Liu. 2023. Mgl: mutual graph learning for camouflaged object detection. TIP, Vol. 32 (2023), 1897--1910.
[70]
Yi Zhang, Jing Zhang, Wassim Hamidouche, and Olivier Déforges. 2023. Predictive uncertainty estimation for camouflaged object detection. TIP, Vol. 32 (2023), 3580--3591.
[71]
Yijie Zhong, Bo Li, Lv Tang, Senyun Kuang, Shuang Wu, and Shouhong Ding. 2022. Detecting camouflaged object in frequency domain. In CVPR. 4494--4503.
[72]
Hongyu Zhou, Zheng Ge, Songtao Liu, Weixin Mao, Zeming Li, Haiyan Yu, and Jian Sun. 2022. Dense teacher: dense pseudo-labels for semi-supervised object detection. In ECCV, Vol. 13669. 35--50.
[73]
Hongwei Zhu, Peng Li, Haoran Xie, Xuefeng Yan, Dong Liang, Dapeng Chen, Mingqiang Wei, and Jing Qin. 2022. I can find you! boundary-guided separated attention network for camouflaged object detection. In AAAI. 3608--3616.
[74]
Jinjing Zhu, Yunhao Luo, Xu Zheng, Hao Wang, and Lin Wang. 2023. A good student is cooperative and reliable: cnn-transformer collaborative learning for semantic segmentation. In ICCV. 11686--11696.
[75]
Jinchao Zhu, Xiaoyu Zhang, Shuo Zhang, and Junnan Liu. 2021. Inferring camouflaged objects by texture-aware interactive guidance network. In AAAI. 3599--3607.

Index Terms

  1. Semi-supervised Camouflaged Object Detection from Noisy Data

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '24: Proceedings of the 32nd ACM International Conference on Multimedia
    October 2024
    11719 pages
    ISBN:9798400706868
    DOI:10.1145/3664647
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 28 October 2024

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. camouflaged object detection
    2. pseudo labels
    3. semi-supervised

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    MM '24
    Sponsor:
    MM '24: The 32nd ACM International Conference on Multimedia
    October 28 - November 1, 2024
    Melbourne VIC, Australia

    Acceptance Rates

    MM '24 Paper Acceptance Rate 1,150 of 4,385 submissions, 26%;
    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 208
      Total Downloads
    • Downloads (Last 12 months)208
    • Downloads (Last 6 weeks)141
    Reflects downloads up to 05 Mar 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media