research-article

Semi-supervised Camouflaged Object Detection from Noisy Data

Authors:

Xiaojie GuoAuthors Info & Claims

MM '24: Proceedings of the 32nd ACM International Conference on Multimedia

Pages 4766 - 4775

https://doi.org/10.1145/3664647.3680645

Published: 28 October 2024 Publication History

Abstract

Most of previous camouflaged object detection methods heavily lean upon large-scale manually-labeled training samples, which are notoriously difficult to obtain. Even worse, the reliability of labels is compromised by the inherent challenges in accurately annotating concealed targets that exhibit high similarities with their surroundings. To overcome these shortcomings, this paper develops the first semi-supervised camouflaged object detection framework, which requires merely a small amount of samples even having noisy/incorrect annotations. Specifically, on the one hand, we introduce an innovative pixel-level loss re-weighting technique to reduce possible negative impacts from imperfect labels, through a window-based voting strategy. On the other hand, we take advantages of ensemble learning to explore robust features against noises/outliers, thereby generating relatively reliable pseudo labels for unlabelled images. Extensive experimental results on four benchmark datasets have been conducted.

References

[1]

Yunhao Bai, Duowen Chen, Qingli Li, Wei Shen, and Yan Wang. 2023. Bidirectional copy-paste for semi-supervised medical image segmentation. In CVPR. 11514--11524.

[2]

Hritam Basak and Zhaozheng Yin. 2023. Pseudo-label guided contrastive learning for semi-supervised medical image segmentation. In CVPR. 19786--19797.

[3]

David Berthelot, Nicholas Carlini, Ekin D. Cubuk, Alex Kurakin, Kihyuk Sohn, Han Zhang, and Colin Raffel. 2020. Remixmatch: semi-supervised learning with distribution matching and augmentation anchoring. In ICLR.

[4]

Hao-Wei Chen, Yu-Syuan Xu, Min-Fong Hong, Yi-Min Tsai, Hsien-Kai Kuo, and Chun-Yi Lee. 2023. Cascaded local implicit transformer for arbitrary-scale super-resolution. In CVPR. 18257--18267.

[5]

Yuhao Chen, Xin Tan, Borui Zhao, Zhaowei Chen, Renjie Song, Jiajun Liang, and Xuequan Lu. 2023. Boosting semi-supervised learning by exploiting all unlabeled data. In CVPR. 7548--7557.

[6]

Zhongxi Chen, Ke Sun, and Xianming Lin. 2024. Camodiffusion: camouflaged object detection via conditional diffusion models. In AAAI. 1272--1280.

[7]

Ming-Ming Cheng and Deng-Ping Fan. 2021. Structure-measure: a new way to evaluate foreground maps. IJCV, Vol. 129, 9 (2021), 2622--2638.

Digital Library

[8]

Yae Jee Cho, Andre Manoel, Gauri Joshi, Robert Sim, and Dimitrios Dimitriadis. 2022. Heterogeneous ensemble knowledge transfer for training large models in federated learning. In IJCAI. 2881--2887. https://doi.org/10.24963/ijcai.2022/399

[9]

Hung-Kuo Chu, Wei-Hsin Hsu, Niloy J. Mitra, Daniel Cohen-Or, Tien-Tsin Wong, and Tong-Yee Lee. 2010. Camouflage images. ACM Trans. Graph., Vol. 29, 4 (2010), 51:1--51:8.

Digital Library

[10]

Runmin Cong, Mengyao Sun, Sanyi Zhang, Xiaofei Zhou, Wei Zhang, and Yao Zhao. 2023. Frequency perception network for camouflaged object detection. In ACM MM. 1179--1189.

[11]

Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. 2009. Imagenet: a large-scale hierarchical image database. In CVPR. 248--255.

[12]

Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, Jakob Uszkoreit, and Neil Houlsby. 2021. An image is worth 16x16 words: transformers for image recognition at scale. In ICLR.

[13]

Deng-Ping Fan, Cheng Gong, Yang Cao, Bo Ren, Ming-Ming Cheng, and Ali Borji. 2018. Enhanced-alignment measure for binary foreground map evaluation. In IJCAI. 698--704.

[14]

Deng-Ping Fan, Ge-Peng Ji, Ming-Ming Cheng, and Ling Shao. 2022. Concealed object detection. TPAMI, Vol. 44, 10 (2022), 6024--6042.

Digital Library

[15]

Deng-Ping Fan, Ge-Peng Ji, Guolei Sun, Ming-Ming Cheng, Jianbing Shen, and Ling Shao. 2020. Camouflaged object detection. In CVPR. 2774--2784.

[16]

Deng-Ping Fan, Ge-Peng Ji, Tao Zhou, Geng Chen, Huazhu Fu, Jianbing Shen, and Ling Shao. 2020. Pranet: parallel reverse attention network for polyp segmentation. In MICCAI, Vol. 12266. Springer, 263--273.

Digital Library

[17]

Deng-Ping Fan, Tao Zhou, Ge-Peng Ji, Yi Zhou, Geng Chen, Huazhu Fu, Jianbing Shen, and Ling Shao. 2020. Inf-net: automatic covid-19 lung infection segmentation from ct Images. IEEE Trans. Medical Imaging, Vol. 39, 8 (2020), 2626--2637.

[18]

Meirav Galun, Eitan Sharon, Ronen Basri, and Achi Brandt. 2003. Texture segmentation by multiscale aggregation of filter responses and shape elements. In ICCV. 716--723.

[19]

Yves Grandvalet and Yoshua Bengio. 2005. Semi-supervised learning by entropy minimization. In CAP. 281--296.

[20]

Ali Hassani, Steven Walton, Jiachen Li, Shen Li, and Humphrey Shi. 2023. Neighborhood attention transformer. In CVPR. 6185--6194.

[21]

Chunming He, Kai Li, Yachao Zhang, Longxiang Tang, Yulun Zhang, Zhenhua Guo, and Xiu Li. 2023. Camouflaged object detection with feature decomposition and edge reconstruction. In CVPR. 22046--22055.

[22]

Kaiming He, Xinlei Chen, Saining Xie, Yanghao Li, Piotr Dollár, and Ross B. Girshick. 2022. Masked autoencoders are scalable vision learners. In CVPR. 15979--15988.

[23]

Kaiming He, Haoqi Fan, Yuxin Wu, Saining Xie, and Ross B. Girshick. 2020. Momentum contrast for unsupervised visual representation learning. In CVPR. 9726--9735.

[24]

Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In CVPR. 770--778.

[25]

Ruozhen He, Qihua Dong, Jiaying Lin, and Rynson W. H. Lau. 2023. Weakly-supervised camouflaged object detection with scribble annotations. In AAAI. 781--789.

[26]

Lena Heidemann, Adrian Schwaiger, and Karsten Roscher. 2021. Measuring ensemble diversity and its effects on model robustness. In IJCAI.

[27]

Xiaobin Hu, Shuo Wang, Xuebin Qin, Hang Dai, Wenqi Ren, Donghao Luo, Ying Tai, and Ling Shao. 2023. High-resolution iterative feedback network for camouflaged object detection. In AAAI. 881--889.

[28]

Wei Hua, Dingkang Liang, Jingyu Li, Xiaolong Liu, Zhikang Zou, Xiaoqing Ye, and Xiang Bai. 2023. Sood: towards semi-supervised oriented object detection. In CVPR. 15558--15567.

[29]

Huimin Huang, Shiao Xie, Lanfen Lin, Ruofeng Tong, Yen-Wei Chen, Yuexiang Li, Hong Wang, Yawen Huang, and Yefeng Zheng. 2023. Semicvt: semi-supervised convolutional vision transformer for semantic segmentation. In CVPR. 11340--11349.

[30]

Zhou Huang, Hang Dai, Tian-Zhu Xiang, Shuo Wang, Huai-Xin Chen, Jie Qin, and Huan Xiong. 2023. Feature shrinkage pyramid for camouflaged object detection with transformers. In CVPR. 5557--5566.

[31]

Qi Jia, Shuilian Yao, Yu Liu, Xin Fan, Risheng Liu, and Zhongxuan Luo. 2022. Segment, magnify and reiterate: detecting camouflaged objects the hard way. In CVPR. 4703--4712.

[32]

Shiyi Lan, Xitong Yang, Zhiding Yu, Zuxuan Wu, José M. Álvarez, and Anima Anandkumar. 2023. Vision transformers are good mask auto-labelers. In CVPR. 23745--23755.

[33]

Trung-Nghia Le, Tam V. Nguyen, Zhongliang Nie, Minh-Triet Tran, and Akihiro Sugimoto. 2019. Anabranch network for camouflaged object segmentation. Comput. Vis. Image Underst., Vol. 184 (2019), 45--56. https://doi.org/10.1016/J.CVIU.2019.04.006

Digital Library

[34]

Trung-Nghia Le, Tam V. Nguyen, Zhongliang Nie, Minh-Triet Tran, and Akihiro Sugimoto. 2019. Anabranch network for camouflaged object segmentation. CVIU, Vol. 184 (2019), 45--56.

Digital Library

[35]

Aixuan Li, Jing Zhang, Yunqiu Lv, Bowen Liu, Tong Zhang, and Yuchao Dai. 2021. Uncertainty-aware joint salient object and camouflaged object detection. In CVPR. 10071--10081.

[36]

Muyang Li, Runze Wu, Haoyu Liu, Jun Yu, Xun Yang, Bo Han, and Tongliang Liu. 2023. Instant: semi-supervised learning with instance-dependent thresholds. In NeurIPS.

[37]

Peng Li, Xuefeng Yan, Hongwei Zhu, Mingqiang Wei, Xiao-Ping Zhang, and Jing Qin. 2022. Findnet: can you find me? boundary-and-texture enhancement network for camouflaged object detection. TIP, Vol. 31 (2022), 6396--6411.

Digital Library

[38]

Xiaofei Li, Jiaxin Yang, Shuohao Li, Jun Lei, Jun Zhang, and Dong Chen. 2023. Locate, refine and restore: a progressive enhancement network for camouflaged object detection. In IJCAI. 1116--1124.

[39]

Wei Lin and Antoni B. Chan. 2023. Optimal transport minimization: crowd localization on density maps for semi-supervised counting. In CVPR. 21663--21673.

[40]

Jinming Liu, Heming Sun, and Jiro Katto. 2023. Learned image compression with mixed transformer-cnn architectures. In CVPR. 14388--14397.

[41]

Yu Liu, Haihang Li, Juan Cheng, and Xun Chen. 2023. Mscaf-net: a general framework for camouflaged object detection via learning multi-scale context-aware features. TCSVT, Vol. 33, 9 (2023), 4934--4947.

Digital Library

[42]

Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, and Baining Guo. 2021. Swin transformer: hierarchical vision transformer using shifted windows. In ICCV. 9992--10002.

[43]

Yangdi Lu and Wenbo He. 2022. Selc: self-ensemble label correction improves learning with noisy labels. In IJCAI. 3278--3284. https://doi.org/10.24963/ijcai.2022/455

[44]

Naisong Luo, Yuwen Pan, Rui Sun, Tianzhu Zhang, Zhiwei Xiong, and Feng Wu. 2023. Camouflaged instance segmentation via explicit de-camouflaging. In CVPR. 17918--17927.

[45]

Yunqiu Lv, Jing Zhang, Yuchao Dai, Aixuan Li, Bowen Liu, Nick Barnes, and Deng-Ping Fan. 2021. Simultaneously localize, segment and rank the camouflaged objects. In CVPR. 11591--11601.

[46]

Yunqiu Lv, Jing Zhang, Yuchao Dai, Aixuan Li, Bowen Liu, Nick Barnes, and Deng-Ping Fan. 2021. Simultaneously localize, segment and rank the camouflaged objects. In CVPR. 11591--11601.

[47]

Ran Margolin, Lihi Zelnik-Manor, and Ayellet Tal. 2014. How to evaluate foreground maps. In CVPR. 248--255.

[48]

Haiyang Mei, Ge-Peng Ji, Ziqi Wei, Xin Yang, Xiaopeng Wei, and Deng-Ping Fan. 2021. Camouflaged object segmentation with distraction mining. In CVPR. 8772--8781.

[49]

Takeru Miyato, Shin-ichi Maeda, Masanori Koyama, and Shin Ishii. 2019. Virtual adversarial training: a regularization method for supervised and semi-supervised learning. TPAMI, Vol. 41, 8 (2019), 1979--1993.

[50]

Ajoy Mondal, Susmita Ghosh, and Ashish Ghosh. 2017. Partially camouflaged object tracking using modified probabilistic neural network and fuzzy energy based active contour. IJCV, Vol. 122, 1 (2017), 116--148.

[51]

Youwei Pang, Xiaoqi Zhao, Tian-Zhu Xiang, Lihe Zhang, and Huchuan Lu. 2022. Zoom in and out: a mixed-scale triplet network for camouflaged object detection. In CVPR. 2150--2160.

[52]

Jialun Pei, Tianyang Cheng, Deng-Ping Fan, He Tang, Chuanbo Chen, and Luc Van Gool. 2022. Osformer: one-stage camouflaged instance segmentation with transformers. In ECCV, Vol. 13678. 19--37.

[53]

Tomer Ronen, Omer Levy, and Avram Golbert. 2023. Vision transformers with mixed-resolution tokenization. In CVPR. 4613--4622.

[54]

Sujit K Singh, Chitra A Dhawale, and Sanjay Misra. [n.,d.]. Survey of object detection methods in camouflaged image. In EECS.

[55]

Przemysław Skurowski, Hassan Abdulameer, J Błaszczyk, Tomasz Depta, Adam Kornacki, and P Kozieł. 2017. Animal camouflage analysis: chameleon database. http://kgwisc.aei.polsl.pl/index.php/pl/dataset/63-animal-camouflage-analysis (2017).

[56]

Kihyuk Sohn, David Berthelot, Nicholas Carlini, Zizhao Zhang, Han Zhang, Colin Raffel, Ekin Dogus Cubuk, Alexey Kurakin, and Chun-Liang Li. 2020. Fixmatch: simplifying semi-supervised learning with consistency and confidence. In NeurIPS.

[57]

Yujia Sun, Geng Chen, Tao Zhou, Yi Zhang, and Nian Liu. 2021. Context-aware cross-level fusion network for camouflaged object detection. In IJCAI, Zhi-Hua Zhou (Ed.). 1025--1031.

[58]

Yujia Sun, Shuo Wang, Chenglizhao Chen, and Tian-Zhu Xiang. 2022. Boundary-guided camouflaged object detection. In IJCAI. 1335--1341.

[59]

Qingwei Wang, Jinyu Yang, Xiaosheng Yu, Fangyi Wang, Peng Chen, and Feng Zheng. 2023. Depth-aided camouflaged object detection. In ACM MM. 3297--3306.

[60]

Xiaolong Wang, Ross B. Girshick, Abhinav Gupta, and Kaiming He. 2018. Non-local neural networks. In CVPR. 7794--7803.

[61]

Yuchao Wang, Haochen Wang, Yujun Shen, Jingjing Fei, Wei Li, Guoqiang Jin, Liwei Wu, Rui Zhao, and Xinyi Le. 2022. Semi-supervised semantic segmentation using unreliable pseudo-labels. In CVPR. 4238--4247.

[62]

Chenxi Xie, Changqun Xia, Mingcan Ma, Zhirui Zhao, Xiaowu Chen, and Jia Li. 2022. Pyramid grafting network for one-stage high resolution saliency detection. In CVPR. 11707--11716.

[63]

Chenxi Xie, Changqun Xia, Tianshu Yu, and Jia Li. 2023. Frequency representation integration for camouflaged object detection. In ACM MM. 1789--1797.

[64]

Qizhe Xie, Zihang Dai, Eduard H. Hovy, Thang Luong, and Quoc Le. 2020. Unsupervised data augmentation for consistency training. In NeurIPS.

[65]

Fan Yang, Qiang Zhai, Xin Li, Rui Huang, Ao Luo, Hong Cheng, and Deng-Ping Fan. 2021. Uncertainty-guided transformer reasoning for camouflaged object detection. In ICCV. 4126--4135.

[66]

Fan Yang, Qiang Zhai, Xin Li, Rui Huang, Ao Luo, Hong Cheng, and Deng-Ping Fan. 2021. Uncertainty-Guided Transformer Reasoning for Camouflaged Object Detection. In ICCV. IEEE, 4126--4135.

[67]

Lihe Yang, Wei Zhuo, Lei Qi, Yinghuan Shi, and Yang Gao. 2022. St: make self-training work better for semi-supervised semantic segmentation. In CVPR. 4258--4267.

[68]

Sangdoo Yun, Dongyoon Han, Sanghyuk Chun, Seong Joon Oh, Youngjoon Yoo, and Junsuk Choe. 2019. Cutmix: regularization strategy to train strong classifiers with localizable features. In ICCV. 6022--6031.

[69]

Qiang Zhai, Xin Li, Fan Yang, Zhicheng Jiao, Ping Luo, Hong Cheng, and Zicheng Liu. 2023. Mgl: mutual graph learning for camouflaged object detection. TIP, Vol. 32 (2023), 1897--1910.

Digital Library

[70]

Yi Zhang, Jing Zhang, Wassim Hamidouche, and Olivier Déforges. 2023. Predictive uncertainty estimation for camouflaged object detection. TIP, Vol. 32 (2023), 3580--3591.

Digital Library

[71]

Yijie Zhong, Bo Li, Lv Tang, Senyun Kuang, Shuang Wu, and Shouhong Ding. 2022. Detecting camouflaged object in frequency domain. In CVPR. 4494--4503.

[72]

Hongyu Zhou, Zheng Ge, Songtao Liu, Weixin Mao, Zeming Li, Haiyan Yu, and Jian Sun. 2022. Dense teacher: dense pseudo-labels for semi-supervised object detection. In ECCV, Vol. 13669. 35--50.

[73]

Hongwei Zhu, Peng Li, Haoran Xie, Xuefeng Yan, Dong Liang, Dapeng Chen, Mingqiang Wei, and Jing Qin. 2022. I can find you! boundary-guided separated attention network for camouflaged object detection. In AAAI. 3608--3616.

[74]

Jinjing Zhu, Yunhao Luo, Xu Zheng, Hao Wang, and Lin Wang. 2023. A good student is cooperative and reliable: cnn-transformer collaborative learning for semantic segmentation. In ICCV. 11686--11696.

[75]

Jinchao Zhu, Xiaoyu Zhang, Shuo Zhang, and Junnan Liu. 2021. Inferring camouflaged objects by texture-aware interactive guidance network. In AAAI. 3599--3607.

Index Terms

Semi-supervised Camouflaged Object Detection from Noisy Data
1. Computing methodologies
  1. Computer graphics
    1. Image manipulation

Recommendations

CamoTeacher: Dual-Rotation Consistency Learning for Semi-supervised Camouflaged Object Detection
Computer Vision – ECCV 2024
Abstract
Existing camouflaged object detection (COD) methods depend heavily on large-scale pixel-level annotations. However, acquiring such annotations is laborious due to the inherent camouflage characteristics of the objects. Semi-supervised learning ...
Dynamic updating self-training for semi-weakly supervised object detection
Abstract
Semi-weakly supervised object detection (SWSOD) needs to label only a small portion of images in the training set to train an initial detector. This detector is then used to select some reliable unlabeled images and generate pseudo ...
A Survey of Semi-Supervised Learning Methods
CIS '08: Proceedings of the 2008 International Conference on Computational Intelligence and Security - Volume 02

In traditional machine learning approaches to classification, one uses only a labelled set to train the classifier. Labelled instances however are often difficult, expensive, or time consuming to obtain, as they require the efforts of experienced human ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

MM '24: Proceedings of the 32nd ACM International Conference on Multimedia

October 2024

11719 pages

ISBN:9798400706868

DOI:10.1145/3664647

General Chairs:
Jianfei Cai
Monash University, Australia
,
Mohan Kankanhalli
NUS, Singapore
,
Balakrishnan Prabhakaran
UT Dallas, USA
,
Susanne Boll
University of Oldenburg, Germany
,
Program Chairs:
Ramanathan Subramanian
University of Canberra & IIT Ropar, Australia
,
Liang Zheng
Australian National University, Australia
,
Vivek K. Singh
Rutgers University, USA
,
Pablo Cesar
Centrum Wiskunde & Informatica, Netherlands
,
Lexing Xie
Australian National University, Australia
,
Dong Xu
University of Hong Kong, Hong Kong

Copyright © 2024 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 28 October 2024

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

National Natural Science Foundation of China
National Natural Science Foundation of China

Conference

MM '24

Sponsor:

SIGMM

MM '24: The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne VIC, Australia

Acceptance Rates

MM '24 Paper Acceptance Rate 1,150 of 4,385 submissions, 26%;

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
208
Total Downloads

Downloads (Last 12 months)208
Downloads (Last 6 weeks)141

Reflects downloads up to 05 Mar 2025

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten