skip to main content
10.1145/3503161.3548178acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

PreyNet: Preying on Camouflaged Objects

Published: 10 October 2022 Publication History

Abstract

Species often adopt various camouflage strategies to be seamlessly blended into the surroundings for self-protection. To figure out the concealment, predators have evolved excellent hunting skills. Exploring the intrinsic mechanisms of the predation behavior can offer more insightful glimpse into the task of camouflaged object detection (COD). In this work, we strive to seek answers for accurate COD and propose a PreyNet, which mimics the two processes of predation, namely, initial detection (sensory mechanism) and predator learning (cognitive mechanism). To exploit the sensory process, a bidirectional bridging interaction module (BBIM) is designed for selecting and aggregating initial features in an attentive manner. The predator learning process is formulated as a policy-and-calibration paradigm, with the goal of deciding on uncertain regions and encouraging targeted feature calibration. Besides, we obtain adaptive weight for multi-layer supervision during training via computing on the uncertainty estimation. Extensive experiments demonstrate that our model produces state-of-the-art results on several benchmarks. We further verify the scalability of the predator learning paradigm through applications on top-ranking salient object detection models. Our code is publicly available at \urlhttps://github.com/OIPLab-DUT/PreyNet.

References

[1]
Charles Blundell, Julien Cornebise, Koray Kavukcuoglu, and Daan Wierstra. 2015. Weight uncertainty in neural network. In International conference on machine learning. PMLR, 1613--1622.
[2]
Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, and Alan L Yuille. 2017. Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE transactions on pattern analysis and machine intelligence 40, 4 (2017), 834--848.
[3]
Shuhan Chen and Yun Fu. 2020. Progressively guided alternate refinement network for rgb-d salient object detection. In European Conference on Computer Vision. Springer, 520--538.
[4]
Zuyao Chen, Qianqian Xu, Runmin Cong, and Qingming Huang. 2020. Global context-aware progressive aggregation network for salient object detection. In Proceedings of the AAAI conference on artificial intelligence, Vol. 34. 10599--10606.
[5]
Hung-Kuo Chu, Wei-Hsin Hsu, Niloy J Mitra, Daniel Cohen-Or, Tien-Tsin Wong, and Tong-Yee Lee. 2010. Camouflage images. ACM Trans. Graph. 29, 4 (2010), 51--1.
[6]
Deng-Ping Fan, Ming-Ming Cheng, Yun Liu, Tao Li, and Ali Borji. 2017. Structuremeasure: A new way to evaluate foreground maps. In Proceedings of the IEEE international conference on computer vision. 4548--4557.
[7]
Deng-Ping Fan, Cheng Gong, Yang Cao, Bo Ren, Ming-Ming Cheng, and Ali Borji. 2018. Enhanced-alignment measure for binary foreground map evaluation. arXiv preprint arXiv:1805.10421 (2018).
[8]
Deng-Ping Fan, Ge-Peng Ji, Ming-Ming Cheng, and Ling Shao. 2021. Concealed object detection. IEEE Transactions on Pattern Analysis and Machine Intelligence (2021).
[9]
Deng-Ping Fan, Ge-Peng Ji, Guolei Sun, Ming-Ming Cheng, Jianbing Shen, and Ling Shao. 2020. Camouflaged object detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2777--2787.
[10]
Deng-Ping Fan, Ge-Peng Ji, Tao Zhou, Geng Chen, Huazhu Fu, Jianbing Shen, and Ling Shao. 2020. Pranet: Parallel reverse attention network for polyp segmentation. In International conference on medical image computing and computer-assisted intervention. Springer, 263--273.
[11]
Deng-Ping Fan, Tao Zhou, Ge-Peng Ji, Yi Zhou, Geng Chen, Huazhu Fu, Jianbing Shen, and Ling Shao. 2020. Inf-net: Automatic covid-19 lung infection segmentation from ct images. IEEE Transactions on Medical Imaging 39, 8 (2020), 2626--2637.
[12]
Yarin Gal and Zoubin Ghahramani. 2016. Dropout as a bayesian approximation: Representing model uncertainty in deep learning. In international conference on machine learning. PMLR, 1050--1059.
[13]
Shang-Hua Gao, Ming-Ming Cheng, Kai Zhao, Xin-Yu Zhang, Ming-Hsuan Yang, and Philip Torr. 2019. Res2net: A new multi-scale backbone architecture. IEEE transactions on pattern analysis and machine intelligence 43, 2 (2019), 652--662.
[14]
Ke Gu, Dacheng Tao, Jun-Fei Qiao, and Weisi Lin. 2017. Learning a no-reference quality assessment model of enhanced images with big data. IEEE transactions on neural networks and learning systems 29, 4 (2017), 1301--1313.
[15]
Ke Gu, Guangtao Zhai, Weisi Lin, and Min Liu. 2015. The analysis of image contrast: From quality assessment to automatic enhancement. IEEE transactions on cybernetics 46, 1 (2015), 284--297.
[16]
Ke Gu, Guangtao Zhai, Weisi Lin, Xiaokang Yang, and Wenjun Zhang. 2015. No-reference image sharpness assessment in autoregressive parameter space. IEEE Transactions on Image Processing 24, 10 (2015), 3218--3231.
[17]
Ke Gu, Guangtao Zhai, Xiaokang Yang, Wenjun Zhang, and Chang Wen Chen. 2014. Automatic contrast enhancement technology with saliency preservation. IEEE Transactions on Circuits and Systems for Video Technology 25, 9 (2014), 1480--1494.
[18]
Kaiming He, Georgia Gkioxari, Piotr Dollár, and Ross Girshick. 2017. Mask r-cnn. In Proceedings of the IEEE international conference on computer vision. 2961--2969.
[19]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770--778.
[20]
Jianqin Yin Yanbin Han Wendi Hou and Jinping Li. 2011. Detection of the mobile object with camouflage color under dynamic background based on optical flow. Procedia Engineering 15 (2011), 2201--2205.
[21]
Po-Yu Huang, Wan-Ting Hsu, Chun-Yueh Chiu, Ting-Fan Wu, and Min Sun. 2018. Efficient uncertainty estimation for semantic segmentation in videos. In Proceedings of the European Conference on Computer Vision (ECCV). 520--535.
[22]
Iván Huerta, Daniel Rowe, Mikhail Mozerov, and Jordi Gonzàlez. 2007. Improving background subtraction based on a casuistry of colour-motion segmentation problems. In Iberian Conference on Pattern Recognition and Image Analysis. Springer, 475--482.
[23]
Alex Kendall, Vijay Badrinarayanan, and Roberto Cipolla. 2015. Bayesian segnet: Model uncertainty in deep convolutional encoder-decoder architectures for scene understanding. arXiv preprint arXiv:1511.02680 (2015).
[24]
Alex Kendall and Yarin Gal. 2017. What uncertainties do we need in bayesian deep learning for computer vision? Advances in neural information processing systems 30 (2017).
[25]
Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
[26]
Philipp Krähenbühl and Vladlen Koltun. 2011. Efficient inference in fully connected crfs with gaussian edge potentials. Advances in neural information processing systems 24 (2011).
[27]
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2012. Imagenet classification with deep convolutional neural networks. Advances in neural information processing systems 25 (2012).
[28]
Yongchan Kwon, Joong-HoWon, Beom Joon Kim, and Myunghee Cho Paik. 2020. Uncertainty quantification using Bayesian neural networks in classification: Application to biomedical image segmentation. Computational Statistics & Data Analysis 142 (2020), 106816.
[29]
Cynthia M Langley, Donald A Riley, Alan B Bond, and Namni Goel. 1996. Visual search for natural grains in pigeons (Columba livia): search images and selective attention. Journal of Experimental Psychology: Animal Behavior Processes 22, 2 (1996), 139.
[30]
Trung-Nghia Le, Tam V Nguyen, Zhongliang Nie, Minh-Triet Tran, and Akihiro Sugimoto. 2019. Anabranch network for camouflaged object segmentation. Computer Vision and Image Understanding 184 (2019), 45--56.
[31]
Tsung-Yi Lin, Piotr Dollár, Ross Girshick, Kaiming He, Bharath Hariharan, and Serge Belongie. 2017. Feature pyramid networks for object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2117--2125.
[32]
Nian Liu, Junwei Han, and Ming-Hsuan Yang. 2018. Picanet: Learning pixel-wise contextual attention for saliency detection. In Proceedings of the IEEE conference on computer vision and pattern recognition. 3089--3098.
[33]
Wei Liu, Andrew Rabinovich, and Alexander C Berg. 2015. Parsenet: Looking wider to see better. arXiv preprint arXiv:1506.04579 (2015).
[34]
Yunqiu Lv, Jing Zhang, Yuchao Dai, Aixuan Li, Bowen Liu, Nick Barnes, and Deng-Ping Fan. 2021. Simultaneously localize, segment and rank the camouflaged objects. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 11591--11601.
[35]
Mingcan Ma12, Changqun Xia, and Jia Li123. 2021. Pyramidal feature shrinking for salient object detection. (2021).
[36]
Wesley J Maddox, Pavel Izmailov, Timur Garipov, Dmitry P Vetrov, and Andrew Gordon Wilson. 2019. A simple baseline for bayesian uncertainty in deep learning. Advances in Neural Information Processing Systems 32 (2019).
[37]
Ran Margolin, Lihi Zelnik-Manor, and Ayellet Tal. 2014. How to evaluate foreground maps?. In Proceedings of the IEEE conference on computer vision and pattern recognition. 248--255.
[38]
Haiyang Mei, Ge-Peng Ji, Ziqi Wei, Xin Yang, Xiaopeng Wei, and Deng-Ping Fan. 2021. Camouflaged object segmentation with distraction mining. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 8772--8781.
[39]
Yuxin Pan, Yiwang Chen, Qiang Fu, Ping Zhang, and Xin Xu. 2011. Study on the camouflaged target detection method based on 3D convexity. Modern Applied Science 5, 4 (2011), 152.
[40]
Youwei Pang, Xiaoqi Zhao, Lihe Zhang, and Huchuan Lu. 2020. Multi-scale interactive network for salient object detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 9413--9422.
[41]
Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, et al. 2019. Pytorch: An imperative style, high-performance deep learning library. Advances in neural information processing systems 32 (2019).
[42]
Ricardo Pérez-de la Fuente, Xavier Delclòs, Enrique Peñalver, Mariela Speranza, Jacek Wierzchos, Carmen Ascaso, and Michael S Engel. 2012. Early evolution and ecology of camouflage in insects. Proceedings of the National Academy of Sciences 109, 52 (2012), 21414--21419.
[43]
Donald A Riley and HL Roitblat. 2018. Selective attention and related cognitive processes in pigeons. In Cognitive processes in animal behavior. Routledge, 249--276.
[44]
Hippolyt Ritter, Aleksandar Botev, and David Barber. 2018. A scalable laplace approximation for neural networks. In 6th International Conference on Learning Representations, ICLR 2018-Conference Track Proceedings, Vol. 6. International Conference on Representation Learning.
[45]
P Sengottuvelan, Amitabh Wahi, and A Shanmugam. 2008. Performance of decamouflaging through exploratory image analysis. In 2008 First International Conference on Emerging Trends in Engineering and Technology. IEEE, 6--10.
[46]
Murat Sensoy, Lance Kaplan, and Melih Kandemir. 2018. Evidential deep learning to quantify classification uncertainty. Advances in Neural Information Processing Systems 31 (2018).
[47]
Przemyslaw Skurowski, Hassan Abdulameer, J Blaszczyk, Tomasz Depta, Adam Kornacki, and P Koziel. 2018. Animal camouflage analysis: Chameleon database. Unpublished manuscript 2, 6 (2018), 7.
[48]
Martin Stevens and Sami Merilaita. 2009. Animal camouflage: current issues and new perspectives. Philosophical Transactions of the Royal Society B: Biological Sciences 364, 1516 (2009), 423--427.
[49]
Yujia Sun, Geng Chen, Tao Zhou, Yi Zhang, and Nian Liu. 2021. Context-aware Cross-level Fusion Network for Camouflaged Object Detection. arXiv preprint arXiv:2105.12555 (2021).
[50]
Jolyon Troscianko, Alice E Lown, Anna E Hughes, and Martin Stevens. 2013. Defeating crypsis: detection and learning of camouflage strategies. PloS one 8, 9 (2013), e73733.
[51]
Bo Wang, Quan Chen, Min Zhou, Zhiqiang Zhang, Xiaogang Jin, and Kun Gai. 2020. Progressive feature polishing network for salient object detection. In Proceedings of the AAAI conference on artificial intelligence, Vol. 34. 12128--12135.
[52]
Jun Wei, Shuhui Wang, and Qingming Huang. 2020. F3Net: fusion, feedback and focus for salient object detection. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 12321--12328.
[53]
Sanghyun Woo, Jongchan Park, Joon-Young Lee, and In So Kweon. 2018. Cbam: Convolutional block attention module. In Proceedings of the European conference on computer vision (ECCV). 3--19.
[54]
Zhe Wu, Li Su, and Qingming Huang. 2019. Cascaded partial decoder for fast and accurate salient object detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 3907--3916.
[55]
Fan Yang, Qiang Zhai, Xin Li, Rui Huang, Ao Luo, Hong Cheng, and Deng-Ping Fan. 2021. Uncertainty-guided transformer reasoning for camouflaged object detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 4146--4155.
[56]
Tianyuan Yu, Da Li, Yongxin Yang, Timothy M Hospedales, and Tao Xiang. 2019. Robust person re-identification by modelling feature uncertainty. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 552--561.
[57]
Qiang Zhai, Xin Li, Fan Yang, Chenglizhao Chen, Hong Cheng, and Deng-Ping Fan. 2021. Mutual graph learning for camouflaged object detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 12997--13007.
[58]
Miao Zhang, Tingwei Liu, Yongri Piao, Shunyu Yao, and Huchuan Lu. 2021. Auto-msfnet: Search multi-scale fusion network for salient object detection. In Proceedings of the 29th ACM International Conference on Multimedia. 667--676.
[59]
Zhirui Zhao, Changqun Xia, Chenxi Xie, and Jia Li. 2021. Complementary Trilateral Decoder for Fast and Accurate Salient Object Detection. In Proceedings of the 29th ACM International Conference on Multimedia. 4967--4975.
[60]
Zongwei Zhou, Md Mahfuzur Rahman Siddiquee, Nima Tajbakhsh, and Jianming Liang. 2018. Unet: A nested u-net architecture for medical image segmentation. In Deep learning in medical image analysis and multimodal learning for clinical decision support. Springer, 3--11.
[61]
Jinchao Zhu, Xiaoyu Zhang, Shuo Zhang, and Junnan Liu. 2021. Inferring camouflaged objects by texture-aware interactive guidance network. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. 3599--3607.

Cited By

View all
  • (2025)IRFNet: Cognitive-Inspired Iterative Refinement Fusion Network for Camouflaged Object DetectionSensors10.3390/s2505155525:5(1555)Online publication date: 3-Mar-2025
  • (2025)Cross-Layer Semantic Guidance Network for Camouflaged Object DetectionElectronics10.3390/electronics1404077914:4(779)Online publication date: 17-Feb-2025
  • (2025)Conditional Diffusion Models for Camouflaged and Salient Object DetectionIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2025.352746947:4(2833-2848)Online publication date: Apr-2025
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '22: Proceedings of the 30th ACM International Conference on Multimedia
October 2022
7537 pages
ISBN:9781450392037
DOI:10.1145/3503161
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 October 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. camouflaged object detection
  2. predator learning
  3. sensory mechanism

Qualifiers

  • Research-article

Funding Sources

  • the Natural Science Foundation of Liaoning Province
  • the Central Government Guided Local Science and Technology Development Funds of Liaoning Province
  • the National Natural Science Foundation of China

Conference

MM '22
Sponsor:

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)135
  • Downloads (Last 6 weeks)7
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2025)IRFNet: Cognitive-Inspired Iterative Refinement Fusion Network for Camouflaged Object DetectionSensors10.3390/s2505155525:5(1555)Online publication date: 3-Mar-2025
  • (2025)Cross-Layer Semantic Guidance Network for Camouflaged Object DetectionElectronics10.3390/electronics1404077914:4(779)Online publication date: 17-Feb-2025
  • (2025)Conditional Diffusion Models for Camouflaged and Salient Object DetectionIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2025.352746947:4(2833-2848)Online publication date: Apr-2025
  • (2025)Progressive Region-to-Boundary Exploration Network for Camouflaged Object DetectionIEEE Transactions on Multimedia10.1109/TMM.2024.352176127(236-248)Online publication date: 1-Jan-2025
  • (2025)DPSNet: A Detail Perception Synergistic Network for Camouflaged Object DetectionIEEE Transactions on Instrumentation and Measurement10.1109/TIM.2024.349718174(1-14)Online publication date: 2025
  • (2025)Hunt Camouflaged Objects via Revealing Mutation RegionsIEEE Transactions on Information Forensics and Security10.1109/TIFS.2025.353070320(1836-1851)Online publication date: 2025
  • (2025)CoNet: A Consistency-Oriented Network for Camouflaged Object SegmentationIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2024.346246535:1(287-299)Online publication date: Jan-2025
  • (2025)CSFIN: A lightweight network for camouflaged object detection via cross-stage feature interactionExpert Systems with Applications10.1016/j.eswa.2025.126451269(126451)Online publication date: Apr-2025
  • (2025)Semantic-spatial guided context propagation network for camouflaged object detectionApplied Intelligence10.1007/s10489-025-06264-055:5Online publication date: 20-Jan-2025
  • (2025)Camouflaged object detection via boundary refinementMultimedia Systems10.1007/s00530-024-01662-931:1Online publication date: 8-Jan-2025
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media