skip to main content
10.1145/3503161.3548264acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Exploring Feature Compensation and Cross-level Correlation for Infrared Small Target Detection

Published: 10 October 2022 Publication History

Abstract

Single frame infrared small target (SIRST) detection is useful for many practical applications, such as maritime rescue. However, SIRST detection is challenging due to the low-contrast between small targets and noisy background in infrared images. To address this challenge, we propose a novel FC3-Net by exploring feature compensation and cross-level correlation for SIRST detection. Specifically, FC3-Net consists of a Fine-detail guided Multi-level Feature Compensation (F-MFC) module, and a Cross-level Feature Correlation (CFC) module. The F-MFC module aims to compensate the information loss of details caused by the downsampling layers in convolutional neural networks (CNN) via aggregating features from multiple adjacent levels, so that the detail features of small targets can be propagated to the deeper layers of the network. Besides, to suppress the side impact of background noise, the CFC module constructs an energy filtering kernel based on the higher-level features with less background noise to filter out the noise in the middle-level features, and fuse them with the low-level ones to learn a strong target representation. Putting them together into the encoder-decoder structure, our FC3-Net could produce an accurate target mask with fine shape and details. Experiment results on the public NUAA-SIRST and IRSTD-1k datasets demonstrate that the proposed FC3-Net outperforms state-of-the-art methods in terms of both pixel-level and object-level metrics. The code will be released at https://github.com/IPIC-Lab/SIRST-Detection-FC3-Net.

Supplementary Material

MP4 File (MM22-fp02194.mp4)
Here is a video description of our work "Exploring Feature Compensation and Cross-level Correlation for Infrared Small Target Detection". Single frame infrared small target detection is useful for many applications, such as maritime rescue. However, subsampling operations in convolutional neural networks can cause small targets to disappear deep into the network. In addition, the signal-to-noise ratio of small infrared targets tends to be low. Small targets are affected by a large amount of background clutter, which seriously affects the detection effect. To address this challenge, we propose a novel FC3-Net. Specifically, FC3-Net consists of a Fine-detail guided Multi-level Feature Compensation (F-MFC) module and a Cross-level Feature Correlation (CFC) module. F-MFC mitigates the irreversible loss and CFC suppresses the background noise and improves the target signal-to-noise ratio. Extensive experiments have verified the effectiveness of FC3-Net.

References

[1]
Xiangzhi Bai and Fugen. Zhou. 2010. Analysis of new top-hat transformation and the application for infrared dim small target detection. Pattern Recognition 43, 6 (2010), 2145--2156.
[2]
Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, and Alan L Yuille. 2017. Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE transactions on pattern analysis and machine intelligence 40, 4 (2017), 834--848.
[3]
Liang-Chieh Chen, George Papandreou, Florian Schroff, and Hartwig Adam. 2017. Rethinking atrous convolution for semantic image segmentation. arXiv preprint arXiv:1706.05587 (2017).
[4]
Yuwen Chen and Yunhong. Xin. 2016. An Efficient Infrared Small Target Detection Method Based on Visual Contrast Mechanism. IEEE Geoscience and Remote Sensing Letters 13, 7 (2016), 962--966.
[5]
Yimian Dai and Yiquan Wu. 2017. Reweighted infrared patch-tensor model with both nonlocal and local priors for single-frame small target detection. IEEE journal of selected topics in applied earth observations and remote sensing 10, 8 (2017), 3752--3767.
[6]
Yimian Dai, Yiquan Wu, Fei Zhou, and Kobus. Barnard. 2021. Asymmetric contextual modulation for infrared small target detection. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 950--959.
[7]
Yimian Dai, Yiquan Wu, Fei Zhou, and Kobus Barnard. 2021. Attentional local contrast networks for infrared small target detection. IEEE Transactions on Geoscience and Remote Sensing 59, 11 (2021), 9813--9824.
[8]
He Deng, Xianping Sun, Maili Liu, Chaohui Ye, and Xin. Zhou. 2016. Small Infrared Target Detection Based on Weighted Local Difference Measure. IEEE Transactions on Geoscience and Remote Sensing 54, 7 (2016), 4204--4214.
[9]
Suyog D Deshpande, Meng Hwa Er, Ronda Venkateswarlu, and Philip. Chan. 1999. Max-mean and max-median filters for detection of small targets. In Signal and Data Processing of Small Targets 1999. 74--83.
[10]
Ayush Dogra and Parvinder Bhalla. 2014. Image sharpening by gaussian and butterworth high pass filter. Biomedical and pharmacology journal 7, 2 (2014), 707--713.
[11]
Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, et al. 2020. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. In International Conference on Learning Representations.
[12]
Chenqiang Gao, Deyu Meng, Yi Yang, YongtaoWang, Xiaofang Zhou, and Alexander G. Hauptmann. 2013. Infrared patch-image model for small target detection in a single image. IEEE transactions on image processing 22, 12 (2013), 4996--5009.
[13]
Li Gao, Jing Zhang, Lefei Zhang, and Dacheng Tao. 2021. Dsp: Dual soft-paste for unsupervised domain adaptive semantic segmentation. In Proceedings of the 29th ACM International Conference on Multimedia. 2825--2833.
[14]
Jinhui Han, Saed Moradi, Iman Faramarzi, Chengyin Liu, Honghui Zhang, and Qian Zhao. 2019. A local contrast method for infrared small-target detection utilizing a tri-layer window. IEEE Geoscience and Remote Sensing Letters 17, 10 (2019), 1822--1826.
[15]
Jinhui Han, Saed Moradi, Iman Faramarzi, Honghui Zhang, Qian Zhao, Xiaojian Zhang, and Nan Li. 2020. Infrared small target detection based on the weighted strengthened local contrast measure. IEEE Geoscience and Remote Sensing Letters 18, 9 (2020), 1670--1674.
[16]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian. Sun. 2015. Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. In Proceedings of the IEEE international conference on computer vision. 1026--1034.
[17]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770--778.
[18]
Boyang Li, Chao Xiao, Longguang Wang, Yingqian Wang, Zaiping Lin, Miao Li, Wei An, and Yulan Guo. 2021. Dense nested attention network for infrared small target detection. arXiv preprint arXiv:2106.00487 (2021).
[19]
Xiang Li, Wenhai Wang, Xiaolin Hu, and Jian. Yang. 2019. Selective kernel networks. In Proc. IEEE Conf. Comput. Vis. Pattern Recognit. 510--519.
[20]
Chen Liang-Chieh, George Papandreou, Iasonas Kokkinos, Kevin Murphy, and Alan Yuille. 2015. Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs. In International Conference on Learning Representations.
[21]
Jiawei Liu, Zheng-Jun Zha, QI Tian, Dong Liu, Ting Yao, Qiang Ling, and Tao. Mei. 2016. Multi-scale triplet cnn for person re-identification. In Proceedings of the 24th ACM international conference on Multimedia. 192--196.
[22]
Lingyi Lu and Xin Xu. 2021. Visible-Infrared Cross-Modal Person Reidentification based on Positive Feedback. In ACM Multimedia Asia. 1--6.
[23]
Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali. Farhadi. 2016. You only look once: Unified, real-time object detection. In Proc. IEEE Conf. Comput. Vis. Pattern Recognit. 779--788.
[24]
Carole H Sudre,Wenqi Li, Tom Vercauteren, Sebastien Ourselin, and M Jorge Cardoso. 2017. Generalised dice overlap as a deep learning loss function for highly unbalanced segmentations. In Deep learning in medical image analysis and multimodal learning for clinical decision support. Springer, 240--248.
[25]
Ke Sun, Yang Zhao, Borui Jiang, Tianheng Cheng, Bin Xiao, Dong Liu, Yadong Mu, Xinggang Wang, Wenyu Liu, and Jingdong Wang. 2019. High-resolution representations for labeling pixels and regions. arXiv preprint arXiv:1904.04514 (2019).
[26]
Yang Sun, Jungang Yang, and Wei An. 2020. Infrared dim and small target detection via multiple subspace learning and spatial-temporal patch-tensor model. IEEE Transactions on Geoscience and Remote Sensing 59, 5 (2020), 3737--3752.
[27]
Karasawa Takumi, KoheiWatanabe, Qishen Ha, Antonio Tejero-De-Pablos, Yoshitaka Ushiku, and Tatsuya Harada. 2017. Multispectral object detection for autonomous vehicles. In Proceedings of the on Thematic Workshops of ACM Multimedia 2017. 35--43.
[28]
Victor T Tom, Tamar Peli, May Leung, and Joseph E. Bondaryk. 1993. Morphologybased algorithm for point target detection in infrared backgrounds. In Signal and Data Processing of Small Targets 1993. International Society for Optics and Photonics, 2--11.
[29]
Chaoyue Wang, Chang Xu, Chaohui Wang, and Dacheng Tao. 2018. Perceptual adversarial networks for image-to-image transformation. IEEE Transactions on Image Processing 27, 8 (2018), 4066--4079.
[30]
HuanWang, Luping Zhou, and Lei.Wang. 2019. Miss detection vs. false alarm: Adversarial learning for small object segmentation in infrared images. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 8509--8518.
[31]
Xing Wei, Diangang Li, Xiaopeng Hong, Wei Ke, and Yihong. Gong. 2020. Coattentive lifting for infrared-visible person re-identification. In Proceedings of the 28th ACM International Conference on Multimedia. 1028--1037.
[32]
Bai Xiangzhi, Zhou Fugen, Xie Yongchun, and Jin Ting. 2008. Modified top-hat transformation based on contour structuring element to detect infrared small target. In 3rd IEEE Conference on Industrial Electronics and Applications. 575--579.
[33]
Yufei Xu, Qiming Zhang, Jing Zhang, and Dacheng Tao. 2021. Vitae: Vision transformer advanced by exploring intrinsic inductive bias. Advances in Neural Information Processing Systems 34 (2021).
[34]
Xudong Yan, Huaidong Zhang, Xuemiao Xu, Xiaowei Hu, and Pheng-Ann. Heng. 2021. Learning Semantic Context from Normal Samples for Unsupervised Anomaly Detection. In Proceedings of the AAAI Conference on Artificial Intelligence. 3110--3118.
[35]
Bin Ye and Jia-xiong Peng. 2002. Small target detection method based on morphology top-hat operator. Journal of Image and Graphics 7, 7 (2002), 638--642.
[36]
Farah H Zawaideh, Qais M Yousef, and Firas H Zawaideh. 2017. Comparison between butterworth and gaussian high-pass filters using an enhanced method. International Journal of Computer Science and Network Security 17, 7 (2017), 113-- 117.
[37]
Ming Zeng, Jianxun Li, and Zhang Peng. 2006. The design of top-hat morphological filter and application to infrared target detection. Infrared physics & technology 48, 1 (2006), 67--76.
[38]
Jing Zhang and Dacheng Tao. 2020. Empowering things with intelligence: a survey of the progress, challenges, and opportunities in artificial intelligence of things. IEEE Internet of Things Journal 8, 10 (2020), 7789--7817.
[39]
Landan Zhang, Lingbing Peng, Tianfang Zhang, Siying Cao, and Zhenming Peng. 2018. Infrared small target detection via non-convex rank approximation minimization joint l2, 1 norm. Remote Sensing 10, 11 (2018), 1821.
[40]
Landan Zhang and Zhenming Peng. 2019. Infrared small target detection based on partial sum of the tensor nuclear norm. Remote Sensing 11, 4 (2019), 382.
[41]
Mingjin Zhang, Rui Zhang, Yuxiang Yang, Haichen Bai, Jing Zhang, and Jie Guo. 2022. ISNet: Shape Matters for Infrared Small Target Detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 877--886.
[42]
Pingping Zhang, Dong Wang, Huchuan Lu, Hongyu Wang, and Xiang Ruan. 2017. Amulet: Aggregating multi-level convolutional features for salient object detection. In Proceedings of the IEEE international conference on computer vision. 202--211.
[43]
Qiming Zhang, Yufei Xu, Jing Zhang, and Dacheng Tao. 2022. Vitaev2: Vision transformer advanced by exploring inductive bias for image recognition and beyond. arXiv preprint arXiv:2202.10108 (2022).
[44]
Qiming Zhang, Yufei Xu, Jing Zhang, and Dacheng Tao. 2022. VSA: Learning Varied-Size Window Attention in Vision Transformers. arXiv preprint arXiv:2204.08446 (2022).
[45]
Qiming Zhang, Jing Zhang, Wei Liu, and Dacheng Tao. 2019. Category anchorguided unsupervised domain adaptation for semantic segmentation. Advances in Neural Information Processing Systems 32 (2019).
[46]
Hengshuang Zhao, Jianping Shi, Xiaojuan Qi, XiaogangWang, and Jiaya Jia. 2017. Pyramid scene parsing network. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2881--2890.
[47]
Hu Zhu, Shiming Liu, Lizhen Deng, Yansheng Li, and Fu. Xiao. 2020. Infrared Small Target Detection via Low-Rank Tensor Completion With Top-Hat Regularization. IEEE Transactions on Geoscience and Remote Sensing 58, 2 (2020), 1004--1016.

Cited By

View all
  • (2025)Enhancing Infrared Small Target Detection: A Saliency-Guided Multi-Task Learning ApproachIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2024.352042426:3(3603-3618)Online publication date: Mar-2025
  • (2025)5-D spatial–temporal information-based infrared small target detection in complex environmentsPattern Recognition10.1016/j.patcog.2024.111003158(111003)Online publication date: Feb-2025
  • (2025)A fully locally selective large kernel network for traffic video detectionMeasurement10.1016/j.measurement.2024.115779242(115779)Online publication date: Jan-2025
  • Show More Cited By

Index Terms

  1. Exploring Feature Compensation and Cross-level Correlation for Infrared Small Target Detection

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      MM '22: Proceedings of the 30th ACM International Conference on Multimedia
      October 2022
      7537 pages
      ISBN:9781450392037
      DOI:10.1145/3503161
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 10 October 2022

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. deep learning
      2. filtering
      3. infrared small target detection

      Qualifiers

      • Research-article

      Funding Sources

      • EquipmentAdvance Research Field Fund Project
      • Young Elite Scientists Sponsorship Program by CAST
      • Youth Talent Promotion Project of Shaanxi University Science and Technology Association
      • Shaanxi Province Key Research and Development Program Project
      • Special Project on Technological Innovation and Application Development
      • National Natural Science Foundation of China
      • Chongqing Excellent Scientist Project

      Conference

      MM '22
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)157
      • Downloads (Last 6 weeks)14
      Reflects downloads up to 01 Mar 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2025)Enhancing Infrared Small Target Detection: A Saliency-Guided Multi-Task Learning ApproachIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2024.352042426:3(3603-3618)Online publication date: Mar-2025
      • (2025)5-D spatial–temporal information-based infrared small target detection in complex environmentsPattern Recognition10.1016/j.patcog.2024.111003158(111003)Online publication date: Feb-2025
      • (2025)A fully locally selective large kernel network for traffic video detectionMeasurement10.1016/j.measurement.2024.115779242(115779)Online publication date: Jan-2025
      • (2024)A Multi-Scale Mask Convolution-Based Blind-Spot Network for Hyperspectral Anomaly DetectionRemote Sensing10.3390/rs1616303616:16(3036)Online publication date: 18-Aug-2024
      • (2024)Multi-Scale Object Detection in Remote Sensing Images Based on Feature Interaction and Gaussian DistributionRemote Sensing10.3390/rs1611198816:11(1988)Online publication date: 31-May-2024
      • (2024)SAR-to-Optical Image Translation via an Interpretable NetworkRemote Sensing10.3390/rs1602024216:2(242)Online publication date: 8-Jan-2024
      • (2024)Explore Hybrid Modeling for Moving Infrared Small Target DetectionProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3680887(6172-6181)Online publication date: 28-Oct-2024
      • (2024)Unleashing the Power of Generic Segmentation Model: A Simple Baseline for Infrared Small Target DetectionProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3680609(10392-10401)Online publication date: 28-Oct-2024
      • (2024)Learning Contrast-Enhanced Shape-Biased Representations for Infrared Small Target DetectionIEEE Transactions on Image Processing10.1109/TIP.2024.339101133(3047-3058)Online publication date: 2024
      • (2024)Dual-Stream Edge-Target Learning Network for Infrared Small Target DetectionIEEE Transactions on Geoscience and Remote Sensing10.1109/TGRS.2024.348805462(1-14)Online publication date: 2024
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media