Camouflaged target detection based on multimodal image input pixel-level fusion

Peng, Ruihui; Lai, Jie; Yang, Xueting; Sun, Dianxing; Tan, Shuncheng; Song, Yingjuan; Guo, Wei

doi:10.1631/FITEE.2300503

Research Article
Published: 30 September 2024

Volume 25, pages 1226–1239 (2024)

Frontiers of Information Technology & Electronic Engineering

Ruihui Peng (彭锐晖)¹,²,
Jie Lai (赖杰)¹ (ORCID: orcid.org/0009-0005-7918-0941),
Xueting Yang (杨雪婷)¹,
Dianxing Sun (孙殿星)¹,³,
Shuncheng Tan (谭顺成)³,
Yingjuan Song (宋颖娟)¹ &
…
Wei Guo (郭伟)¹

Abstract

Camouflaged targets are nonsalient targets whose foreground blends closely into the background and which carry very little distinguishing feature information, making them extremely difficult to recognize. Most camouflaged target detection algorithms use only single-band information about the target, resulting in low detection accuracy and a high missed detection rate. In this paper we present a camouflaged target detection technique based on multimodal image fusion (MIF-YOLOv5). First, a multimodal image input performs pixel-level fusion of the optical and infrared images of the camouflaged target, enriching its effective feature information. Second, a loss function is designed, and the K-Means++ clustering algorithm is used to optimize the target anchor boxes for the dataset, improving the accuracy and robustness of camouflaged personnel detection. Finally, a comprehensive detection index for camouflaged targets is proposed to compare the overall effectiveness of different approaches. More importantly, we construct a multispectral camouflaged target dataset to test the proposed technique. Experimental results show that the proposed method has the best comprehensive detection performance, with a detection accuracy of 96.5%, a recognition probability of 92.5%, an increase of 1×10⁴ in the number of parameters, an increase of 0.03 GFLOPs in theoretical computation, and a comprehensive detection index of 0.85. Its advantage in detection accuracy is also apparent in performance comparisons with other target detection algorithms.
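
The first two steps named in the abstract can be illustrated with a small sketch. The abstract does not state the fusion rule, the loss function, or the clustering distance, so everything below is an assumption made for illustration and not the authors' implementation: fusion is shown as stacking a registered infrared image onto the optical image as a fourth channel, and anchor clustering uses K-Means++ seeding with 1 - IoU as the distance between box sizes. The function names `fuse_pixel_level`, `wh_iou`, and `kmeanspp_anchors` are hypothetical.

```python
import numpy as np


def fuse_pixel_level(rgb, ir):
    """Stack a registered infrared image onto an RGB image as a 4th channel.

    Assumption: channel concatenation is one simple form of input-side
    pixel-level fusion; the paper's actual fusion rule may differ.
    """
    if rgb.shape[:2] != ir.shape[:2]:
        raise ValueError("optical and infrared images must be pixel-aligned")
    if ir.ndim == 2:
        ir = ir[..., None]  # (H, W) -> (H, W, 1)
    fused = np.concatenate([rgb, ir], axis=-1).astype(np.float32) / 255.0
    return fused  # shape (H, W, 4), values in [0, 1] for 8-bit inputs


def wh_iou(boxes, anchors):
    """IoU between (N, 2) box sizes and (K, 2) anchor sizes sharing one corner."""
    inter = (np.minimum(boxes[:, None, 0], anchors[None, :, 0])
             * np.minimum(boxes[:, None, 1], anchors[None, :, 1]))
    union = boxes[:, 0:1] * boxes[:, 1:2] + anchors[:, 0] * anchors[:, 1] - inter
    return inter / union  # shape (N, K)


def kmeanspp_anchors(boxes, k, iters=50, seed=0):
    """Cluster (width, height) pairs into k anchors (assumed distance: 1 - IoU)."""
    rng = np.random.default_rng(seed)
    boxes = np.asarray(boxes, dtype=np.float64)
    n = len(boxes)

    # K-Means++ seeding: the first centre is drawn uniformly; each further
    # centre is drawn with probability proportional to the squared distance
    # to the nearest centre chosen so far.
    centres = [boxes[rng.integers(n)]]
    while len(centres) < k:
        dist = 1.0 - wh_iou(boxes, np.array(centres)).max(axis=1)
        p = dist ** 2
        p = p / p.sum() if p.sum() > 0 else np.full(n, 1.0 / n)
        centres.append(boxes[rng.choice(n, p=p)])
    anchors = np.array(centres)

    # Lloyd iterations: assign each box to its best-IoU anchor, then move
    # each anchor to the mean size of the boxes assigned to it.
    for _ in range(iters):
        assign = wh_iou(boxes, anchors).argmax(axis=1)
        new = np.array([
            boxes[assign == j].mean(axis=0) if np.any(assign == j) else anchors[j]
            for j in range(k)
        ])
        if np.allclose(new, anchors):
            break
        anchors = new

    # Return anchors ordered from smallest to largest area.
    return anchors[np.argsort(anchors.prod(axis=1))]
```

In such a setup the fused four-channel array would feed a detector whose first convolution accepts four input channels, and the clustered sizes would replace the default anchor boxes; both are design choices of this sketch and are not confirmed by the abstract.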

Data availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Author information

Authors and Affiliations

Qingdao Innovation and Development Base, Harbin Engineering University, Qingdao, 266000, China
Ruihui Peng (彭锐晖), Jie Lai (赖杰), Xueting Yang (杨雪婷), Dianxing Sun (孙殿星), Yingjuan Song (宋颖娟) & Wei Guo (郭伟)
College of Information and Communication Engineering, Harbin Engineering University, Harbin, 150001, China
Ruihui Peng (彭锐晖)
Institute of Information Fusion, Naval Aeronautical University, Yantai, 264001, China
Dianxing Sun (孙殿星) & Shuncheng Tan (谭顺成)

Contributions

Ruihui PENG and Jie LAI designed the research. Jie LAI devised the experimental method for acquiring camouflage target datasets. Xueting YANG, Yingjuan SONG, and Wei GUO collaborated to complete data collection. Jie LAI and Xueting YANG accomplished experimental verification. Jie LAI and Dianxing SUN drafted the paper. Ruihui PENG, Dianxing SUN, and Shuncheng TAN revised and finalized the paper.

Corresponding author

Correspondence to Jie Lai (赖杰).

Ethics declarations

All the authors declare that they have no conflict of interest.

Additional information

Project supported by the Shandong Provincial Natural Science Foundation of China (No. ZR2020MF015) and the Aerospace Science and Technology Innovation Institute Stabilization Support Project (No. ZY0110020009).

About this article

Cite this article

Peng, R., Lai, J., Yang, X. et al. Camouflaged target detection based on multimodal image input pixel-level fusion. Front Inform Technol Electron Eng 25, 1226–1239 (2024). https://doi.org/10.1631/FITEE.2300503

Received: 26 July 2023
Accepted: 14 November 2023
Published: 30 September 2024
Issue Date: September 2024
DOI: https://doi.org/10.1631/FITEE.2300503

Key words

CLC number

TP391
