Camouflaged target detection based on multimodal image input pixel-level fusion

  • Research Article
Frontiers of Information Technology & Electronic Engineering

Abstract

Camouflaged targets are nonsalient targets whose foreground blends strongly into the background and which carry minimal feature information, making recognition extremely difficult. Most detection algorithms for camouflaged targets use only single-band information about the target, resulting in low detection accuracy and a high missed detection rate. In this paper, we present a multimodal image fusion technique for camouflaged target detection (MIF-YOLOv5). First, a multimodal image input stage performs pixel-level fusion of the optical and infrared images of the camouflaged target to enrich its effective feature information. Second, a loss function is designed, and the K-Means++ clustering technique is used to optimize the target anchor boxes in the dataset, improving the accuracy and robustness of camouflaged personnel detection. Finally, a comprehensive detection index for camouflaged targets is proposed to compare the overall effectiveness of different approaches. We also build a multispectral camouflaged target dataset to test the proposed technique. Experimental results show that the proposed method has the best comprehensive detection performance, with a detection accuracy of 96.5%, a recognition probability of 92.5%, an increase of 1×10⁴ in parameter count, an increase of 0.03 GFLOPs in theoretical computation, and a comprehensive detection index of 0.85. The advantage of the method in detection accuracy is also apparent in performance comparisons with other target detection algorithms.
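
The anchor optimization step mentioned in the abstract can be illustrated with a minimal sketch of K-Means++ clustering over bounding-box sizes. The abstract does not state the distance measure or update rule, so this sketch assumes the 1 − IoU distance and mean update commonly used for YOLO-style anchors. The function names `iou_wh` and `kmeans_pp_anchors` are hypothetical and are not taken from the paper.

```python
import random


def iou_wh(box, anchor):
    """IoU of two (width, height) boxes aligned at a shared corner."""
    inter = min(box[0], anchor[0]) * min(box[1], anchor[1])
    union = box[0] * box[1] + anchor[0] * anchor[1] - inter
    return inter / union


def kmeans_pp_anchors(boxes, k, iters=100, seed=0):
    """Cluster (width, height) pairs into k anchors.

    Assumption: distance = 1 - IoU, as is common for YOLO anchors;
    the paper's actual choice is not given in the abstract.
    """
    rng = random.Random(seed)

    # K-Means++ seeding: the first centre is drawn uniformly; each later
    # centre is drawn with probability proportional to the squared
    # distance to the nearest centre chosen so far.
    centres = [rng.choice(boxes)]
    while len(centres) < k:
        d2 = [min(1.0 - iou_wh(b, c) for c in centres) ** 2 for b in boxes]
        if sum(d2) == 0:
            # Every box coincides with an existing centre; fall back to uniform.
            centres.append(rng.choice(boxes))
        else:
            centres.append(rng.choices(boxes, weights=d2, k=1)[0])

    # Lloyd iterations: assign each box to the centre with the highest IoU,
    # then move each centre to the mean width and height of its members.
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for b in boxes:
            best = max(range(k), key=lambda i: iou_wh(b, centres[i]))
            clusters[best].append(b)
        new_centres = []
        for i, members in enumerate(clusters):
            if members:
                w = sum(m[0] for m in members) / len(members)
                h = sum(m[1] for m in members) / len(members)
                new_centres.append((w, h))
            else:
                new_centres.append(centres[i])  # keep an empty cluster's centre
        if new_centres == centres:
            break
        centres = new_centres

    # Sort by area so anchors run from small to large.
    return sorted(centres, key=lambda c: c[0] * c[1])
```

Called as `kmeans_pp_anchors(boxes, k=9)` on the labelled box sizes of a dataset, this would return nine anchors ordered by area. Compared with uniform random initialization, the K-Means++ seeding spreads the initial centres apart, which tends to make the resulting anchors less sensitive to the starting point.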

Data availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Author information

Contributions

Ruihui PENG and Jie LAI designed the research. Jie LAI devised the experimental method for acquiring camouflage target datasets. Xueting YANG, Yingjuan SONG, and Wei GUO collaborated to complete data collection. Jie LAI and Xueting YANG accomplished experimental verification. Jie LAI and Dianxing SUN drafted the paper. Ruihui PENG, Dianxing SUN, and Shuncheng TAN revised and finalized the paper.

Corresponding author

Correspondence to Jie Lai (赖杰).

Ethics declarations

All the authors declare that they have no conflict of interest.

Additional information

This project was supported by the Shandong Provincial Natural Science Foundation of China (No. ZR2020MF015) and the Aerospace Science and Technology Innovation Institute Stabilization Support Project (No. ZY0110020009).

About this article

Cite this article

Peng, R., Lai, J., Yang, X. et al. Camouflaged target detection based on multimodal image input pixel-level fusion. Front Inform Technol Electron Eng 25, 1226–1239 (2024). https://doi.org/10.1631/FITEE.2300503

  • DOI: https://doi.org/10.1631/FITEE.2300503
