research-article

Infrared small target detection based on the combination of single image super-resolution reconstruction and YOLOX

Authors:

Dengpan Song

CACML '23: Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning

Pages 547 - 552

https://doi.org/10.1145/3590003.3590104

Published: 29 May 2023


Abstract

Infrared search and tracking systems need a stronger ability to detect small infrared targets against complex backgrounds. YOLOX is a high-performance detector, but its performance is limited when the input is a low-resolution infrared image containing small targets. In some cases, design constraints and budget limits prevent the optical system and sensor resolution from being raised enough to improve image quality. To address this, Real-ESRGAN is used to reconstruct a high-resolution infrared image from its low-resolution counterpart, and the reconstructed image serves as the input to YOLOX-S. The YOLOX-S training strategy is also modified to suit infrared small target detection, including the Mosaic and MixUp data augmentation and the size of the ground truth. With the proposed method, average precision rises to 77.19%, compared with 63.70% for the original model fed the original images, a considerable improvement in infrared small target detection.
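The two-stage idea in the abstract (enlarge the frame first, then detect on the enlarged frame) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: `upscale_nearest` is only a placeholder that replicates pixels and is not Real-ESRGAN, the 4x factor is not stated in the abstract, and the `Box` type and helper names are hypothetical. The sketch shows one geometric consequence of the pipeline, namely that box annotations given in low-resolution pixels must be multiplied by the same factor as the image, and it works out the reported average-precision gain.

```python
from dataclasses import dataclass
from typing import List, Tuple

Image = List[List[int]]  # grayscale image as rows of pixel intensities


@dataclass
class Box:
    """Axis-aligned box in pixel coordinates (hypothetical annotation type)."""
    x_min: float
    y_min: float
    x_max: float
    y_max: float


def upscale_nearest(image: Image, scale: int) -> Image:
    """Placeholder for the super-resolution stage.

    Nearest-neighbour replication only enlarges the pixel grid. It is NOT
    Real-ESRGAN and recovers no detail; it stands in so the sketch runs.
    """
    out: Image = []
    for row in image:
        wide = [pixel for pixel in row for _ in range(scale)]
        out.extend([list(wide) for _ in range(scale)])
    return out


def scale_box(box: Box, scale: int) -> Box:
    """Map a box from low-resolution to upscaled pixel coordinates."""
    return Box(box.x_min * scale, box.y_min * scale,
               box.x_max * scale, box.y_max * scale)


def ap_gain(baseline: float, improved: float) -> Tuple[float, float]:
    """Return (absolute gain in percentage points, relative gain in percent)."""
    absolute = improved - baseline
    return absolute, 100.0 * absolute / baseline


low_res = [[0, 0, 0], [0, 255, 0], [0, 0, 0]]  # 3x3 frame with a 1-pixel target
target = Box(1, 1, 2, 2)                        # box around that pixel

high_res = upscale_nearest(low_res, scale=4)    # assumed 4x factor
scaled_target = scale_box(target, scale=4)      # annotation follows the image

absolute, relative = ap_gain(63.70, 77.19)      # AP values from the abstract
print(len(high_res), len(high_res[0]), scaled_target)
print(f"{absolute:.2f} points, {relative:.1f}% relative")
```

On the abstract's figures, the gain is 13.49 percentage points, which is roughly a 21.2% relative improvement over the 63.70% baseline.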


Cited By

Zhou G, Liu X, Bi H. (2025). Recognition of UAVs in Infrared Images Based on YOLOv8. IEEE Access, 13, 1534-1545. Online publication date: 2025.
https://doi.org/10.1109/ACCESS.2024.3500583
Lin J, Li S, Yang X, Niu S, Yan B, Meng Z. (2024). CS-ViG-UNet: Infrared small and dim target detection based on cycle shift vision graph convolution network. Expert Systems with Applications, 254, 124385. Online publication date: Nov-2024.
https://doi.org/10.1016/j.eswa.2024.124385
Lin F, Ge S, Bao K, Yan C, Zeng D. (2023). Learning Shape-Biased Representations for Infrared Small Target Detection. IEEE Transactions on Multimedia, 26, 4681-4692. Online publication date: 20-Oct-2023.
https://dl.acm.org/doi/10.1109/TMM.2023.3325743

Index Terms

Infrared small target detection based on the combination of single image super-resolution reconstruction and YOLOX
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Object detection

Information & Contributors

Information

Published In

ACM Other conferences

CACML '23: Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning

March 2023

598 pages

ISBN:9781450399449

DOI:10.1145/3590003

Copyright © 2023 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article
Research
Refereed limited

Funding Sources

Fundamental Strengthening Technology Fund

Conference

CACML 2023

CACML 2023: 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning

March 17 - 19, 2023

Shanghai, China

Acceptance Rates

CACML '23 Paper Acceptance Rate 93 of 241 submissions, 39%;

Overall Acceptance Rate 93 of 241 submissions, 39%

Bibliometrics

Total Citations: 3
Total Downloads: 81
Downloads (Last 12 months): 40
Downloads (Last 6 weeks): 3

Reflects downloads up to 23 Feb 2025
