Abstract
Infrared object detection is an important approach to ship targeting and plays a vital role in maritime safety. Existing research on infrared ship imagery remains limited and has yet to adequately address small object sizes and complex image information. To overcome these challenges, we introduce RD-YOLO, a model for ship object detection in remote sensing, with a focus on infrared images. RD-YOLO incorporates a Receptive Field Convolution module, which exploits receptive field features to strengthen the model's global feature perception. It also uses a multi-scale network, the Deep Convergence Network (DCNnet), to improve the fusion of remote sensing information. DCNnet introduces two modules to improve target detection in infrared remote sensing images. The Multiscale Fusion Module addresses the loss of detailed information from larger-scale features and ensures comprehensive characterization. The Spatial Feature Module integrates multi-scale features through input compression with 3D convolution, improving the network's ability to capture indistinct texture information. Experimental results show that RD-YOLO attains an mAP50 of 94.53% on the open SFISD dataset, 4.22% higher than YOLOv8s, and an mAP50 of 98.91% on the ship remote sensing dataset of Shandong University, 1.87% higher than YOLOv8s, which validates the effectiveness of the method. The model is well suited to infrared ship target detection in various environments.
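For readers unfamiliar with the mAP50 metric reported above, the matching criterion behind it can be sketched in a few lines. mAP50 counts a predicted box as a correct detection when its intersection over union (IoU) with a ground-truth box is at least 0.5. The snippet below is only an illustrative sketch and not the authors' evaluation code: the names `Box`, `iou`, and `is_true_positive` and the corner-coordinate box format are assumptions made for this example, and the full metric additionally ranks detections by confidence and averages precision over recall for each class.

```python
from typing import Tuple

# Hypothetical box format for this sketch: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    # Corners of the overlapping rectangle (may be empty).
    ix_min, iy_min = max(a[0], b[0]), max(a[1], b[1])
    ix_max, iy_max = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix_max - ix_min) * max(0.0, iy_max - iy_min)

    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def is_true_positive(pred: Box, truth: Box, threshold: float = 0.5) -> bool:
    """Matching rule used at the IoU = 0.5 operating point of mAP50."""
    return iou(pred, truth) >= threshold


if __name__ == "__main__":
    truth = (0.0, 0.0, 10.0, 10.0)
    print(is_true_positive((2.0, 0.0, 12.0, 10.0), truth))  # 2-pixel shift
    print(is_true_positive((5.0, 0.0, 15.0, 10.0), truth))  # 5-pixel shift
```

With a 10 × 10 pixel target, a 2-pixel horizontal shift still gives an IoU of about 0.67, whereas a 5-pixel shift drops it to about 0.33 and the detection no longer counts. This sensitivity to small offsets is one reason small targets are hard to score well on.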
Data availability
No datasets were generated or analysed during the current study.
Code availability
Not applicable.
Funding
This work was supported by the Heilongjiang Province Provincial Higher Education Institutions Basic Research Operating Expenses Program under Grant (2022-KYYWF-0569).
Author information
Contributions
Conceptualization, Yilin Ge and Haowen Ji; methodology, Yilin Ge, Haowen Ji, and Xingli Liu; software, Haowen Ji; validation, Haowen Ji; formal analysis, Yilin Ge and Haowen Ji; investigation, Haowen Ji; resources, Haowen Ji; data curation, Haowen Ji; writing (original draft preparation), Yilin Ge; writing (review and editing), Haowen Ji. All authors have read and agreed to the published version of the manuscript.
Ethics declarations
Conflict of interest
The authors declare no competing interests.
About this article
Cite this article
Ge, Y., Ji, H. & Liu, X. Infrared remote sensing ship image object detection model based on YOLO In multiple environments. SIViP 19, 85 (2025). https://doi.org/10.1007/s11760-024-03656-6
DOI: https://doi.org/10.1007/s11760-024-03656-6