Abstract
In SAR (Synthetic Aperture Radar) image ship detection tasks, the accuracy of detection is hindered by the presence of a significant amount of clutter and false targets. Ships in these images exhibit diverse scales, which poses a challenge for traditional models to capture the necessary features across these varying scales. Furthermore, the limited texture information available in SAR images complicates the task of distinguishing between targets and the background, leading to suboptimal performance when using conventional deep learning models. To address these challenges, this paper introduces several innovative components. Firstly, a novel dynamic convolution operator is proposed, which allows for adaptively adjusting the smoothing method to mitigate the impact of noise in the data. Additionally, the paper introduces a plug-and-play feature smoothing module as well as a visual module. The feature smoothing module contributes to reducing noise interference, enhancing the overall robustness of the model. Meanwhile, the visual module focuses on augmenting long-range dependency relationships within the data, facilitating a better understanding of the complex interplay between ships and the background. This effectively addresses the issue of texture scarcity in SAR images. Our method was compared with the most advanced Visual module Explicit Visual Center (EVC) using the YOLO7-obb object detection baseline. The experimental results unequivocally demonstrate that our proposed method achieved a notable 1.2% improvement in the Average Precision (AP) value.
Access this article
We’re sorry, something doesn't seem to be working properly.
Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-024-18288-8/MediaObjects/11042_2024_18288_Fig1_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-024-18288-8/MediaObjects/11042_2024_18288_Fig2_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-024-18288-8/MediaObjects/11042_2024_18288_Fig3_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-024-18288-8/MediaObjects/11042_2024_18288_Fig4_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-024-18288-8/MediaObjects/11042_2024_18288_Fig5_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-024-18288-8/MediaObjects/11042_2024_18288_Fig6_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-024-18288-8/MediaObjects/11042_2024_18288_Fig7_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-024-18288-8/MediaObjects/11042_2024_18288_Fig8_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-024-18288-8/MediaObjects/11042_2024_18288_Fig9_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-024-18288-8/MediaObjects/11042_2024_18288_Fig10_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-024-18288-8/MediaObjects/11042_2024_18288_Fig11_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-024-18288-8/MediaObjects/11042_2024_18288_Fig12_HTML.png)
Similar content being viewed by others
Availability of data and materials
All the data used to support the findings of the study are available in the manuscript.
References
Bao W, Huang M, Zhang Y, Xu Y, Liu X, Xiang X (2021) Boosting ship detection in sar images with complementary pretraining techniques. IEEE J Sel Top Appl Earth Obs Remote Sens 14:8941–8954
Touvron H, Cord M, Douze M, Massa F, Sablayrolles A, Jégou H (2021) Training data-efficient image transformers & distillation through attention. In: International conference on machine learning, PMLR, pp 10347–10357
Zheng S, Lu J, Zhao H, Zhu X, Luo Z, Wang Y, Fu Y, Feng J, Xiang T, Torr PH et al (2021) Rethinking semantic segmentation from a sequence-to-sequence perspective with transformers. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 6881–6890
Tolstikhin IO, Houlsby N, Kolesnikov A, Beyer L, Zhai X, Unterthiner T, Yung J, Steiner A, Keysers D, Uszkoreit J et al (2021) Mlp-mixer: An all-mlp architecture for vision. Adv Neural Inf Process Syst 34:24261–24272
Wu Z, Hou B, Ren B, Ren Z, Wang S, Jiao L (2021) A deep detection network based on interaction of instance segmentation and object detection for sar images. Remote Sens 13(13). https://doi.org/10.3390/rs13132582
Li J, Chen J, Cheng P, Yu Z, Yu L, Chi C (2023) A survey on deep-learning-based real-time sar ship detection. IEEE J Sel Top Appl Earth Obs Remote Sens 16:3218–3247. https://doi.org/10.1109/JSTARS.2023.3244616
Gao G, Gao S, He J, Li G (2018) Ship detection using compact polarimetric sar based on the notch filter. IEEE Trans Geosci Remote Sens 56(9):5380–5393. https://doi.org/10.1109/TGRS.2018.2815582
Tan M, Le Q (2019) Efficientnet: rethinking model scaling for convolutional neural networks. In: International conference on machine learning, PMLR, pp 6105–6114
Ding X, Zhang X, Han J, Ding G (2022) Scaling up your kernels to 31x31: revisiting large kernel design in cnns. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 11963–11975
Liu H, Dai Z, RSo D, Le QV (2021) Pay attention to mlps. Neural Inform Process Syst (NeurIPS)
Li J, Chen J, Cheng P, Yu Z, Yu L, Chi C (2023) A survey on deep-learning-based real-time sar ship detection. IEEE J Sel Top Appl Earth Obs Remote Sens 16:3218–3247. https://doi.org/10.1109/JSTARS.2023.3244616
Zhou K, Zhang M, Wang H, Tan J (2022) Ship detection in sar images based on multi-scale feature extraction and adaptive feature fusion. Remote Sens 14(3). https://doi.org/10.3390/rs14030755
Yang X, Zhang J, Chen C, Yang D (2022) An efficient and lightweight cnn model with soft quantification for ship detection in sar images. IEEE Trans Geosci Remote Sens 60:1–13. https://doi.org/10.1109/TGRS.2022.3186155
Zhang H, Xiong A, Lai L, Chen C, Liang J (2023) Amme-yolov7: improved yolov7 based on attention mechanism and multiscale expansion for electric vehicle driver and passenger helmet wearing detection. In: 2023 IEEE Int Conf Smart Internet of Things (SmartIoT), pp 223–227. https://doi.org/10.1109/SmartIoT58732.2023.00039
Yan B, Fan P, Lei X, Liu Z, Yang F (2021) A real-time apple targets detection method for picking robot based on improved yolov5. Remote Sensing 13(9):1619
Quan Y, Zhang D, Zhang L, Tang J (2023) Centralized feature pyramid for object detection. IEEE Trans Image Process
Han K, Wang Y, Chen H, Chen X, Guo J, Liu Z, Tang Y, Xiao A, Xu C, Xu Y et al (2022) A survey on vision transformer. IEEE Trans Pattern Anal Mach Intell 45(1):87–110
Li D, Hu J, Wang C, Li X, She Q, Zhu L, Zhang T, Chen Q (2021) Involution: inverting the inherence of convolution for visual recognition. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 12321–12330
Liu Z, Mao H, Wu C-Y, Feichtenhofer C, Darrell T, Xie S (2022) A convnet for the 2020s. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 11976–11986
Guo Y, Du L, Lyu G (2021) Sar target detection based on domain adaptive faster r-cnn with small training data size. Remote Sens 13(21):4202
Yang X, Zhang X, Wang N, Gao X (2022) A robust one-stage detector for multiscale ship detection with complex background in massive sar images. IEEE Trans Geosci Remote Sens 60:1–12. https://doi.org/10.1109/TGRS.2021.3128060
Diba A, Sharma V, Gool LV, Stiefelhagen R (2019) Dynamonet: dynamic action and motion network. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 6192–6201
Mildenhall B, Barron JT, Chen J, Sharlet D, Ng R, Carroll R (2018) Burst denoising with kernel prediction networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2502–2510
Ma N, Zhang X, Huang J, Sun J (2020) Weightnet: revisiting the design space of weight networks. In: European conference on computer vision. Springer, pp 776–792
Lin X, Ma L, Liu W, Chang S-F (2020) Context-gated convolution. In: Computer vision–ECCV 2020: 16th European conference, Glasgow, UK, proceedings, Part XVIII 16. Springer, pp 701–718. Accessed 23–28 Aug 2020
Quader N, Bhuiyan MMI, Lu J, Dai P, Li W (2020) Weight excitation: built-in attention mechanisms in convolutional neural networks. In: Computer vision – ECCV 2020. Springer
Chen Y, Dai X, Liu M, Chen D, Yuan L, Liu Z (2020) Dynamic convolution: attention over convolution kernels. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 11030–11039
Zhao Z, Dong M (2023) Channel-spatial dynamic convolution: an exquisite omni-dimensional dynamic convolution. In: 2023 8th international conference on intelligent computing and signal processing (ICSP), pp 1707–1711. https://doi.org/10.1109/ICSP58490.2023.10248781
Tang T, Yang Z, Liu T (2021) Cfar ship detection method of polarimetric sar imagery based on whitening filter under \(\cal{G}\) distribution. In: 2021 CIE international conference on radar (Radar), pp 599–603. https://doi.org/10.1109/Radar53847.2021.10028413
Tang G, Zhuge Y, Claramunt C, Men S (2021) N-yolo: a sar ship detection using noise-classifying and complete-target extraction. Remote Sens 13(5). https://doi.org/10.3390/rs13050871
Luo Z, Mishra A, Achkar A, Eichel J, Li S, Jodoin P-M (2017) Non-local deep features for salient object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 6609–6617
Zhao S, Luo Y, Zhang T, Guo W, Zhang Z (2023) A domain specific knowledge extraction transformer method for multisource satellite-borne sar images ship detection. ISPRS J Photogramm Remote Sens 198:16–29. https://doi.org/10.1016/j.isprsjprs.2023.02.011
Xu X, Zhang X, Zhang T (2022) Lite-yolov5: a lightweight deep learning detector for on-board ship detection in large-scene sentinel-1 sar images. Remote Sens 14(4). https://doi.org/10.3390/rs14041018
Han K, Wang Y, Tian Q, Guo J, Xu C, Xu C (2020) Ghostnet: more features from cheap operations. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 1580–1589
Yu W, Zhou P, Yan S, Wang X (2023) Inceptionnext: when inception meets convnext. arXiv:2303.16900
Zhou K, Zhang M, Wang H, Tan J (2022) Ship detection in sar images based on multi-scale feature extraction and adaptive feature fusion. Remote Sens 14(3):755
Li J, Xu C, Su H, Gao L, Wang T (2022) Deep learning for sar ship detection: past, present and future. Remote Sens 14(11):2712
Humayun MF, Bhatti FA, Khurshid K (2023) ivision mrssd: a comprehensive multi-resolution sar ship detection dataset for state of the art satellite based maritime surveillance applications. Data Brief 50. https://doi.org/10.1016/j.dib.2023.109505
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 580–587
Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: towards real-time object detection with region proposal networks. Adv Neural Inform Process Syst 28
Xi W, Wei M, Zhang Z (2023) Design of high-performance battery cover flaw detection system. In: 2023 5th international conference on intelligent control, measurement and signal processing (ICMSP). IEEE, pp 467–471
Liu Z, Mao H, Wu C-Y, Feichtenhofer C, Darrell T, Xie S (2022) A convnet for the 2020s. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 11976–11986
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no conflicts of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Wang, W., Zhang, H. & Xu, A. Efficient ship detection in sar images with dynamic feature smoothing and visual module using omni-dimensional dynamic large-scale convolution. Multimed Tools Appl 83, 68697–68721 (2024). https://doi.org/10.1007/s11042-024-18288-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-024-18288-8