Skip to main content
Log in

Efficient ship detection in sar images with dynamic feature smoothing and visual module using omni-dimensional dynamic large-scale convolution

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

In SAR (Synthetic Aperture Radar) image ship detection tasks, the accuracy of detection is hindered by the presence of a significant amount of clutter and false targets. Ships in these images exhibit diverse scales, which poses a challenge for traditional models to capture the necessary features across these varying scales. Furthermore, the limited texture information available in SAR images complicates the task of distinguishing between targets and the background, leading to suboptimal performance when using conventional deep learning models. To address these challenges, this paper introduces several innovative components. Firstly, a novel dynamic convolution operator is proposed, which allows for adaptively adjusting the smoothing method to mitigate the impact of noise in the data. Additionally, the paper introduces a plug-and-play feature smoothing module as well as a visual module. The feature smoothing module contributes to reducing noise interference, enhancing the overall robustness of the model. Meanwhile, the visual module focuses on augmenting long-range dependency relationships within the data, facilitating a better understanding of the complex interplay between ships and the background. This effectively addresses the issue of texture scarcity in SAR images. Our method was compared with the most advanced Visual module Explicit Visual Center (EVC) using the YOLO7-obb object detection baseline. The experimental results unequivocally demonstrate that our proposed method achieved a notable 1.2% improvement in the Average Precision (AP) value.

This is a preview of subscription content, log in via an institution to check access.

Access this article

We’re sorry, something doesn't seem to be working properly.

Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

Availability of data and materials

All the data used to support the findings of the study are available in the manuscript.

References

  1. Bao W, Huang M, Zhang Y, Xu Y, Liu X, Xiang X (2021) Boosting ship detection in sar images with complementary pretraining techniques. IEEE J Sel Top Appl Earth Obs Remote Sens 14:8941–8954

    Article  Google Scholar 

  2. Touvron H, Cord M, Douze M, Massa F, Sablayrolles A, Jégou H (2021) Training data-efficient image transformers & distillation through attention. In: International conference on machine learning, PMLR, pp 10347–10357

  3. Zheng S, Lu J, Zhao H, Zhu X, Luo Z, Wang Y, Fu Y, Feng J, Xiang T, Torr PH et al (2021) Rethinking semantic segmentation from a sequence-to-sequence perspective with transformers. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 6881–6890

  4. Tolstikhin IO, Houlsby N, Kolesnikov A, Beyer L, Zhai X, Unterthiner T, Yung J, Steiner A, Keysers D, Uszkoreit J et al (2021) Mlp-mixer: An all-mlp architecture for vision. Adv Neural Inf Process Syst 34:24261–24272

    Google Scholar 

  5. Wu Z, Hou B, Ren B, Ren Z, Wang S, Jiao L (2021) A deep detection network based on interaction of instance segmentation and object detection for sar images. Remote Sens 13(13). https://doi.org/10.3390/rs13132582

  6. Li J, Chen J, Cheng P, Yu Z, Yu L, Chi C (2023) A survey on deep-learning-based real-time sar ship detection. IEEE J Sel Top Appl Earth Obs Remote Sens 16:3218–3247. https://doi.org/10.1109/JSTARS.2023.3244616

    Article  Google Scholar 

  7. Gao G, Gao S, He J, Li G (2018) Ship detection using compact polarimetric sar based on the notch filter. IEEE Trans Geosci Remote Sens 56(9):5380–5393. https://doi.org/10.1109/TGRS.2018.2815582

    Article  Google Scholar 

  8. Tan M, Le Q (2019) Efficientnet: rethinking model scaling for convolutional neural networks. In: International conference on machine learning, PMLR, pp 6105–6114

  9. Ding X, Zhang X, Han J, Ding G (2022) Scaling up your kernels to 31x31: revisiting large kernel design in cnns. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 11963–11975

  10. Liu H, Dai Z, RSo D, Le QV (2021) Pay attention to mlps. Neural Inform Process Syst (NeurIPS)

  11. Li J, Chen J, Cheng P, Yu Z, Yu L, Chi C (2023) A survey on deep-learning-based real-time sar ship detection. IEEE J Sel Top Appl Earth Obs Remote Sens 16:3218–3247. https://doi.org/10.1109/JSTARS.2023.3244616

  12. Zhou K, Zhang M, Wang H, Tan J (2022) Ship detection in sar images based on multi-scale feature extraction and adaptive feature fusion. Remote Sens 14(3). https://doi.org/10.3390/rs14030755

  13. Yang X, Zhang J, Chen C, Yang D (2022) An efficient and lightweight cnn model with soft quantification for ship detection in sar images. IEEE Trans Geosci Remote Sens 60:1–13. https://doi.org/10.1109/TGRS.2022.3186155

    Article  Google Scholar 

  14. Zhang H, Xiong A, Lai L, Chen C, Liang J (2023) Amme-yolov7: improved yolov7 based on attention mechanism and multiscale expansion for electric vehicle driver and passenger helmet wearing detection. In: 2023 IEEE Int Conf Smart Internet of Things (SmartIoT), pp 223–227. https://doi.org/10.1109/SmartIoT58732.2023.00039

  15. Yan B, Fan P, Lei X, Liu Z, Yang F (2021) A real-time apple targets detection method for picking robot based on improved yolov5. Remote Sensing 13(9):1619

    Article  Google Scholar 

  16. Quan Y, Zhang D, Zhang L, Tang J (2023) Centralized feature pyramid for object detection. IEEE Trans Image Process

  17. Han K, Wang Y, Chen H, Chen X, Guo J, Liu Z, Tang Y, Xiao A, Xu C, Xu Y et al (2022) A survey on vision transformer. IEEE Trans Pattern Anal Mach Intell 45(1):87–110

    Article  Google Scholar 

  18. Li D, Hu J, Wang C, Li X, She Q, Zhu L, Zhang T, Chen Q (2021) Involution: inverting the inherence of convolution for visual recognition. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 12321–12330

  19. Liu Z, Mao H, Wu C-Y, Feichtenhofer C, Darrell T, Xie S (2022) A convnet for the 2020s. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 11976–11986

  20. Guo Y, Du L, Lyu G (2021) Sar target detection based on domain adaptive faster r-cnn with small training data size. Remote Sens 13(21):4202

  21. Yang X, Zhang X, Wang N, Gao X (2022) A robust one-stage detector for multiscale ship detection with complex background in massive sar images. IEEE Trans Geosci Remote Sens 60:1–12. https://doi.org/10.1109/TGRS.2021.3128060

    Article  Google Scholar 

  22. Diba A, Sharma V, Gool LV, Stiefelhagen R (2019) Dynamonet: dynamic action and motion network. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 6192–6201

  23. Mildenhall B, Barron JT, Chen J, Sharlet D, Ng R, Carroll R (2018) Burst denoising with kernel prediction networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2502–2510

  24. Ma N, Zhang X, Huang J, Sun J (2020) Weightnet: revisiting the design space of weight networks. In: European conference on computer vision. Springer, pp 776–792

  25. Lin X, Ma L, Liu W, Chang S-F (2020) Context-gated convolution. In: Computer vision–ECCV 2020: 16th European conference, Glasgow, UK, proceedings, Part XVIII 16. Springer, pp 701–718. Accessed 23–28 Aug 2020

  26. Quader N, Bhuiyan MMI, Lu J, Dai P, Li W (2020) Weight excitation: built-in attention mechanisms in convolutional neural networks. In: Computer vision – ECCV 2020. Springer

  27. Chen Y, Dai X, Liu M, Chen D, Yuan L, Liu Z (2020) Dynamic convolution: attention over convolution kernels. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 11030–11039

  28. Zhao Z, Dong M (2023) Channel-spatial dynamic convolution: an exquisite omni-dimensional dynamic convolution. In: 2023 8th international conference on intelligent computing and signal processing (ICSP), pp 1707–1711. https://doi.org/10.1109/ICSP58490.2023.10248781

  29. Tang T, Yang Z, Liu T (2021) Cfar ship detection method of polarimetric sar imagery based on whitening filter under \(\cal{G}\) distribution. In: 2021 CIE international conference on radar (Radar), pp 599–603. https://doi.org/10.1109/Radar53847.2021.10028413

  30. Tang G, Zhuge Y, Claramunt C, Men S (2021) N-yolo: a sar ship detection using noise-classifying and complete-target extraction. Remote Sens 13(5). https://doi.org/10.3390/rs13050871

  31. Luo Z, Mishra A, Achkar A, Eichel J, Li S, Jodoin P-M (2017) Non-local deep features for salient object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 6609–6617

  32. Zhao S, Luo Y, Zhang T, Guo W, Zhang Z (2023) A domain specific knowledge extraction transformer method for multisource satellite-borne sar images ship detection. ISPRS J Photogramm Remote Sens 198:16–29. https://doi.org/10.1016/j.isprsjprs.2023.02.011

    Article  Google Scholar 

  33. Xu X, Zhang X, Zhang T (2022) Lite-yolov5: a lightweight deep learning detector for on-board ship detection in large-scene sentinel-1 sar images. Remote Sens 14(4). https://doi.org/10.3390/rs14041018

  34. Han K, Wang Y, Tian Q, Guo J, Xu C, Xu C (2020) Ghostnet: more features from cheap operations. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 1580–1589

  35. Yu W, Zhou P, Yan S, Wang X (2023) Inceptionnext: when inception meets convnext. arXiv:2303.16900

  36. Zhou K, Zhang M, Wang H, Tan J (2022) Ship detection in sar images based on multi-scale feature extraction and adaptive feature fusion. Remote Sens 14(3):755

    Article  Google Scholar 

  37. Li J, Xu C, Su H, Gao L, Wang T (2022) Deep learning for sar ship detection: past, present and future. Remote Sens 14(11):2712

    Article  Google Scholar 

  38. Humayun MF, Bhatti FA, Khurshid K (2023) ivision mrssd: a comprehensive multi-resolution sar ship detection dataset for state of the art satellite based maritime surveillance applications. Data Brief 50. https://doi.org/10.1016/j.dib.2023.109505

  39. Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 580–587

  40. Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: towards real-time object detection with region proposal networks. Adv Neural Inform Process Syst 28

  41. Xi W, Wei M, Zhang Z (2023) Design of high-performance battery cover flaw detection system. In: 2023 5th international conference on intelligent control, measurement and signal processing (ICMSP). IEEE, pp 467–471

  42. Liu Z, Mao H, Wu C-Y, Feichtenhofer C, Darrell T, Xie S (2022) A convnet for the 2020s. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 11976–11986

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Huachun Zhang.

Ethics declarations

Conflict of interest

The authors declare no conflicts of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, W., Zhang, H. & Xu, A. Efficient ship detection in sar images with dynamic feature smoothing and visual module using omni-dimensional dynamic large-scale convolution. Multimed Tools Appl 83, 68697–68721 (2024). https://doi.org/10.1007/s11042-024-18288-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-024-18288-8

Keywords

Navigation