Abstract
Aiming at detecting multi-scale inshore ships in surveillance videos, a one-stage detector namely Attention Scale-aware Deformable Network (ASDN) is proposed in this paper by employing two primary components including Attention Scale-aware Module (ASM) and Deformable Convolutional Network (DCN). Moreover, ASM composed of several branches of convolutions with specially designed kernels and a Convolutional Block Attention Modules (CBAM), is designed for extracting and refining non-local features of multi-scale inshore ships with large aspect ratios at the topmost feature layer. DCN is adopted to capture irregular significant features of ships by modulating input features using parameterized offsets and amplitudes at lateral connections of fine-grained feature pyramid. Experiments conducted on public Seaships7000 dataset demonstrate the contributions of ASM and DCN, and the effectiveness of our method for multi-scale inshore ship detection in surveillance videos in comparison with other Convolutional Neural Network (CNN) based methods, e.g., FPN, Libra R-CNN, SSD, YOLOv3, RefineDet.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Arshad, N., Moon, K.S., Kim, J.N.: An adaptive moving ship detection and tracking based on edge information and morphological operations. In: International Conference on Graphic and Image Processing (ICGIP 2011), vol. 8285, p. 82851X. International Society for Optics and Photonics (2011)
Bao, X., Zinger, S., Wijnhoven, R., et al.: Ship detection in port surveillance based on context and motion saliency analysis. In: Video Surveillance and Transportation Imaging Applications, vol. 8663, p. 86630D. International Society for Optics and Photonics (2013)
Chen, L.C., Papandreou, G., Schroff, F., Adam, H.: Rethinking atrous convolution for semantic image segmentation. arXiv preprint arXiv:1706.05587 (2017)
Hosang, J., Benenson, R., Schiele, B.: Learning non-maximum suppression. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4507–4515 (2017)
Hyla, T., Wawrzyniak, N.: Ships detection on inland waters using video surveillance system. In: Saeed, K., Chaki, R., Janev, V. (eds.) CISIM 2019. LNCS, vol. 11703, pp. 39–49. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-28957-7_4
Hyla, T., Wawrzyniak, N.: Identification of vessels on inland waters using low-quality video streams. In: International Conference on System Sciences, p. 7269 (2021)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 25, 1097–1105 (2012)
Lin, T.Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125 (2017)
Liu, S., Huang, D., Wang, Y.: Receptive field block net for accurate and fast object detection. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11215, pp. 404–419. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01252-6_24
Liu, W., et al.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2
Nie, X., Yang, M., Liu, R.W.: Deep neural network-based robust ship detection under different weather conditions. In: 2019 IEEE Intelligent Transportation Systems Conference (ITSC), pp. 47–52. IEEE (2019)
Nie, X., Liu, W., Wu, W.: Ship detection based on enhanced YOLOv3 under complex environments. JOCA 40, 2561–2570 (2020)
Pang, J., Chen, K., Shi, J., Feng, H., Ouyang, W., Lin, D.: Libra R-CNN: towards balanced learning for object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 821–830 (2019)
Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7263–7271 (2017)
Redmon, J., Farhadi, A.: YOLOv3: an incremental improvement. arXiv preprint arXiv:1804.02767 (2018)
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. arXiv preprint arXiv:1506.01497 (2015)
Shao, Z., Wang, L., Wang, Z., Du, W., Wu, W.: Saliency-aware convolution neural network for ship detection in surveillance video. IEEE Trans. Circuits Syst. Video Technol. 30(3), 781–794 (2019)
Shao, Z., Wu, W., Wang, Z., Du, W., Li, C.: Seaships: a large-scale precisely annotated dataset for ship detection. IEEE Trans. Multimedia 20(10), 2593–2604 (2018)
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)
Sullivan, M.D.R., Shah, M.: Visual surveillance in maritime port facilities. In: Visual Information Processing XVII, vol. 6978, p. 697811. International Society for Optics and Photonics (2008)
Szegedy, C., et al.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015)
Wei, H., Nguyen, H., Ramu, P., Raju, C., Liu, X., Yadegar, J.: Automated intelligent video surveillance system for ships. In: Optics and Photonics in Global Homeland Security V and Biometric Technology for Human Identification VI, vol. 7306, p. 73061N. International Society for Optics and Photonics (2009)
Woo, S., Park, J., Lee, J.-Y., Kweon, I.S.: CBAM: convolutional block attention module. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11211, pp. 3–19. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01234-2_1
Zhang, S., Wen, L., Bian, X., Lei, Z., Li, S.Z.: Single-shot refinement neural network for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4203–4212 (2018)
Zhu, X., Hu, H., Lin, S., Dai, J.: Deformable convnets v2: More deformable, better results. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9308–9316 (2019)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Liu, D., Zhang, Y., Zhao, Y., Zhang, Y. (2021). Attention Scale-Aware Deformable Network for Inshore Ship Detection in Surveillance Videos. In: Fang, L., Chen, Y., Zhai, G., Wang, J., Wang, R., Dong, W. (eds) Artificial Intelligence. CICAI 2021. Lecture Notes in Computer Science(), vol 13069. Springer, Cham. https://doi.org/10.1007/978-3-030-93046-2_50
Download citation
DOI: https://doi.org/10.1007/978-3-030-93046-2_50
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-93045-5
Online ISBN: 978-3-030-93046-2
eBook Packages: Computer ScienceComputer Science (R0)