Attention Scale-Aware Deformable Network for Inshore Ship Detection in Surveillance Videos

Liu, Di; Zhang, Yan; Zhao, Yan; Zhang, Yu

doi:10.1007/978-3-030-93046-2_50

Attention Scale-Aware Deformable Network for Inshore Ship Detection in Surveillance Videos

Di Liu¹⁴,
Yan Zhang¹⁴,
Yan Zhao¹⁵ &
…
Yu Zhang¹⁴

Conference paper
First Online: 01 January 2022

2001 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13069))

Abstract

Aiming at detecting multi-scale inshore ships in surveillance videos, a one-stage detector namely Attention Scale-aware Deformable Network (ASDN) is proposed in this paper by employing two primary components including Attention Scale-aware Module (ASM) and Deformable Convolutional Network (DCN). Moreover, ASM composed of several branches of convolutions with specially designed kernels and a Convolutional Block Attention Modules (CBAM), is designed for extracting and refining non-local features of multi-scale inshore ships with large aspect ratios at the topmost feature layer. DCN is adopted to capture irregular significant features of ships by modulating input features using parameterized offsets and amplitudes at lateral connections of fine-grained feature pyramid. Experiments conducted on public Seaships7000 dataset demonstrate the contributions of ASM and DCN, and the effectiveness of our method for multi-scale inshore ship detection in surveillance videos in comparison with other Convolutional Neural Network (CNN) based methods, e.g., FPN, Libra R-CNN, SSD, YOLOv3, RefineDet.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Arshad, N., Moon, K.S., Kim, J.N.: An adaptive moving ship detection and tracking based on edge information and morphological operations. In: International Conference on Graphic and Image Processing (ICGIP 2011), vol. 8285, p. 82851X. International Society for Optics and Photonics (2011)
Google Scholar
Bao, X., Zinger, S., Wijnhoven, R., et al.: Ship detection in port surveillance based on context and motion saliency analysis. In: Video Surveillance and Transportation Imaging Applications, vol. 8663, p. 86630D. International Society for Optics and Photonics (2013)
Google Scholar
Chen, L.C., Papandreou, G., Schroff, F., Adam, H.: Rethinking atrous convolution for semantic image segmentation. arXiv preprint arXiv:1706.05587 (2017)
Hosang, J., Benenson, R., Schiele, B.: Learning non-maximum suppression. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4507–4515 (2017)
Google Scholar
Hyla, T., Wawrzyniak, N.: Ships detection on inland waters using video surveillance system. In: Saeed, K., Chaki, R., Janev, V. (eds.) CISIM 2019. LNCS, vol. 11703, pp. 39–49. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-28957-7_4
Chapter Google Scholar
Hyla, T., Wawrzyniak, N.: Identification of vessels on inland waters using low-quality video streams. In: International Conference on System Sciences, p. 7269 (2021)
Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 25, 1097–1105 (2012)
Google Scholar
Lin, T.Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125 (2017)
Google Scholar
Liu, S., Huang, D., Wang, Y.: Receptive field block net for accurate and fast object detection. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11215, pp. 404–419. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01252-6_24
Chapter Google Scholar
Liu, W., et al.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2
Chapter Google Scholar
Nie, X., Yang, M., Liu, R.W.: Deep neural network-based robust ship detection under different weather conditions. In: 2019 IEEE Intelligent Transportation Systems Conference (ITSC), pp. 47–52. IEEE (2019)
Google Scholar
Nie, X., Liu, W., Wu, W.: Ship detection based on enhanced YOLOv3 under complex environments. JOCA 40, 2561–2570 (2020)
Google Scholar
Pang, J., Chen, K., Shi, J., Feng, H., Ouyang, W., Lin, D.: Libra R-CNN: towards balanced learning for object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 821–830 (2019)
Google Scholar
Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7263–7271 (2017)
Google Scholar
Redmon, J., Farhadi, A.: YOLOv3: an incremental improvement. arXiv preprint arXiv:1804.02767 (2018)
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. arXiv preprint arXiv:1506.01497 (2015)
Shao, Z., Wang, L., Wang, Z., Du, W., Wu, W.: Saliency-aware convolution neural network for ship detection in surveillance video. IEEE Trans. Circuits Syst. Video Technol. 30(3), 781–794 (2019)
Article Google Scholar
Shao, Z., Wu, W., Wang, Z., Du, W., Li, C.: Seaships: a large-scale precisely annotated dataset for ship detection. IEEE Trans. Multimedia 20(10), 2593–2604 (2018)
Article Google Scholar
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)
Sullivan, M.D.R., Shah, M.: Visual surveillance in maritime port facilities. In: Visual Information Processing XVII, vol. 6978, p. 697811. International Society for Optics and Photonics (2008)
Google Scholar
Szegedy, C., et al.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015)
Google Scholar
Wei, H., Nguyen, H., Ramu, P., Raju, C., Liu, X., Yadegar, J.: Automated intelligent video surveillance system for ships. In: Optics and Photonics in Global Homeland Security V and Biometric Technology for Human Identification VI, vol. 7306, p. 73061N. International Society for Optics and Photonics (2009)
Google Scholar
Woo, S., Park, J., Lee, J.-Y., Kweon, I.S.: CBAM: convolutional block attention module. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11211, pp. 3–19. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01234-2_1
Chapter Google Scholar
Zhang, S., Wen, L., Bian, X., Lei, Z., Li, S.Z.: Single-shot refinement neural network for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4203–4212 (2018)
Google Scholar
Zhu, X., Hu, H., Lin, S., Dai, J.: Deformable convnets v2: More deformable, better results. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9308–9316 (2019)
Google Scholar

Download references

Author information

Authors and Affiliations

National Key Laboratory of Science and Technology on Automatic Target Recognition, College of Electronic Science and Technology, National University of Defense Technology, Changsha, 410073, China
Di Liu, Yan Zhang & Yu Zhang
State Key Laboratory of Complex Electromagnetic Environment Effects on Electronics and Information System, College of Electronic Science and Technology, National University of Defense Technology, Changsha, 410073, China
Yan Zhao

Authors

Di Liu
View author publications
You can also search for this author in PubMed Google Scholar
Yan Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yan Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Yu Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yan Zhang .

Editor information

Editors and Affiliations

Tsinghua University, Beijing, China
Lu Fang
Duke University, Durham, NC, USA
Yiran Chen
Shanghai Jiao Tong University, Shanghai, China
Guangtao Zhai
University of British Columbia, Vancouver, BC, Canada
Jane Wang
Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
Ruiping Wang
Xidian University, Xi’an, China
Weisheng Dong

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Liu, D., Zhang, Y., Zhao, Y., Zhang, Y. (2021). Attention Scale-Aware Deformable Network for Inshore Ship Detection in Surveillance Videos. In: Fang, L., Chen, Y., Zhai, G., Wang, J., Wang, R., Dong, W. (eds) Artificial Intelligence. CICAI 2021. Lecture Notes in Computer Science(), vol 13069. Springer, Cham. https://doi.org/10.1007/978-3-030-93046-2_50

Download citation

DOI: https://doi.org/10.1007/978-3-030-93046-2_50
Published: 01 January 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-93045-5
Online ISBN: 978-3-030-93046-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics