Underwater Object Detection Using Restructured SSD

Huang, Andi; Zhong, Guoqiang; Li, Hao; Choi, Daewon

doi:10.1007/978-3-031-20497-5_43

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 13604)

Included in the following conference series:

CAAI International Conference on Artificial Intelligence

Abstract

Deep learning has been widely used in computer vision tasks such as image classification, semantic segmentation and object detection, and has achieved many breakthrough results in recent years. Underwater object detection is harder than conventional object detection: because of uneven illumination, low contrast, and more impurities in the underwater environment, there is no guarantee that underwater images are of high quality. In this paper, we construct an underwater object detection model based on multi-scale feature fusion, called the Multi-scale Feature Fusion Network for Underwater Object Detection (MFFNet). MFFNet uses the SSD model as its baseline and improves it with three modules: an improved FPN, an assisting backbone, and a CBAM attention module. VGG-16 and ResNet-50 serve as the backbone networks and are joined by a composite backbone connection; the CBAM attention module makes the network pay more attention to the objects; and the feature pyramid (FPN) structure is used for multi-scale feature detection. To verify the effectiveness of the proposed model, experiments are carried out on three datasets: VOC 2007, UPRC and Fish4knowledges. The experimental results show that, compared with other mainstream object detection models, the proposed model achieves higher detection accuracy in underwater object detection.
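
As an illustration of the multi-scale feature fusion idea mentioned in the abstract, the following is a minimal sketch of the top-down merge used in a standard feature pyramid (FPN): each coarser map is upsampled and added to the next finer one. It is not the authors' "improved FPN", whose details are not given here. The function names (`upsample_nearest_2x`, `add_maps`, `top_down_fuse`), the single-channel list-of-lists representation, and the omission of the lateral 1x1 and smoothing 3x3 convolutions are all simplifying assumptions.

```python
from typing import List

# Assumption: a feature map is a single-channel 2D grid (rows x cols).
FeatureMap = List[List[float]]


def upsample_nearest_2x(fmap: FeatureMap) -> FeatureMap:
    """Nearest-neighbour upsampling: every value becomes a 2x2 block."""
    out: FeatureMap = []
    for row in fmap:
        wide = [v for v in row for _ in range(2)]  # duplicate each column
        out.append(wide)
        out.append(list(wide))  # duplicate the row
    return out


def add_maps(a: FeatureMap, b: FeatureMap) -> FeatureMap:
    """Element-wise sum of two maps of identical shape."""
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("feature maps must have the same shape")
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def top_down_fuse(pyramid: List[FeatureMap]) -> List[FeatureMap]:
    """Fuse a pyramid ordered fine -> coarse.

    Assumes each level has exactly half the height and width of the
    previous one. Returns the fused maps in the same fine -> coarse order.
    """
    fused = [pyramid[-1]]  # the coarsest level is passed through unchanged
    for finer in reversed(pyramid[:-1]):
        fused.append(add_maps(finer, upsample_nearest_2x(fused[-1])))
    fused.reverse()
    return fused


if __name__ == "__main__":
    # Hypothetical toy inputs: 4x4, 2x2 and 1x1 maps.
    c3 = [[1.0] * 4 for _ in range(4)]
    c4 = [[10.0, 20.0], [30.0, 40.0]]
    c5 = [[100.0]]
    p3, p4, p5 = top_down_fuse([c3, c4, c5])
    print(p4)
    print(p3[0])
```

In a real detector the maps would be multi-channel tensors and each level would pass through learned convolutions before and after the sum; the sketch only shows how coarse, semantically strong features are propagated down to finer resolutions.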

Acknowledgment

This work was partially supported by the National Key Research and Development Program of China under Grant No. 2018AAA0100400, the Natural Science Foundation of Shandong Province under Grants No. ZR2020MF131 and No. ZR2021ZD19, and the Science and Technology Program of Qingdao under Grant No. 21-1-4-ny-19-nsh.

Author information

Authors and Affiliations

College of Computer Science and Technology, Ocean University of China, Qingdao, 266100, China
Andi Huang, Guoqiang Zhong, Hao Li & Daewon Choi

Corresponding author

Correspondence to Guoqiang Zhong.

Editor information

Editors and Affiliations

Tsinghua University, Beijing, China
Lu Fang
Xiaomi Inc., Beijing, China
Daniel Povey
Shanghai Jiao Tong University, Shanghai, China
Guangtao Zhai
JD Explore Academy, Beijing, China
Tao Mei
Chinese Academy of Sciences, Beijing, China
Ruiping Wang

About this paper

Cite this paper

Huang, A., Zhong, G., Li, H., Choi, D. (2022). Underwater Object Detection Using Restructured SSD. In: Fang, L., Povey, D., Zhai, G., Mei, T., Wang, R. (eds) Artificial Intelligence. CICAI 2022. Lecture Notes in Computer Science, vol 13604. Springer, Cham. https://doi.org/10.1007/978-3-031-20497-5_43

DOI: https://doi.org/10.1007/978-3-031-20497-5_43
Published: 17 December 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-20496-8
Online ISBN: 978-3-031-20497-5
eBook Packages: Computer Science, Computer Science (R0)
