Abstract
Deep learning has been widely used in computer vision tasks such as image classification, semantic segmentation and object detection, which has achieved many breakthrough results in recent years. Compared with conventional object detection tasks, due to objective factors such as uneven illumination, low contrast, and more impurities in the underwater environment, these is no guarantee of high quality for underwater images, which brings challenges to the underwater object detection task. In this paper, we construct an underwater object detection model based on multi-scale feature fusion (called Multi-scale Feature Fusion Network for Underwater Object Detection, MFFNet). MFFNet uses SSD model as the baseline, then makes an improvement by adding three different modules, which are improved FPN, assisting backbone and CBAM attention module. Based on VGG-16 and ResNet-50 as the backbone network, the composite backbone connection is performed; the attention mechanism CBAM module is involved to make the network pay more attention to the objects; the feature pyramid FPN structure is used for multi-scale feature detection. To verify the effectiveness of the network model proposed in this paper, experiments are carried out on three datasets, i.e., VOC 2007, UPRC and Fish4knowledges. The experimental results show that compared with other main object detection models, the network model proposed in this paper has obvious advantages in underwater object detection, and can obtain higher detection accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ge, Z., McCool, C., Sanderson, C., Corke, P.I.: Modelling local deep convolutional neural network features to improve fine-grained image classification. CoRR, vol. abs/1502.07802 (2015). http://arxiv.org/abs/1502.07802
Liu, W., et al.: SSD: single shot multibox detector. CoRR, vol. abs/1512.02325 (2015). http://arxiv.org/abs/1512.02325
Woo, S., Park, J., Lee, J., Kweon, I.S.: CBAM: convolutional block attention module. CoRR, vol. abs/1807.06521 (2018). http://arxiv.org/abs/1807.06521
Lin, T., et al.: Feature pyramid networks for object detection. CoRR, vol. abs/1612.03144 (2016). http://arxiv.org/abs/1612.03144
Rathi, D., Jain, S., Indu, S.: Underwater fish species classification using convolutional neural network and deep learning. CoRR, vol. abs/1805.10106 (2018). http://arxiv.org/abs/1805.10106
Mandal, R., Connolly, R.M., Schlacher, T.A., Stantic B.: Assessing fish abundance from underwater video using deep neural networks. CoRR, vol. abs/1807.05838 (2018). http://arxiv.org/abs/1807.05838
Rekha, B.S., Srinivasan, G.N., Reddy, S.K., Kakwani, D., Bhattad, N.: Fish detection and classification using convolutional neural networks. In: Smys, S., Tavares, J.M.R.S., Balas, V.E., Iliyasu, A.M. (eds.) ICCVBIC 2019. AISC, vol. 1108, pp. 1221–1231. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-37218-7_128
Sun, S., Pang, J., Shi, J., Yi, S., Ouyang, W.: Fishnet: a versatile backbone for image, region, and pixel level prediction. CoRR, vol. abs/1901.03495 (2019). http://arxiv.org/abs/1901.03495
Konovalov, D.A., Saleh, A., Bradley, M., Sankupellay, M., Marini, S., Sheaves, M.: Underwater fish detection with weak multi-domain supervision. CoRR, vol. abs/1905.10708 (2019). http://arxiv.org/abs/1905.10708
Olsvik, E., et al.: Biometric fish classification of temperate species using convolutional neural network with squeeze-and-excitation. CoRR, vol. abs/1904.02768 (2019). http://arxiv.org/abs/1904.02768
Cui, S., Zhou, Y., Wang, Y., Zhai, L.: Fish detection using deep learning. Appl. Comput. Intell. Soft Comput. 2020(11), 1–13 (2020)
Salman, A., Siddiqui, S.A., Shafait, F., Mian, A.S., Schwanecke, U.: Automatic fish detection in underwater videos by a deep neural network-based hybrid motion learning system. ICES J. Mar. Sci. (2019)
Knausgård, K.M., et al: Temperate fish detection and classification: a deep learning based approach. CoRR, vol. abs/2005.07518 (2020). http://arxiv.org/abs/2005.07518
Iqbal, M.A., Wang, Z., Ali, Z.A., Riaz, S.: Automatic fish species classification using deep convolutional neural networks. Wirel. Personal Commun. 116(1), 1043–1053 (2021)
Liu, Y., et al.: CBNet: a novel composite backbone network architecture for object detection (2019)
Boom, B.J., Huang, X., He, J., Fisher, R.B.: Supporting ground-truth annotation of image datasets using clustering (2012)
Lin, W.-H., Zhong, J.-X., Liu, S., Li, T., Li, G.: ROIMIX: proposal-fusion among multiple images for underwater object detection. In: ICASSP 2020–2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2588–2592 (2020)
Li, C., et al.: An underwater image enhancement benchmark dataset and beyond. IEEE Trans. Image Process. 29, 4376–4389 (2020)
Hitam, M.S., Awalludin, E.A., Jawahir Hj Wan Yussof, W.N., Bachok, Z.: Mixture contrast limited adaptive histogram equalization for underwater image enhancement. In: 2013 International Conference on Computer Applications Technology (ICCAT), pp. 1–5 (2013)
Chen, L., et al.: Underwater object detection using invert multi-class adaboost with deep learning. In: 2020 International Joint Conference on Neural Networks (IJCNN), pp. 1–8 (2020)
Reza, A.M.: Realization of the contrast limited adaptive histogram equalization (CLAHE) for real-time image enhancement. J. VLSI Signal Process. Syst. Signal Image Video Technol. 38(1), 35–44 (2004)
Acknowledgment
This work was partially supported by the National Key Research and Development Program of China under Grant No. 2018AAA0100400, the Natural Science Foundation of Shandong Province under Grants No. ZR2020MF131 and No. ZR2021ZD19, and the Science and Technology Program of Qingdao under Grant No. 21-1-4-ny-19-nsh.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Huang, A., Zhong, G., Li, H., Choi, D. (2022). Underwater Object Detection Using Restructured SSD. In: Fang, L., Povey, D., Zhai, G., Mei, T., Wang, R. (eds) Artificial Intelligence. CICAI 2022. Lecture Notes in Computer Science(), vol 13604. Springer, Cham. https://doi.org/10.1007/978-3-031-20497-5_43
Download citation
DOI: https://doi.org/10.1007/978-3-031-20497-5_43
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-20496-8
Online ISBN: 978-3-031-20497-5
eBook Packages: Computer ScienceComputer Science (R0)