Underwater Object Detection Using Restructured SSD

  • Conference paper
Artificial Intelligence (CICAI 2022)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 13604)

Abstract

Deep learning has been widely used in computer vision tasks such as image classification, semantic segmentation and object detection, and has achieved many breakthrough results in recent years. Underwater object detection is harder than conventional object detection: because of objective factors such as uneven illumination, low contrast and more impurities in the underwater environment, there is no guarantee that underwater images are of high quality. In this paper, we construct an underwater object detection model based on multi-scale feature fusion, called the Multi-scale Feature Fusion Network for Underwater Object Detection (MFFNet). MFFNet uses the SSD model as its baseline and improves it with three modules: an improved FPN, an assisting backbone and the CBAM attention module. VGG-16 and ResNet-50 serve as the backbone networks and are joined by a composite backbone connection; the CBAM attention module is introduced so that the network pays more attention to the objects; and the feature pyramid (FPN) structure is used for multi-scale feature detection. To verify the effectiveness of the proposed network, experiments are carried out on three datasets: VOC 2007, UPRC and Fish4Knowledge. The experimental results show that, compared with other mainstream object detection models, the proposed network has clear advantages in underwater object detection and obtains higher detection accuracy.
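To make the idea of multi-scale feature fusion more concrete, the following is a minimal sketch of the top-down pathway of a standard FPN, written in plain Python on toy single-channel maps. It is not the paper's improved FPN, whose details the abstract does not give. The function names (upsample_nearest_2x, add_maps, top_down_fusion) and the toy maps c3, c4, c5 are hypothetical, and the 1x1 lateral convolutions of a real FPN are omitted on the assumption that all levels already share the same channel count.

```python
def upsample_nearest_2x(fmap):
    """Double the height and width of a 2D map by repeating each value."""
    out = []
    for row in fmap:
        wide = [v for v in row for _ in range(2)]  # repeat each column
        out.append(wide)
        out.append(list(wide))                     # repeat each row
    return out


def add_maps(a, b):
    """Element-wise sum of two 2D maps with identical shapes."""
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("feature maps must have the same shape")
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def top_down_fusion(levels):
    """Fuse backbone maps ordered fine -> coarse, in the style of FPN.

    Starts from the coarsest map, then repeatedly upsamples the current
    fused map and adds it to the next finer backbone map. Returns the
    fused maps in the same fine -> coarse order.
    """
    fused = [levels[-1]]
    for lateral in reversed(levels[:-1]):
        fused.insert(0, add_maps(lateral, upsample_nearest_2x(fused[0])))
    return fused


# Toy (assumed) single-channel maps: 4x4 (fine), 2x2, 1x1 (coarse).
c3 = [[1.0] * 4 for _ in range(4)]
c4 = [[10.0] * 2 for _ in range(2)]
c5 = [[100.0]]
p3, p4, p5 = top_down_fusion([c3, c4, c5])
```

In this sketch every fused level carries information from all coarser levels, which is why a detector head attached to the fine map p3 can benefit from the semantic context of c5 when locating small objects.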

Acknowledgment

This work was partially supported by the National Key Research and Development Program of China under Grant No. 2018AAA0100400, the Natural Science Foundation of Shandong Province under Grants No. ZR2020MF131 and No. ZR2021ZD19, and the Science and Technology Program of Qingdao under Grant No. 21-1-4-ny-19-nsh.

Author information

Corresponding author

Correspondence to Guoqiang Zhong.

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Cite this paper

Huang, A., Zhong, G., Li, H., Choi, D. (2022). Underwater Object Detection Using Restructured SSD. In: Fang, L., Povey, D., Zhai, G., Mei, T., Wang, R. (eds) Artificial Intelligence. CICAI 2022. Lecture Notes in Computer Science, vol 13604. Springer, Cham. https://doi.org/10.1007/978-3-031-20497-5_43

  • DOI: https://doi.org/10.1007/978-3-031-20497-5_43

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-20496-8

  • Online ISBN: 978-3-031-20497-5

  • eBook Packages: Computer Science, Computer Science (R0)
