skip to main content
10.1145/3584376.3584523acmotherconferencesArticle/Chapter ViewAbstractPublication PagesricaiConference Proceedingsconference-collections
research-article

Research on underwater biological detection method based on multi-scale fusion and attention mechanism

Published: 19 April 2023 Publication History

Abstract

Underwater robots are an important tool for exploiting marine resources, while underwater target detection is a key technology for marine fishing by underwater robots. For the problem that the traditional algorithm is difficult to detect small targets and multiple scales in the underwater environment, this paper proposes an underwater biological detection algorithm based on the multi-scale feature fusion and attention mechanism. First, a YOLOv5 network is constructed, and a bi-directionally weighted BiFPN feature pyramid structure is fed into the neck network so that multiscale target features can be efficiently merged. Second, a channel and spatial convolutional attention mechanism is introduced between the neck network and the prediction network in order to increase the saliency of the underwater target that needs to be detected, which effectively reduces the missed and erroneous detection of underwater targets. The experimental results show that the proposed algorithm can accurately detect multi-scale underwater targets in complex underwater environments with an average accuracy of 80.18%, which is 2.87% better than YOLOv5, and an average recall of 85.06%, which is 4.81% better than YOLOv5.

References

[1]
Yu H. Research progress on object detection and tracking techniques utilization in aquaculture:a review [J]. Journal of Dalian Ocean University, 2020, 35(06): 793-804.
[2]
Li X, Shang M, Qin H, Fast accurate fish detection and recognition of underwater images with fast r-cnn[C]//OCEANS 2015-MTS/IEEE Washington. IEEE, 2015, 1-5.
[3]
Lv X. Research and Implementation of Object Detection in Underwater Images Based on Faster R-CNN [D]. Harbin Institute of Technology, 2019.
[4]
Fang X. Underwater Image Recognition Algorithm Based on Deep Neural Network [D]. Hangzhou Dianzi University, 2022.
[5]
Sun J. Research on Object Detection Technology of Sea Creatures Based on Deep Learning [D]. Harbin Engineering University, 2018.
[6]
Shi P, Hang S, Yang X, Underwater object detection algorithm combining data enhancement and improved YOLOv4 [J]. Journal of Electronic Measurement and Instrumentation, 2022, 36(03): 113-121.
[7]
Redmon J, Farhadi A. Yolov3: An incremental improvement. arXiv, 2018, arXiv:1804.02767.
[8]
Bochkovskiy A, Wang C, H Y. Yolov4: Optimal speed and accuracy of object detection. arXiv, 2020, arXiv:2004.10934.
[9]
Wang C, Liao H, Wu Y, In CSPNet: A new backbone that can enhance learning capability of CNN, Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops, 2020, pp 390-391.
[10]
Liu S, Qi L, Qin H, In Path aggregation network for instance segmentation, Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp 8759-8768.
[11]
Lin T, Dollár P, Girshick, R, In Feature pyramid networks for object detection, Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp 2117-2125.
[12]
Vaswani A, Shazeer N, Parmar N, Attention Is All You Need. arXiv, 2017, arXiv.1706.03762.
[13]
Tan M, Pang R, Le Q, In Efficientdet: Scalable and efficient object detection, Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020, pp 10781-10790.
[14]
Liu S, Huang D, Wang Y, Learning spatial fusion for single-shot object detection. arXiv preprint arXiv:1911.09516, 2019.
[15]
Woo S, Park J, Lee J, In Cbam: Convolutional block attention module, Proceedings of the European conference on computer vision (ECCV), 2018, pp 3-19.
[16]
Hu J, Shen L and Sun G. In Squeeze-and-excitation networks, Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp 7132-7141.
[17]
Desai M, Shah M. An anatomization on breast cancer detection and diagnosis employing multi-layer perceptron neural network (MLP) and Convolutional neural network (CNN). Clin. Ehealth, 2021, 4, 1-11.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
RICAI '22: Proceedings of the 2022 4th International Conference on Robotics, Intelligent Control and Artificial Intelligence
December 2022
1396 pages
ISBN:9781450398343
DOI:10.1145/3584376
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 April 2023

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

RICAI 2022

Acceptance Rates

Overall Acceptance Rate 140 of 294 submissions, 48%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 38
    Total Downloads
  • Downloads (Last 12 months)14
  • Downloads (Last 6 weeks)8
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media