Skip to main content

Advertisement

Log in

Optimzied resnet model of convolutional neural network for under sea water object detection and classification

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

The world’s ocean depths conceal a big mystery, and obtaining the information contained therein is a significant challenge that must be overcome. With the advent of computer vision technologies and robotics, the underwater environment is explored recently. The vast data collected from numerous underwater sensors have a variety of complications related to inadequate image quality, difficulty in acquiring training samples, and uncontrolled objects in the underwater environment. When these images are processed using machine learning techniques that involve manual intervention, the time taken to process a huge amount of images will be relatively high and prone to errors. To tackle these, we propose a novel hybrid capuchin-based coevolving particle swarm optimization (HC2PSO) algorithm with a ResNet model of Convolutional Neural Network (CNN) architecture for underwater object identification. This work mainly aims to explore different underwater objects such as fish, corals, sea urchins, etc. The speckle-reducing anisotropic diffusion (SRAD) filter performs the pre-processing step. The denoising autoencoder (DA) is used for feature extraction which can enhance the partially distorted sample images and offer increased robustness. To overcome the overfitting issue in CNN, the HC2PSO algorithm is used. The experimental works are handled in MATLAB software. Both with and without pre-processing results in terms of SRAD filter are checked and evaluated. The proposed method’s effectiveness is evaluated through various measures like accuracy, specificity, sensitivity, false-positive rate, false-negative rates, etc. The accuracy of the HC2PSO-CNN classifier is higher when compared to the standard CNN classifier in recognizing the underwater objects when evaluated with different performance metrics.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Algorithm 1
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Data availability

Data sharing is not applicable to this article as no new data were created or analyzed in this study.

Code availability

Not applicable.

References

  1. Berman D, Levy D, Avidan S, Treibitz T (2020) Underwater single image color restoration using haze-lines and a new quantitative dataset. IEEE Trans Pattern Anal Mach Intell 43:2822–2837

    Google Scholar 

  2. Braik M, Sheta A, Al-Hiary H (2021) A novel meta-heuristic search algorithm for solving optimization problems: capuchin search algorithm. Neural Comput & Applic 33(7):2515–2547

    Article  Google Scholar 

  3. Chen W, Fan B (2020) Underwater object detection with mixed attention mechanism and multi-enhancement strategy. In: 2020 Chinese automation congress (CAC). IEEE, pp 2821–2826

    Chapter  Google Scholar 

  4. Fayaz S, Parah SA, Qureshi GJ (2022) Underwater object detection: architectures and algorithms–a comprehensive review. Multimed Tools Appl 81(15):20871–20916

  5. Gehring J, Miao Y, Metze F, Waibel A (2013) Extracting deep bottleneck features using stacked auto-encoders. In: 2013 IEEE international conference on acoustics, speech and signal processing. IEEE, pp 3377–3381

    Chapter  Google Scholar 

  6. Han F, Yao J, Zhu H, Wang C (2020) Underwater image processing and object detection based on deep CNN method. J Sensors 2020:1–20

    Google Scholar 

  7. Jalled F, Voronkov I (2016) Object detection using image processing. arXiv preprint arXiv:1611.07791

  8. Krohling RA, dos Santos Coelho L (2006) Coevolutionary particle swarm optimization using Gaussian distribution for solving constrained optimization problems. IEEE Trans Syst Man Cybern B Cybern 36(6):1407–1416

    Article  Google Scholar 

  9. Kvasić I, Mišković N, Vukić Z (2019) Convolutional neural network architectures for sonar-based diver detection and tracking. In: OCEANS, 2019-Marseille. IEEE, pp 1–6

    Google Scholar 

  10. Li C, Guo C, Ren W, Cong R, Hou J, Kwong S, Tao D (2019) An underwater image enhancement benchmark dataset and beyond. IEEE Trans Image Process 29:4376–4389

    Article  MATH  Google Scholar 

  11. Li Z, Yuan Y, Ma D (2021) Selection based on statistical characteristics for object detection. In: ICASSP 2021-2021 IEEE international conference on acoustics, speech, and signal processing (ICASSP). IEEE, pp 1485–1489

    Chapter  Google Scholar 

  12. Lin WH, Zhong JX, Liu S, Li T, Li G (2020) Roimix: proposal-fusion among multiple images for underwater object detection. In: ICASSP 2020-2020 IEEE international conference on acoustics, speech, and signal processing (ICASSP). IEEE, pp 2588–2592

    Chapter  Google Scholar 

  13. Lu H, Li Y, Xu X, He L, Li Y, Dansereau D, Serikawa S (2016) Underwater image descattering and quality assessment. In: 2016 IEEE international conference on image processing (ICIP). IEEE, pp 1998–2002

    Chapter  Google Scholar 

  14. Lu H, Li Y, Zhang Y, Chen M, Serikawa S, Kim H (2017) Underwater optical image processing: a comprehensive review. Mob Netw Appl 22(6):1204–1211

    Article  Google Scholar 

  15. Mitra V, Wang CJ, Banerjee S (2006) Lidar detection of underwater objects using a neuro-SVM-based architecture. IEEE Trans Neural Netw 17(3):717–731

    Article  Google Scholar 

  16. Pan T-S, Huang H-C, Lee J-C, Chen C-H (2021) Multi-scale ResNet for real-time underwater object detection. SIViP 15(5):941–949

    Article  Google Scholar 

  17. Qin X, Luo X, Wu Z, Shang J (2021) Optimizing the sediment classification of small side-scan sonar images based on deep learning. IEEE Access 9:29416–29428

    Article  Google Scholar 

  18. Ranganath HS, Kuntimad G (1999) Object detection using pulse coupled neural networks. IEEE Trans Neural Netw 10(3):615–620

    Article  Google Scholar 

  19. Shakya S (2020) Analysis of artificial intelligence based image classification techniques. J Innov Image Process 2(01):44–54

    Article  Google Scholar 

  20. Singh S, Suri A, Singh JN, Singh M, Yadav DK (2021) Object identification and tracking using YOLO model: a CNN-based approach. In: Machine learning and information processing. Springer, Singapore, pp 153–160

    Chapter  Google Scholar 

  21. Wang Z, Liu C, Wang S, Tang T, Tao Y, Yang C, Li H, Liu X, Fan X (2020) UDD: an underwater open-sea farm object detection dataset for underwater robot picking. arXiv e-prints, pp.arXiv-2003

  22. Wei X, Yu L, Tian S, Feng P, Ning X (2021) Underwater target detection with an attention mechanism and improved scale. Multimed Tools Appl 80(25):33747–33761

    Article  Google Scholar 

  23. Yoo H, Han S, Chung K (2021) Diagnosis support model of cardiomegaly based on CNN using ResNet and explainable feature map. IEEE Access 9:55802–55813

    Article  Google Scholar 

  24. Yu Y, Acton ST (2002) Speckle reducing anisotropic diffusion. IEEE Trans Image Process 11(11):1260–1270

    Article  MathSciNet  Google Scholar 

  25. Zhang J, Cao Y, Han G, Fu X (2019) Deep neural network-based underwater OFDM receiver. IET Commun 13(13):1998–2002

    Article  Google Scholar 

  26. Zhao Z, Liu Y, Sun X, Liu J, Yang X, Zhou C (2021) Composited FishNet: fish detection and species recognition from low-quality underwater videos. IEEE Trans Image Process 30:4719–4734

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to V. Malathi.

Ethics declarations

Informed consent

Informed consent does not apply as this was a retrospective review with no identifying patient information.

Human and animal rights

This article does not contain any studies with human or animal subjects performed by any of the authors.

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

Appendix

The probability density functions can be given as abs(N(0, 1)) and can be determined as,

$$ r(a)=\frac{2}{\sqrt{2\Pi}}\exp -\frac{a^2}{2},\kern0.36em a\ge 0 $$
(26)

The average value of abs(N(0, 1)) is estimated as,

$$ M(a)=\frac{2}{\sqrt{2\Pi}}\int_0^{\infty }{ae}^{-\frac{a^2}{2} da}=0.798 $$
(27)

The variance value can be calculated for the abs(N(0, 1)) as shown below,

$$ V(a)={\sigma}^2\frac{2}{\sqrt{2\Pi}}\int_0^{\infty }{\left(a-\varsigma \right)}^2{e}^{\frac{-{a}^2}{2}} da $$
(28)
$$ {\sigma}^2=\frac{2}{\sqrt{2\Pi}}\int_0^{\infty }{\left(a-\varsigma \right)}^2{e}^{\frac{-{a}^2}{2}} da=0.36 $$
(29)

From the above eqn, the obtained SD is σ = 0.60. Then it is essential to produce the stochastic coefficients for the terms (PSi − ai(t)) and (PSη − ai(t)). Henceforth the velocity equation can be upgraded as,

$$ {V}_i\left(t+1\right)=\left| rann\right|\left({PS}_i-{a}_i(t)\right)+\left|\mathit{\operatorname{ran}}1n\right|\left({PS}_{\eta }-{a}_i(t)\right) $$
(30)

The positive random number generated by the abs(N(0, 1)) is represented as |rann| and |ran1n|. Assume a random variable c that has been obtained from the sum of the above-mentioned random variables. The probability density function is considered as p(a). from this, the probability distribution function is determined as,

$$ p(a)=\frac{2}{\sqrt{\Pi}}{e}^{-\frac{c^2}{4}}.\mathit{\operatorname{erf}}(0.5c) $$
(31)

Here the value of erf(c) is defined as,

$$ \mathit{\operatorname{erf}}(c)=\left(1/\sqrt{\Pi}\int_0^c{e}^{-{a}^2} da\right) $$
(32)

Thus the generation of stochastic coefficients with the aid of abs(N(0, 1)) enables the compromise among a large number of low amplitudes and the less number of maximum amplitudes i.e., fine-tuning of parameters.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Malathi, V., Manikandan, A. & Krishnan, K. Optimzied resnet model of convolutional neural network for under sea water object detection and classification. Multimed Tools Appl 82, 37551–37571 (2023). https://doi.org/10.1007/s11042-023-15041-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-15041-5

Keywords

Navigation