Skip to main content
Log in

A novel finetuned YOLOv8 model for real-time underwater trash detection

  • Research
  • Published:
Journal of Real-Time Image Processing Aims and scope Submit manuscript

Abstract

When recognizing underwater images, problems, including poor image quality and complicated backdrops, are significant. The main problem of underwater images is the blurriness and invisibility of objects present in an image. This study presents a unique object identification design built on a YOLOv8 (You Only Look Once) framework upgraded to address these problems and further improve the models' accuracy. The study also helps in identifying underwater trash. The model is a two-phase detector model. The first phase has an Underwater Image Enhancer (UIE) data augmentation technique that works with Laplacian pyramids and gamma correctness methods to enhance the underwater images. The second phase, the proposed refined, innovative YOLOv8 model for classification purposes, takes the output from the first stage as its input. The YOLOv8 model's existing feature extractor is replaced in this study with a new feature extractor technique, HEFA, that yields superior results and better detection accuracy. The introduction of the UIE and HEFA feature extractor method represents the significant novelty of this paper. The proposed model is pruned simultaneously to eliminate unnecessary parameters and further condense the model. Pruning causes the model's accuracy to decline. Thus, the transfer learning procedure is employed to raise it. The trials’ findings show that the technique can detect objects with an accuracy of 98.5% and a mAP@50 of 98.1% and that its real-time detection speed on the GPU is double that of the YOLOv8m model's baseline performance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Algorithm 1
Fig. 2
Fig. 3
Fig. 4
Algorithm 2
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

Data availability

The data used in the work is freely accessible via Fulton et al. [38].

References

  1. Namadi, P., Deng, Z.: Deep learning-based ensemble modeling of Vibrio parahaemolyticus concentration in marine environment. Environ. Monit. Assess.Monit. Assess. (2023). https://doi.org/10.1007/s10661-022-10836-9

    Article  Google Scholar 

  2. Zhao, W., Han, F., Qiu, X., Peng, X., Zhao, Y., Zhang, J.: Research on the identification and distribution of biofouling using underwater cleaning robot based on deep learning. Ocean Eng. 273, 113909 (2023). https://doi.org/10.1016/j.oceaneng.2023.113909

    Article  Google Scholar 

  3. Xu, S., Zhang, M., Song, W., Mei, H., He, Q., Liotta, A.: A systematic review and analysis of deep learning-based underwater object detection. Neurocomputing 527, 204–232 (2023). https://doi.org/10.1016/j.neucom.2023.01.056

    Article  Google Scholar 

  4. Farid, A., Hussain, F., Khan, K., Shahzad, M., Khan, U., Mahmood, Z.: A fast and accurate real-time vehicle detection method using deep learning for unconstrained environments. Appl. Sci. (2023). https://doi.org/10.3390/app13053059

    Article  Google Scholar 

  5. Sangeeta, G.P.: Improved video compression using variable emission step ConvGRU based architecture. Lect. Notes Data Eng. Commun. Technol. 61, 405–415 (2021). https://doi.org/10.1007/978-981-33-4582-9_31/COVER

    Article  Google Scholar 

  6. Gupta, C., Gill, N.S., Gulia, P., Chatterjee, J.M.: A novel finetuned YOLOv6 transfer learning model for real-time object detection. J. Real Time Image Process. (2023). https://doi.org/10.1007/s11554-023-01299-3

    Article  Google Scholar 

  7. Diwan, T., Anirudh, G., Tembhurne, J.V.: Object detection using YOLO: challenges, architectural successors, datasets and applications. Multimed. Tools Appl. (2022). https://doi.org/10.1007/s11042-022-13644-y

    Article  Google Scholar 

  8. Gupta, C., Gill, N.S., Gulia, P.: SSDT: distance tracking model based on deep learning. Int. J. Electr. Comput. Eng. Syst. 13, 339–348 (2022). https://doi.org/10.32985/ijeces.13.5.2

    Article  Google Scholar 

  9. Mittal, U., Chawla, P., Tiwari, R.: EnsembleNet: a hybrid approach for vehicle detection and estimation of traffic density based on faster R-CNN and YOLO models. Neural Comput. Appl.Comput. Appl. 35, 4755–4774 (2023). https://doi.org/10.1007/s00521-022-07940-9

    Article  Google Scholar 

  10. Qiu, Z., Rong, S., Ye, L.: YOLF-ShipPnet: improved RetinaNet with pyramid vision transformer. Int. J. Comput. Intell. Syst. (2023). https://doi.org/10.1007/s44196-023-00235-4

    Article  Google Scholar 

  11. Peng, W.Y., Peng, Y.T., Lien, W.C., Chen, C.S.: Unveiling of how image restoration contributes to underwater object detection. In: 2021 IEEE International Conference on Consumer Electronics-Taiwan (ICCE-TW), pp. 1–2. IEEE (2021). https://doi.org/10.1109/ICCE-TW52618.2021.9602998

  12. Liu, K., Peng, L., Tang, S.: Underwater object detection using TC-YOLO with attention mechanisms. Sensors (2023). https://doi.org/10.3390/s23052567

    Article  Google Scholar 

  13. Wang, H., Sun, S., Bai, X., Wang, J., Ren, P.: A reinforcement learning paradigm of configuring visual enhancement for object detection in underwater scenes. IEEE J. Ocean. Eng. (2023). https://doi.org/10.1109/JOE.2022.3226202

    Article  Google Scholar 

  14. Song, P., Li, P., Dai, L., Wang, T., Chen, Z.: Boosting R-CNN: reweighting R-CNN samples by RPN’s error for underwater object detection. Neurocomputing 530, 150–164 (2023). https://doi.org/10.1016/j.neucom.2023.01.088

    Article  Google Scholar 

  15. Lee, M.F.R., Chen, Y.C.: Artificial intelligence based object detection and tracking for a small underwater robot. Processes (2023). https://doi.org/10.3390/pr11020312

    Article  Google Scholar 

  16. Yu, H., Li, X., Feng, Y., Han, S.: Multiple attentional path aggregation network for marine object detection. Appl. Intell.Intell. 53, 2434–2451 (2023). https://doi.org/10.1007/s10489-022-03622-0

    Article  Google Scholar 

  17. Son, Y.-T., Jin, S.-Y., Kang, T.-S.: Object detection and classification applying AI (computer vision) to underwater images. EGU23 (2023). https://doi.org/10.5194/EGUSPHERE-EGU23-2203

  18. Wu, C.M., Sun, Y.Q., Wang, T.J., Liu, Y.L.: Underwater trash detection algorithm based on improved YOLOv5s. J. Real Time Image Process. 19, 911–920 (2022). https://doi.org/10.1007/s11554-022-01232-0

    Article  Google Scholar 

  19. Zhang, X., Fang, X., Pan, M., Yuan, L., Zhang, Y., Yuan, M., Lv, S., Yu, H.: A marine organism detection framework based on the joint optimization of image enhancement and object detection. Sensors 21, 1–17 (2021). https://doi.org/10.3390/s21217205

    Article  Google Scholar 

  20. Wang, C.C., Samani, H., Yang, C.Y.: Object Detection with Deep Learning for Underwater Environment. Proceedings of 4th International Conference Information Technology Res. Bridg. Digit. Divid. Through Multidiscip. Res, pp. 1–6. ICITR (2019). https://doi.org/10.1109/ICITR49409.2019.9407797

  21. Ji, S.J., Ling, Q.H., Han, F.: An improved algorithm for small object detection based on YOLO v4 and multi-scale contextual information. Comput. Electr. Eng.. Electr. Eng. 105, 108490 (2023). https://doi.org/10.1016/j.compeleceng.2022.108490

    Article  Google Scholar 

  22. Zhang, J., Zhang, J., Zhou, K., Zhang, Y., Chen, H., Yan, X.: An improved YOLOv5-based underwater object-detection framework. Sensors 23, 1–21 (2023)

    Google Scholar 

  23. Liu, K., Sun, Q., Sun, D., Peng, L., Yang, M., Wang, N.: Underwater target detection based on improved YOLOv7. Mar. Sci. Eng. (2023). https://doi.org/10.23919/CCC55666.2022.9901920

    Article  Google Scholar 

  24. Lou, H., Duan, X., Guo, J., Liu, H., Gu, J., Bi, L., Chen, H.: DC-YOLOv8: small-size object detection algorithm based on camera sensor. Electronics 12(10), 2323 (2023). https://doi.org/10.20944/preprints202304.0124.v1

    Article  Google Scholar 

  25. Li, Y., Fan, Q., Huang, H., Han, Z., Gu, Q.: A modified YOLOv8 detection network for UAV aerial image recognition. Drones 7, 304 (2023)

    Article  Google Scholar 

  26. Kim, J.H., Kim, N., Won, C.S.: High-speed drone detection based on Yolo-V8. In: ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1–2. IEEE (2023)

  27. Lou, H., Duan, X., Guo, J., Liu, H., Gu, J., Bi, L., Chen, H.: DC-YOLOv8: small-size object detection algorithm based on camera sensor. Electron 12, 1–14 (2023). https://doi.org/10.3390/electronics12102323

    Article  Google Scholar 

  28. Wang, C.Y., Mark Liao, H.Y., Wu, Y.H., Chen, P.Y., Hsieh, J.W., Yeh, I.H.: CSPNet: a new backbone that can enhance learning capability of CNN. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit. Work. 2020-June, pp. 1571–1580 (2020). https://doi.org/10.1109/CVPRW50498.2020.00203

  29. Ju, R.-Y., Cai, W.: Fracture detection in pediatric wrist trauma X-ray images using YOLOv8 algorithm. Sci. Rep. Rep 13, 1–12 (2023)

    Google Scholar 

  30. GitHub - ultralytics/yolov5: YOLOv5 in PyTorch > ONNX > CoreML > TFLite, https://github.com/ultralytics/yolov5

  31. Wang, C.-Y., Bochkovskiy, A., Liao, H.-Y.M.: YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. 1–15 (2022)

  32. Li, X., Yu, H., Chen, H.: Multi-scale aggregation feature pyramid with cornerness for underwater object detection. Vis. Comput.Comput. (2023). https://doi.org/10.1007/s00371-023-02849-3

    Article  Google Scholar 

  33. Feng, C., Zhong, Y., Gao, Y., Scott, M.R., Huang, W.: TOOD: task-aligned one-stage object detection. Proc. IEEE Int. Conf. Comput. Vis. (2021). https://doi.org/10.1109/ICCV48922.2021.00349

    Article  Google Scholar 

  34. Corrigan, B.C., Tay, Z.Y., Konovessis, D.: Real-time instance segmentation for detection of underwater litter as a plastic source. J. Mar. Sci. Eng. (2023). https://doi.org/10.3390/jmse11081532

    Article  Google Scholar 

  35. Wang, Z., Zhang, G., Luan, K., Yi, C., Li, M.: Image-fused-guided underwater object detection model based on improved YOLOv7. Electron 12, 1–12 (2023). https://doi.org/10.3390/electronics12194064

    Article  Google Scholar 

  36. Yuan, X., Fang, S., Li, N., Ma, Q., Wang, Z., Gao, M., Tang, P., Yu, C., Wang, Y.: Performance comparison of sea cucumber detection by the Yolov5 and DETR approach. (2023)

  37. Walia, J.S., Seemakurthy, K.: Optimized custom dataset for efficient detection of underwater trash, pp. 292–303. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-43360-3_24

    Book  Google Scholar 

  38. Fulton, M., Hong, J., Islam, M.J., Sattar, J.: Robotic detection of marine litter using deep visual detection models. In: 2019 International Conference on Robotics and Automation (ICRA), Montreal, QC, Canada, pp. 5752–5758 (2019). https://doi.org/10.1109/ICRA.2019.8793975

Download references

Author information

Authors and Affiliations

Authors

Contributions

All authors have equally contributed for this paper.

Corresponding author

Correspondence to Jyotir Moy Chatterjee.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Gupta, C., Gill, N.S., Gulia, P. et al. A novel finetuned YOLOv8 model for real-time underwater trash detection. J Real-Time Image Proc 21, 48 (2024). https://doi.org/10.1007/s11554-024-01439-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11554-024-01439-3

Keywords

Navigation