A novel finetuned YOLOv8 model for real-time underwater trash detection

Gupta, Chhaya; Gill, Nasib Singh; Gulia, Preeti; Yadav, Sangeeta; Chatterjee, Jyotir Moy

doi:10.1007/s11554-024-01439-3

A novel finetuned YOLOv8 model for real-time underwater trash detection

Research
Published: 08 March 2024

Volume 21, article number 48, (2024)
Cite this article

Journal of Real-Time Image Processing Aims and scope Submit manuscript

Chhaya Gupta¹,
Nasib Singh Gill¹,
Preeti Gulia¹,
Sangeeta Yadav¹ &
…
Jyotir Moy Chatterjee²

404 Accesses
Explore all metrics

Abstract

When recognizing underwater images, problems, including poor image quality and complicated backdrops, are significant. The main problem of underwater images is the blurriness and invisibility of objects present in an image. This study presents a unique object identification design built on a YOLOv8 (You Only Look Once) framework upgraded to address these problems and further improve the models' accuracy. The study also helps in identifying underwater trash. The model is a two-phase detector model. The first phase has an Underwater Image Enhancer (UIE) data augmentation technique that works with Laplacian pyramids and gamma correctness methods to enhance the underwater images. The second phase, the proposed refined, innovative YOLOv8 model for classification purposes, takes the output from the first stage as its input. The YOLOv8 model's existing feature extractor is replaced in this study with a new feature extractor technique, HEFA, that yields superior results and better detection accuracy. The introduction of the UIE and HEFA feature extractor method represents the significant novelty of this paper. The proposed model is pruned simultaneously to eliminate unnecessary parameters and further condense the model. Pruning causes the model's accuracy to decline. Thus, the transfer learning procedure is employed to raise it. The trials’ findings show that the technique can detect objects with an accuracy of 98.5% and a mAP@50 of 98.1% and that its real-time detection speed on the GPU is double that of the YOLOv8m model's baseline performance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Object detection using YOLO: challenges, architectural successors, datasets and applications

Article 08 August 2022

A review of object detection based on deep learning

Article 12 June 2020

Deep learning models for digital image processing: a review

Article 07 January 2024

Data availability

The data used in the work is freely accessible via Fulton et al. [38].

References

Namadi, P., Deng, Z.: Deep learning-based ensemble modeling of Vibrio parahaemolyticus concentration in marine environment. Environ. Monit. Assess.Monit. Assess. (2023). https://doi.org/10.1007/s10661-022-10836-9
Article Google Scholar
Zhao, W., Han, F., Qiu, X., Peng, X., Zhao, Y., Zhang, J.: Research on the identification and distribution of biofouling using underwater cleaning robot based on deep learning. Ocean Eng. 273, 113909 (2023). https://doi.org/10.1016/j.oceaneng.2023.113909
Article Google Scholar
Xu, S., Zhang, M., Song, W., Mei, H., He, Q., Liotta, A.: A systematic review and analysis of deep learning-based underwater object detection. Neurocomputing 527, 204–232 (2023). https://doi.org/10.1016/j.neucom.2023.01.056
Article Google Scholar
Farid, A., Hussain, F., Khan, K., Shahzad, M., Khan, U., Mahmood, Z.: A fast and accurate real-time vehicle detection method using deep learning for unconstrained environments. Appl. Sci. (2023). https://doi.org/10.3390/app13053059
Article Google Scholar
Sangeeta, G.P.: Improved video compression using variable emission step ConvGRU based architecture. Lect. Notes Data Eng. Commun. Technol. 61, 405–415 (2021). https://doi.org/10.1007/978-981-33-4582-9_31/COVER
Article Google Scholar
Gupta, C., Gill, N.S., Gulia, P., Chatterjee, J.M.: A novel finetuned YOLOv6 transfer learning model for real-time object detection. J. Real Time Image Process. (2023). https://doi.org/10.1007/s11554-023-01299-3
Article Google Scholar
Diwan, T., Anirudh, G., Tembhurne, J.V.: Object detection using YOLO: challenges, architectural successors, datasets and applications. Multimed. Tools Appl. (2022). https://doi.org/10.1007/s11042-022-13644-y
Article Google Scholar
Gupta, C., Gill, N.S., Gulia, P.: SSDT: distance tracking model based on deep learning. Int. J. Electr. Comput. Eng. Syst. 13, 339–348 (2022). https://doi.org/10.32985/ijeces.13.5.2
Article Google Scholar
Mittal, U., Chawla, P., Tiwari, R.: EnsembleNet: a hybrid approach for vehicle detection and estimation of traffic density based on faster R-CNN and YOLO models. Neural Comput. Appl.Comput. Appl. 35, 4755–4774 (2023). https://doi.org/10.1007/s00521-022-07940-9
Article Google Scholar
Qiu, Z., Rong, S., Ye, L.: YOLF-ShipPnet: improved RetinaNet with pyramid vision transformer. Int. J. Comput. Intell. Syst. (2023). https://doi.org/10.1007/s44196-023-00235-4
Article Google Scholar
Peng, W.Y., Peng, Y.T., Lien, W.C., Chen, C.S.: Unveiling of how image restoration contributes to underwater object detection. In: 2021 IEEE International Conference on Consumer Electronics-Taiwan (ICCE-TW), pp. 1–2. IEEE (2021). https://doi.org/10.1109/ICCE-TW52618.2021.9602998
Liu, K., Peng, L., Tang, S.: Underwater object detection using TC-YOLO with attention mechanisms. Sensors (2023). https://doi.org/10.3390/s23052567
Article Google Scholar
Wang, H., Sun, S., Bai, X., Wang, J., Ren, P.: A reinforcement learning paradigm of configuring visual enhancement for object detection in underwater scenes. IEEE J. Ocean. Eng. (2023). https://doi.org/10.1109/JOE.2022.3226202
Article Google Scholar
Song, P., Li, P., Dai, L., Wang, T., Chen, Z.: Boosting R-CNN: reweighting R-CNN samples by RPN’s error for underwater object detection. Neurocomputing 530, 150–164 (2023). https://doi.org/10.1016/j.neucom.2023.01.088
Article Google Scholar
Lee, M.F.R., Chen, Y.C.: Artificial intelligence based object detection and tracking for a small underwater robot. Processes (2023). https://doi.org/10.3390/pr11020312
Article Google Scholar
Yu, H., Li, X., Feng, Y., Han, S.: Multiple attentional path aggregation network for marine object detection. Appl. Intell.Intell. 53, 2434–2451 (2023). https://doi.org/10.1007/s10489-022-03622-0
Article Google Scholar
Son, Y.-T., Jin, S.-Y., Kang, T.-S.: Object detection and classification applying AI (computer vision) to underwater images. EGU23 (2023). https://doi.org/10.5194/EGUSPHERE-EGU23-2203
Wu, C.M., Sun, Y.Q., Wang, T.J., Liu, Y.L.: Underwater trash detection algorithm based on improved YOLOv5s. J. Real Time Image Process. 19, 911–920 (2022). https://doi.org/10.1007/s11554-022-01232-0
Article Google Scholar
Zhang, X., Fang, X., Pan, M., Yuan, L., Zhang, Y., Yuan, M., Lv, S., Yu, H.: A marine organism detection framework based on the joint optimization of image enhancement and object detection. Sensors 21, 1–17 (2021). https://doi.org/10.3390/s21217205
Article Google Scholar
Wang, C.C., Samani, H., Yang, C.Y.: Object Detection with Deep Learning for Underwater Environment. Proceedings of 4th International Conference Information Technology Res. Bridg. Digit. Divid. Through Multidiscip. Res, pp. 1–6. ICITR (2019). https://doi.org/10.1109/ICITR49409.2019.9407797
Ji, S.J., Ling, Q.H., Han, F.: An improved algorithm for small object detection based on YOLO v4 and multi-scale contextual information. Comput. Electr. Eng.. Electr. Eng. 105, 108490 (2023). https://doi.org/10.1016/j.compeleceng.2022.108490
Article Google Scholar
Zhang, J., Zhang, J., Zhou, K., Zhang, Y., Chen, H., Yan, X.: An improved YOLOv5-based underwater object-detection framework. Sensors 23, 1–21 (2023)
Google Scholar
Liu, K., Sun, Q., Sun, D., Peng, L., Yang, M., Wang, N.: Underwater target detection based on improved YOLOv7. Mar. Sci. Eng. (2023). https://doi.org/10.23919/CCC55666.2022.9901920
Article Google Scholar
Lou, H., Duan, X., Guo, J., Liu, H., Gu, J., Bi, L., Chen, H.: DC-YOLOv8: small-size object detection algorithm based on camera sensor. Electronics 12(10), 2323 (2023). https://doi.org/10.20944/preprints202304.0124.v1
Article Google Scholar
Li, Y., Fan, Q., Huang, H., Han, Z., Gu, Q.: A modified YOLOv8 detection network for UAV aerial image recognition. Drones 7, 304 (2023)
Article Google Scholar
Kim, J.H., Kim, N., Won, C.S.: High-speed drone detection based on Yolo-V8. In: ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1–2. IEEE (2023)
Lou, H., Duan, X., Guo, J., Liu, H., Gu, J., Bi, L., Chen, H.: DC-YOLOv8: small-size object detection algorithm based on camera sensor. Electron 12, 1–14 (2023). https://doi.org/10.3390/electronics12102323
Article Google Scholar
Wang, C.Y., Mark Liao, H.Y., Wu, Y.H., Chen, P.Y., Hsieh, J.W., Yeh, I.H.: CSPNet: a new backbone that can enhance learning capability of CNN. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit. Work. 2020-June, pp. 1571–1580 (2020). https://doi.org/10.1109/CVPRW50498.2020.00203
Ju, R.-Y., Cai, W.: Fracture detection in pediatric wrist trauma X-ray images using YOLOv8 algorithm. Sci. Rep. Rep 13, 1–12 (2023)
Google Scholar
GitHub - ultralytics/yolov5: YOLOv5 in PyTorch > ONNX > CoreML > TFLite, https://github.com/ultralytics/yolov5
Wang, C.-Y., Bochkovskiy, A., Liao, H.-Y.M.: YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. 1–15 (2022)
Li, X., Yu, H., Chen, H.: Multi-scale aggregation feature pyramid with cornerness for underwater object detection. Vis. Comput.Comput. (2023). https://doi.org/10.1007/s00371-023-02849-3
Article Google Scholar
Feng, C., Zhong, Y., Gao, Y., Scott, M.R., Huang, W.: TOOD: task-aligned one-stage object detection. Proc. IEEE Int. Conf. Comput. Vis. (2021). https://doi.org/10.1109/ICCV48922.2021.00349
Article Google Scholar
Corrigan, B.C., Tay, Z.Y., Konovessis, D.: Real-time instance segmentation for detection of underwater litter as a plastic source. J. Mar. Sci. Eng. (2023). https://doi.org/10.3390/jmse11081532
Article Google Scholar
Wang, Z., Zhang, G., Luan, K., Yi, C., Li, M.: Image-fused-guided underwater object detection model based on improved YOLOv7. Electron 12, 1–12 (2023). https://doi.org/10.3390/electronics12194064
Article Google Scholar
Yuan, X., Fang, S., Li, N., Ma, Q., Wang, Z., Gao, M., Tang, P., Yu, C., Wang, Y.: Performance comparison of sea cucumber detection by the Yolov5 and DETR approach. (2023)
Walia, J.S., Seemakurthy, K.: Optimized custom dataset for efficient detection of underwater trash, pp. 292–303. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-43360-3_24
Book Google Scholar
Fulton, M., Hong, J., Islam, M.J., Sattar, J.: Robotic detection of marine litter using deep visual detection models. In: 2019 International Conference on Robotics and Automation (ICRA), Montreal, QC, Canada, pp. 5752–5758 (2019). https://doi.org/10.1109/ICRA.2019.8793975

Download references

Author information

Authors and Affiliations

Department of Computer Science and Applications, Maharshi Dayanand University, Rohtak, Haryana, India
Chhaya Gupta, Nasib Singh Gill, Preeti Gulia & Sangeeta Yadav
Department of CSE, Graphic Era University, Dehradun, India
Jyotir Moy Chatterjee

Authors

Chhaya Gupta
View author publications
You can also search for this author in PubMed Google Scholar
Nasib Singh Gill
View author publications
You can also search for this author in PubMed Google Scholar
Preeti Gulia
View author publications
You can also search for this author in PubMed Google Scholar
Sangeeta Yadav
View author publications
You can also search for this author in PubMed Google Scholar
Jyotir Moy Chatterjee
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors have equally contributed for this paper.

Corresponding author

Correspondence to Jyotir Moy Chatterjee.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Gupta, C., Gill, N.S., Gulia, P. et al. A novel finetuned YOLOv8 model for real-time underwater trash detection. J Real-Time Image Proc 21, 48 (2024). https://doi.org/10.1007/s11554-024-01439-3

Download citation

Received: 17 July 2023
Accepted: 14 February 2024
Published: 08 March 2024
DOI: https://doi.org/10.1007/s11554-024-01439-3

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A novel finetuned YOLOv8 model for real-time underwater trash detection

Abstract

Access this article

Similar content being viewed by others

Object detection using YOLO: challenges, architectural successors, datasets and applications

A review of object detection based on deep learning

Deep learning models for digital image processing: a review

Data availability

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A novel finetuned YOLOv8 model for real-time underwater trash detection

Abstract

Access this article

Similar content being viewed by others

Object detection using YOLO: challenges, architectural successors, datasets and applications

A review of object detection based on deep learning

Deep learning models for digital image processing: a review

Data availability

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation