Investigating the Transferability of YOLOv5-Based Water Surface Object Detection Model in Maritime Applications

Guo, Yu; Chen, Zhuo; Wang, Qi; Bao, Tao; Zhou, Zexing

doi:10.1007/978-981-99-5847-4_8

Yu Guo^12,13,
Zhuo Chen^12,13,
Qi Wang^12,13,
Tao Bao^12,13 &
…
Zexing Zhou^12,13

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1870))

Included in the following conference series:

International Conference on Neural Computing for Advanced Applications

678 Accesses

Abstract

Object detection on the water surface is crucial for unmanned surface vehicles in maritime environments. Despite the challenges posed by variable lighting and ocean conditions, advancements in this field are necessary. In this paper, we investigate the transferability of YOLOv5-based water surface object detection models in cross-domain scenarios. The evaluation is based on publicly available datasets and two newly proposed datasets, Taihu Trial Dataset(TTD) and Fuxian Trial Dataset(FTD), which contain similar target classes but distinct scene and features. Results from extensive experiments indicate that zero-shot transfer is challenging, but a limited number of samples from the target domain can greatly enhance model performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

An Improved YOLOv5 Approach for Multi-object Detection of Unmanned Surface Vehicles

Cross-Modal Ship Grounding: Towards Large Model for Enhanced Few-Shot Learning

Bi2F-YOLO: a novel framework for underwater object detection based on YOLOv7

Article Open access 26 March 2025

References

Bochkovskiy, A., Wang, C.Y., Liao, H.Y.M.: YOLOv4: optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934 (2020)
Everingham, M., Van Gool, L., Williams, C.K., Winn, J., Zisserman, A.: The pascal visual object classes (voc) challenge. Int. J. Comput. Vision 88, 303–308 (2009)
Article Google Scholar
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014)
Google Scholar
Jocher, G.: YOLOv5 by Ultralytics. https://doi.org/10.5281/zenodo.3908559. https://github.com/ultralytics/yolov5
Kuznetsova, A., et al.: The open images dataset v4: unified image classification, object detection, and visual relationship detection at scale. Int. J. Comput. Vision 128(7), 1956–1981 (2020)
Article Google Scholar
Lin, T.Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125 (2017)
Google Scholar
Lin, T.Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2980–2988 (2017)
Google Scholar
Lin, T.-Y., et al.: Microsoft COCO: common objects in context. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8693, pp. 740–755. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1_48
Chapter Google Scholar
Liu, S., Qi, L., Qin, H., Shi, J., Jia, J.: Path aggregation network for instance segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8759–8768 (2018)
Google Scholar
Liu, W., et al.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2
Chapter Google Scholar
Pang, Y., Yuan, Y., Li, X., Pan, J.: Efficient hog human detection. Sig. Process. 91(4), 773–781 (2011)
Google Scholar
Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016)
Google Scholar
Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7263–7271 (2017)
Google Scholar
Redmon, J., Farhadi, A.: YOLOv3: an incremental improvement. arXiv preprint arXiv:1804.02767 (2018)
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. Advances in Neural Information Processing Systems 28 (2015)
Google Scholar
Rezatofighi, H., Tsoi, N., Gwak, J., Sadeghian, A., Reid, I., Savarese, S.: Generalized intersection over union: a metric and a loss for bounding box regression. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 658–666 (2019)
Google Scholar
Whitehill, J., Omlin, C.W.: Haar features for FACS AU recognition. In: 7th International Conference on Automatic Face and Gesture Recognition (FGR06), pp. 5–101. IEEE (2006)
Google Scholar
Yan, J., Lei, Z., Wen, L., Li, S.Z.: The fastest deformable part model for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2497–2504 (2014)
Google Scholar
Zhao, Z.Q., Zheng, P., Xu, S.T., Wu, X.: Object detection with deep learning: a review. IEEE Trans. Neural Netw. Learn. Syst. 30(11), 3212–3232 (2019)
Google Scholar

Download references

Author information

Authors and Affiliations

China Ship Scientific Research Center, Wuxi, 214000, People’s Republic of China
Yu Guo, Zhuo Chen, Qi Wang, Tao Bao & Zexing Zhou
Taihu Lake Laboratory of Deep Sea Technological Science, Wuxi, 214000, People’s Republic of China
Yu Guo, Zhuo Chen, Qi Wang, Tao Bao & Zexing Zhou

Authors

Yu Guo
View author publications
You can also search for this author in PubMed Google Scholar
Zhuo Chen
View author publications
You can also search for this author in PubMed Google Scholar
Qi Wang
View author publications
You can also search for this author in PubMed Google Scholar
Tao Bao
View author publications
You can also search for this author in PubMed Google Scholar
Zexing Zhou
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yu Guo .

Editor information

Editors and Affiliations

Harbin Institute of Technology, Shenzhen, China
Haijun Zhang
Chaohu University, Hefei, China
Yinggen Ke
Chongqing University, Chongqing, China
Zhou Wu
South China Normal University, Guangzhou, China
Tianyong Hao
Hefei University of Technology, Hefei, China
Zhao Zhang
Technical University of Denmark, Kongens Lyngby, Denmark
Weizhi Meng
Chaohu University, Hefei, China
Yuanyuan Mu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Guo, Y., Chen, Z., Wang, Q., Bao, T., Zhou, Z. (2023). Investigating the Transferability of YOLOv5-Based Water Surface Object Detection Model in Maritime Applications. In: Zhang, H., et al. International Conference on Neural Computing for Advanced Applications. NCAA 2023. Communications in Computer and Information Science, vol 1870. Springer, Singapore. https://doi.org/10.1007/978-981-99-5847-4_8

Download citation

DOI: https://doi.org/10.1007/978-981-99-5847-4_8
Published: 30 August 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-5846-7
Online ISBN: 978-981-99-5847-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Investigating the Transferability of YOLOv5-Based Water Surface Object Detection Model in Maritime Applications