Fast Accurate Fish Recognition with Deep Learning Based on a Domain-Specific Large-Scale Fish Dataset

Lin, Yuan; Chu, Zhaoqi; Korhonen, Jari; Xu, Jiayi; Liu, Xiangrong; Liu, Juan; Liu, Min; Fang, Lvping; Yang, Weidi; Ghose, Debasish; You, Junyong

doi:10.1007/978-3-031-27077-2_40

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 13833)

Included in the following conference series:

International Conference on Multimedia Modeling

Abstract

Fish species recognition is an integral part of sustaining marine biodiversity and aquaculture. Deep learning methods have shown great potential for classification and recognition tasks when trained on large-scale datasets. Nevertheless, practical challenges remain in automating the task, e.g., the lack of methods suited to complicated fish habitats. In addition, most publicly accessible fish datasets are small in scale, low in resolution, imbalanced in their data distributions, or limited in labels and annotations. In this work, we aim to overcome these challenges. First, we construct the OceanFish database, which offers higher image quality and resolution and covers a large and diverse set of marine fish species from the East China Sea. The current version contains 63,622 pictures of 136 fine-grained fish species. Accompanying the dataset, we propose a fish recognition testbed that incorporates two widely applied object detection models based on deep neural networks to exploit the enlarged dataset; the testbed achieves convincing performance in detection precision and speed. The scale and hierarchy of OceanFish can be further enlarged by adding new fish species and annotations. Interested readers may request access to this benchmark dataset and reuse it for their own classification tasks. We hope that the OceanFish database and the fish recognition testbed can serve as a generalized benchmark that motivates further development in related research communities.
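The abstract reports detection precision without stating how it is scored. As a rough illustration only, the sketch below shows one common convention for object detectors: a predicted box counts as correct when its label matches an unmatched ground-truth box and their intersection-over-union (IoU) reaches a threshold. The function names, the 0.5 threshold, and the species labels are assumptions made for this example and are not taken from the paper's evaluation protocol.

```python
from typing import List, Tuple

# A box is (x_min, y_min, x_max, y_max) in pixel coordinates.
Box = Tuple[float, float, float, float]
Detection = Tuple[str, Box]  # (species label, bounding box)


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def precision_at_iou(
    preds: List[Detection],
    truths: List[Detection],
    threshold: float = 0.5,  # assumed threshold, not from the paper
) -> float:
    """Fraction of predictions that match a same-label ground-truth box.

    Each ground-truth box can be matched at most once (greedy matching
    in the order the predictions are given).
    """
    matched = set()
    true_positives = 0
    for label, box in preds:
        best_index, best_score = -1, threshold
        for j, (truth_label, truth_box) in enumerate(truths):
            if j in matched or truth_label != label:
                continue
            score = iou(box, truth_box)
            if score >= best_score:
                best_index, best_score = j, score
        if best_index >= 0:
            matched.add(best_index)
            true_positives += 1
    return true_positives / len(preds) if preds else 0.0


if __name__ == "__main__":
    # Hypothetical labels and boxes, for illustration only.
    truths = [("species_a", (0.0, 0.0, 2.0, 2.0))]
    preds = [
        ("species_a", (0.0, 0.0, 2.0, 2.0)),  # exact match
        ("species_b", (0.0, 0.0, 2.0, 2.0)),  # wrong label
    ]
    print(precision_at_iou(preds, truths))
```

Benchmarks usually go further and average precision over confidence thresholds and classes (mean average precision); the greedy matching above is only the simplest building block of such a metric.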

Author information

Authors and Affiliations

School of Economics, Innovation, and Technology, Kristiania University College, Oslo, Norway
Yuan Lin & Debasish Ghose
School of Aerospace Engineering, Xiamen University, Xiamen, China
Zhaoqi Chu & Juan Liu
School of Natural and Computing Sciences, University of Aberdeen, Aberdeen, UK
Jari Korhonen
School of Information Science and Technology, Xiamen University, Xiamen, China
Jiayi Xu & Xiangrong Liu
School of Ocean and Earth, Xiamen University, Xiamen, China
Min Liu, Lvping Fang & Weidi Yang
Norwegian Research Centre (NORCE), Bergen, Norway
Junyong You

Corresponding author

Correspondence to Juan Liu.

Editor information

Editors and Affiliations

University of Bergen, Bergen, Norway
Duc-Tien Dang-Nguyen
Dublin City University, Dublin, Ireland
Cathal Gurrin
Radboud University Nijmegen, Nijmegen, The Netherlands
Martha Larson
Dublin City University, Dublin, Ireland
Alan F. Smeaton
University of Amsterdam, Amsterdam, The Netherlands
Stevan Rudinac
National Institute of Information and Communications Technology, Tokyo, Japan
Minh-Son Dao
Department of Information Science and Media Studies, University of Bergen, Bergen, Norway
Christoph Trattner
La Trobe University, Melbourne, VIC, Australia
Phoebe Chen

Cite this paper

Lin, Y. et al. (2023). Fast Accurate Fish Recognition with Deep Learning Based on a Domain-Specific Large-Scale Fish Dataset. In: Dang-Nguyen, DT., et al. MultiMedia Modeling. MMM 2023. Lecture Notes in Computer Science, vol 13833. Springer, Cham. https://doi.org/10.1007/978-3-031-27077-2_40

DOI: https://doi.org/10.1007/978-3-031-27077-2_40
Published: 29 March 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-27076-5
Online ISBN: 978-3-031-27077-2
eBook Packages: Computer Science, Computer Science (R0)
