Abstract
Fish species recognition is an integral part of sustainable marine biodiversity and aquaculture. The rapid emergence of deep learning methods has shown great potential on classification and recognition tasks when trained on a large scale dataset. Nevertheless, some practical challenges remain for automating the task, e.g., the lack of appropriate methods applied to a complicated fish habitat. In addition, most publicly accessible fish datasets have small-scale and low resolution, imbalanced data distributions, or limited labels and annotations, etc. In this work, we aim to overcome the aforementioned challenges. First, we construct the OceanFish database with higher image quality and resolution that covers a large scale and diversity of marine-domain fish species in East China sea. The current version covers 63, 622 pictures of 136 fine-grained fish species. Accompanying the dataset, we propose a fish recognition testbed by incorporating two widely applied deep neural network based object detection models to exploit the facility of the enlarged dataset, which achieves a convincing performance in detection precision and speed. The scale and hierarchy of OceanFish can be further enlarged by enrolling new fish species and annotations. Interested readers may ask for access and re-use this benchmark datasets for their own classification tasks upon inquiries. We hope that the OceanFish database and the fish recognition testbed can serve as a generalized benchmark that motivates further development in related research communities.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Goulletque, P., et al.: The importance of marine biodiversity. Biodiversity in the Marine Environment, pp 1–13 (2014)
Yi-Haur, S., et al.: Fish observation, detection, recognition and verfication in the real world. In: Proceedings of the International Conference on Image Processing, Computer Vision, and Pattern Recognition(IPCV), p. 1, (2012)
Katy, B., et al.: Fish species recognition from video using SVM classifier. In: Proceedings of the 3rd ACM International Workshop on Multimedia Analysis for Ecological Data, pp. 1–6 (2014)
Mehdi, R., et al.: Automated fish detection in underwater images using shape based level sets. Photogram. Record. 30(149), 46–62 (2015)
Qin, H.W., et al.: DeepFish: accurate underwater live fish recognition with a deep architecture. Neurocomputing 187, 49–58 (2016)
Tamou, A.B., et al.: Underwater live fish recognition by deep learning. In: International Conference on Image and Signal Processing, pp. 275–283 (2018)
Krizhevsky, A., et al.: ImageNet classification with deep convolutional neural networks. In: Proceedings of the 25th International Conference on Neural Information Processing Systems, vol. 1, pp. 1097–1105 (2012)
https://www.kaggle.com/c/the-nature-conservancy-fisheries-monitoring. Kaggle Competition. The Nature Conservancy Fisheries Monitoring (2017)
Deng, J., et al.: ImageNet: a large-scale hierarchical image database. In: IEEE Conference on Computer Vision and Pattern Recognition, 2009. CVPR (2009)
Lin, T.-Y., et al.: Microsoft COCO: common objects in context. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8693, pp. 740–755. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1_48
Everingham, M., et al.: The PASCAL visual object classes (VOC) challenge. Int. J. Comput. Vis. 88(2), 303–338 (2010). https://doi.org/10.1007/s11263-009-0275-4
Torralba, A., et al.: 80 million tiny images: a large dataset for nonparametric object and scene recognition. IEEE Trans. Pattern Anal. Mach. Intell. 30(11), 1958–1970 (2008)
Ahn, L.V., et al.: Labeling images with a computer game. In: CHI04 Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 319–326 (2004)
Fisher, R. et al.: Overview of the Fish4Knowledge project. In: Fish4Knowledge: Collecting and Analyzing Massive Coral Reef Fish Video Data, pp. 1–17 (2016)
Alexis, J., et al.: Life CLEF 2015: multimedia life species identification challenges. In: Experimental IR Meets Multilinguality, Multimodality, and Interaction. Springer International Publishing, pp. 462–483 (2015)
Anantharajah, K., et al.: Local inter-session variability modelling for object classification. In: IEEE Winter Conference on Applications of Computer Vision, pp 309–316 (2014)
Saleh, A., et al.: A realistic fish-habitat dataset to evaluate algorithms for underwater visual analysis. Sci. Rep. 10, 14671 (2020)
J. Key, et al.: The fishnet open images database: a dataset for fish detection and fine-grained categorization in fisheries. In: 8th Workshop on Fine-Grained Visual Categorization at CVPR (2021)
Girshick, R., et al.: Feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, vol. 2014, pp. 580–587 (2014)
Girshick, R.: Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, vol. 2015, pp. 1440–1448 (2015)
Ren, S.Q., et al.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems, vol. 2015, pp. 91–99 (2015)
Redmon, J., et al.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016)
Liu, W., et al.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2
Villon, S., et al.: Coral reef fish detection and recognition in underwater videos by supervised machine learning: comparison between deep learning and HOG \(+\) SVM methods. In: International Conference on Advanced Concepts for Intelligent Vision Systems, ACIVS, pp. 160–171 (2016)
Deng, J., et al.: http://www.image-net.org/challenges/LSVRC/2012/. In: ILSVRC-2012 (2012)
Takahashi, R., Matsubara, T.: Data augmentation using random image cropping and patching for deep CNNs. In: arXiv (2018)
He, K., et al.: Deep residual learning for image recognition. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR2016), pp. 770–778 (2016)
Krizhevsky, A.: Learning multiple layers of features from tiny images. Technical report, University of Toronto, pp. 1–60 (2009)
Simonyan, K., et al.: Very deep convolutional networks for large-scale image recognition. In: arXiv:1409.1556 (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Lin, Y. et al. (2023). Fast Accurate Fish Recognition with Deep Learning Based on a Domain-Specific Large-Scale Fish Dataset. In: Dang-Nguyen, DT., et al. MultiMedia Modeling. MMM 2023. Lecture Notes in Computer Science, vol 13833. Springer, Cham. https://doi.org/10.1007/978-3-031-27077-2_40
Download citation
DOI: https://doi.org/10.1007/978-3-031-27077-2_40
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-27076-5
Online ISBN: 978-3-031-27077-2
eBook Packages: Computer ScienceComputer Science (R0)