Development of outdoor swimmers detection system with small object detection method based on deep learning

Xiao, Hanguang; Li, Yuewei; Xiu, Yu; Xia, Qingling

doi:10.1007/s00530-022-00995-7

Regular Paper
Published: 15 September 2022

Volume 29, pages 323–332, (2023)

Multimedia Systems

Hanguang Xiao¹, Yuewei Li¹ (ORCID: orcid.org/0000-0001-5192-6717), Yu Xiu² & Qingling Xia¹

Abstract

Wild swimming, that is, swimming in prohibited outdoor places, is a major cause of drowning incidents and a key problem in outdoor water safety management. Currently, manual patrols and warning signs are the basic methods that local governments use to manage outdoor water safety and prevent drowning accidents. However, these methods are inefficient, costly, and largely ineffective. To address this problem, a novel object detector for outdoor swimmers was developed via transfer learning, using the Microsoft Common Objects in Context (MS COCO) dataset as the training starting point. The model was then evaluated and retrained so that it can classify swimmers, suspected swimmers, and pedestrians. The total precision and detection time of the proposed swimmer detection approach with small object detection are 99.45% and 43.44 ms, respectively, which are better than those of existing methods and traditional data augmentation methods. We verified the effectiveness of the proposed method on small target detection and designed two prototype hardware systems (a fixed monitoring device and a drone monitoring device) to meet the requirements of stationary and movable detection scenarios; both can efficiently identify and warn of possible wild swimming. This scheme can serve as a reference for other smart city applications that rely on cameras and can be valuable for society.
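
The abstract reports a precision figure for three classes (swimmers, suspected swimmers, and pedestrians) but this page does not describe the evaluation protocol. As a rough illustration only, the sketch below shows one common way detection precision is computed: a predicted box counts as a true positive when it overlaps an unmatched ground-truth box of the same class with an intersection-over-union (IoU) at or above a threshold. The function names, the box format, and the 0.5 threshold are assumptions made for this example, not details taken from the paper.

```python
from typing import List, Tuple

# Assumed box format for this sketch: (x_min, y_min, x_max, y_max).
Box = Tuple[float, float, float, float]
Labeled = Tuple[str, Box]  # (class label, box)


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def precision(detections: List[Labeled],
              ground_truth: List[Labeled],
              iou_threshold: float = 0.5) -> float:
    """Fraction of detections matched to a same-class ground-truth box.

    The 0.5 default threshold is an assumption for illustration.
    Each ground-truth box can be matched at most once.
    """
    matched = set()
    true_positives = 0
    for label, box in detections:
        best_idx, best_iou = -1, 0.0
        for idx, (gt_label, gt_box) in enumerate(ground_truth):
            if idx in matched or gt_label != label:
                continue
            overlap = iou(box, gt_box)
            if overlap > best_iou:
                best_idx, best_iou = idx, overlap
        if best_idx >= 0 and best_iou >= iou_threshold:
            matched.add(best_idx)
            true_positives += 1
    return true_positives / len(detections) if detections else 0.0


# Hypothetical example data, not from the paper.
ground_truth = [("swimmer", (0.0, 0.0, 10.0, 10.0)),
                ("pedestrian", (20.0, 20.0, 30.0, 30.0))]
detections = [("swimmer", (1.0, 1.0, 10.0, 10.0)),        # overlaps the swimmer box
              ("pedestrian", (50.0, 50.0, 60.0, 60.0))]   # overlaps nothing
example_precision = precision(detections, ground_truth)
```

In this toy case one of the two detections is matched, so the computed precision is 0.5. A real evaluation would also sort detections by confidence score and would typically report recall alongside precision.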

Acknowledgements

This work was supported in part by the National Natural Science Foundation of China (Grant nos. 61971078, 61501070) and Chongqing Municipal Education Commission (Grant no. CYS21478).

Author information

Yuewei Li, Yu Xiu and Qingling Xia contributed equally to this work.

Authors and Affiliations

School of Artificial Intelligence, Chongqing University of Technology, Chongqing, 401135, China
Hanguang Xiao, Yuewei Li & Qingling Xia
School of Information and Computer, Anhui Polytechnic University, Anhui, 241000, China
Yu Xiu

Corresponding author

Correspondence to Hanguang Xiao.

Additional information

Communicated by I. Bartolini.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Xiao, H., Li, Y., Xiu, Y. et al. Development of outdoor swimmers detection system with small object detection method based on deep learning. Multimedia Systems 29, 323–332 (2023). https://doi.org/10.1007/s00530-022-00995-7

Received: 15 October 2021
Accepted: 19 August 2022
Published: 15 September 2022
Issue Date: February 2023
DOI: https://doi.org/10.1007/s00530-022-00995-7
