skip to main content
10.1145/3372278.3390710acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
research-article

YOLO-mini-tiger: Amur Tiger Detection

Published: 08 June 2020 Publication History

Abstract

In this paper, we present our solution for tiger detection in the 2019 Computer Vision for Wildlife Conservation Challenge (CVWC2019). We introduce an efficient deep tiger detector, which consists of the convnet channel adaptation method and an improved tiger detection method based on You Only Look Once version 3 (YOLOv3). Considering the limited memory and computing power of tiny embedded devices, we have used EfficientNet-B0 and Darknet-53 as backbone networks for detection and adapted them to balance their depth and width inspired by the channel pruning method and knowledge distillation method. Our results show that after an architecture adjustment of Darknet-53, the floating-point computation decreases by 93%, its model size decreases by 97%, and its accuracy only decreases by 1%; after an architecture adjustment of EfficientNet-B0, the floating-point computation decreases by 66%, its model size decreases by 70% with its accuracy only decreased by 1%. We also compare GIoU loss and MSE loss in the training stage. The GIoU loss has the advantage that it increases the average AP for IoU from 0.5 to 0.95 without affecting training speed and the interface speed, so it is experimentally reasonable for tiger detection in the wild. This proposed method outperforms previous Amur tiger detection methods presented at CVWC2019.

References

[1]
Asner G P, Martin R E, Knapp D E, et al. Airborne laser-guided imaging spectroscopy to map forest trait diversity and guide conservation[J]. Science, 2017, 355(6323): 385--389.
[2]
Li S, Li J, Lin W, et al. Amur Tiger Re-identification in the Wild[J]. arXiv preprint arXiv:1906.05586, 2019.
[3]
Liu Z, Li J, Shen Z, et al. Learning efficient convolutional networks through network slimming[C]//Proceedings of the IEEE International Conference on Computer Vision. 2017: 2736--2744.
[4]
Ye J, Lu X, Lin Z, et al. Rethinking the smaller-norm-less-informative assumption in channel pruning of convolution layers[J]. arXiv preprint arXiv:1802.00124, 2018.
[5]
Zhang P, Zhong Y, Li X. SlimYOLOv3: Narrower, Faster and Better for Real-Time UAV Applications[C]//Proceedings of the IEEE International Conference on Computer Vision Workshops. 2019: 0-0.
[6]
Hinton G, Vinyals O, Dean J. Distilling the knowledge in a neural network[J]. arXiv preprint arXiv:1503.02531, 2015.
[7]
Fukuda T, Suzuki M, Kurata G, et al. Efficient Knowledge Distillation from an Ensemble of Teachers[C]//Interspeech. 2017: 3697--3701.
[8]
Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition[J]. arXiv preprint arXiv:1409.1556, 2014.
[9]
He K, Zhang X, Ren S, et al. Deep residual learning for image recognition[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2016: 770--778.
[10]
Huang G, Liu Z, Van Der Maaten L, et al. Densely connected convolutional networks[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2017: 4700--4708.
[11]
Tan M, Le Q V. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks[J]. arXiv preprint arXiv:1905.11946, 2019.
[12]
Redmon J, Farhadi A. Yolov3: An incremental improvement[J]. arXiv preprint arXiv:1804.02767, 2018.
[13]
Redmon J, Farhadi A. YOLO9000: better, faster, stronger[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2017: 7263--7271.
[14]
Redmon J, Divvala S, Girshick R, et al. You only look once: Unified, real-time object detection[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2016: 779--788.
[15]
He K, Zhang X, Ren S, et al. Spatial pyramid pooling in deep convolutional networks for visual recognition[J]. IEEE transactions on pattern analysis and machine intelligence, 2015, 37(9): 1904--1916.
[16]
Hu J, Shen L, Sun G. Squeeze-and-excitation networks[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2018: 7132--7141.
[17]
Krizhevsky A, Sutskever I, Hinton G E. Imagenet classification with deep convolutional neural networks[C]//Advances in neural information processing systems. 2012: 1097--1105.
[18]
Rezatofighi H, Tsoi N, Gwak J Y, et al. Generalized intersection over union: A metric and a loss for bounding box regression[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2019: 658--666.
[19]
Lin T Y, Dollár P, Girshick R, et al. Feature pyramid networks for object detection[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2017: 2117--2125.
[20]
Golik P, Doetsch P, Ney H. Cross-entropy vs. squared error training: a theoretical and experimental comparison[C]//Interspeech. 2013, 13: 1756--1760.
[21]
Liu W, Anguelov D, Erhan D, et al. Ssd: Single shot multibox detector[C]//European conference on computer vision. Springer, Cham, 2016: 21--37.
[22]
Howard A G, Zhu M, Chen B, et al. Mobilenets: Efficient convolutional neural networks for mobile vision applications[J]. arXiv preprint arXiv:1704.04861, 2017.
[23]
Sandler M, Howard A, Zhu M, et al. Mobilenetv2: Inverted residuals and linear bottlenecks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2018: 4510--4520.
[24]
Shen Z, Liu Z, Li J, et al. Dsod: Learning deeply supervised object detectors from scratch[C]//Proceedings of the IEEE International Conference on Computer Vision. 2017: 1919--1927.
[25]
Schaetti N. Unine at clef 2018: Character-based convolutional neural network and resnet18 for twitter author profiling[C]//Proceedings of the Ninth International Conference of the CLEF Association (CLEF 2018). 2018.
[26]
Goyal P, Dollár P, Girshick R, et al. Accurate, large minibatch sgd: Training imagenet in 1 hour[J]. arXiv preprint arXiv:1706.02677, 2017.
[27]
Li Y, Li J, Lin W, et al. Tiny-dsod: Lightweight object detection for resource-restricted usages[J]. arXiv preprint arXiv:1807.11013, 2018.

Cited By

View all
  • (2024)Evaluation of Automated Object-Detection Algorithms for Koala Detection in Infrared Aerial ImagerySensors10.3390/s2421704824:21(7048)Online publication date: 31-Oct-2024
  • (2023) AF‐TigerNet: A lightweight anchor‐free network for real‐time Amur tiger ( Panthera tigris altaica ) detection Wildlife Letters10.1002/wll2.120081:1(32-41)Online publication date: 20-Apr-2023

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ICMR '20: Proceedings of the 2020 International Conference on Multimedia Retrieval
June 2020
605 pages
ISBN:9781450370875
DOI:10.1145/3372278
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 June 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. YOLO
  2. architecture adjustment
  3. model slimming
  4. tiger detection

Qualifiers

  • Research-article

Funding Sources

  • the National Natural Science Foundation of China
  • the Key Project of the Education Commission of Beijing Municipal
  • the project of the Education Commission of Beijing Municipal

Conference

ICMR '20
Sponsor:

Acceptance Rates

Overall Acceptance Rate 254 of 830 submissions, 31%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)14
  • Downloads (Last 6 weeks)1
Reflects downloads up to 28 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Evaluation of Automated Object-Detection Algorithms for Koala Detection in Infrared Aerial ImagerySensors10.3390/s2421704824:21(7048)Online publication date: 31-Oct-2024
  • (2023) AF‐TigerNet: A lightweight anchor‐free network for real‐time Amur tiger ( Panthera tigris altaica ) detection Wildlife Letters10.1002/wll2.120081:1(32-41)Online publication date: 20-Apr-2023

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media