research-article

Lightweight Improved Based on YOLOv4 Object Detection Algorithm

Authors:

Yuehang LiAuthors Info & Claims

AIPR '22: Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition

Pages 311 - 318

https://doi.org/10.1145/3573942.3574027

Published: 16 May 2023 Publication History

Abstract

To address the problem that the existing object detection network models are large in size and complex in operation and cannot satisfy both detection speed and accuracy under the limited resources and small size platform. Based on YOLOv4 as the benchmark network, a lightweight object detection model LW-YOLO is proposed. Firstly, the backbone feature extraction network is replaced with MobileNetv1, while the number of feature fusion network parameters is significantly reduced by the depth separable convolutional module. Then the BN layer coefficients are used as scaling factors for the importance of the convolutional channels, the scaling factors are sparse using polarization regularization, the errors before and after pruning are reconstructed using least squares and channel weighting methods. The appropriate pruning thresholds are obtained by minimizing the reconstructed errors, the channels with small scaling factor values are eliminated to achieve the lightweight. The experimental results on the VOC (Visual Object Classes) dataset show that the detection accuracy of LW-YOLO is 87.00%, and the FPS(Frames Per Second ) reaches 48.89, which is better than the original YOLOv4 algorithm. It also significantly reduces the number of parameters, computation, and model size, which is more suitable for application in resource-poor embedded mobile devices.

References

[1]

Gupta S, Girshick R, Arbeláez, Pablo Learning rich features from RGB-D images for object detection and segmentation[C]//European conference on computer vision. Springer, Cham, 2014: 345-360.

[2]

Girshick R. Fast R-CNN[C]//Proceedings of the IEEE International Conference on Computer Vision. 2015: 1440-1448.

[3]

Ren S, He K, Girshick R, Faster R-CNN: towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 39(6): 1137-1149.

Digital Library

[4]

Zhang YP, Wu JW, Ma ZG, Compression and implementation of neural network models based on YOLOv3[J]. Micro- and nanoelectronics and smart manufacturing, 2020, 2(01): 79-84.

[5]

Huang J, Rathod V, Sun C, Speed/accuracy trade-offs for modern convolutional object detectors[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. IEEE, 2017: 7310-7311.

[6]

Redmon J, Divvala S, Girshick R, You only look once: unified, real- time object detection[C]//IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 2016: 779-788.

[7]

Zhang ZM, Huo H, Zhao FY. A review of object detection algorithms for deep convolutional neural networks [J]. Small Microcomputer Systems, 2019, 40(9): 1825-1831.

[8]

Li ZM, Peng C, Yu G, Light-Head R-CNN: In defense of two-stage object detector [J]. ArXiv: 1711.07264, 2017.

[9]

Pedoeem J, Huang R. YOLO-LITE:A real-time object detection algorithm optimized for Non-GPU computers [J]. ArXiv:1811.05588, 2018.

[10]

Shao WP, Wang X, Cao ZR, Lightweight convolutional neural network design based on MobileNet and YOLOv3 [J]. Computer Applications, 2020, 40(S1): 8-13.

[11]

He K, Zhang X, Ren S, Spatial pyramid pooling in deep convolutional networks for visual recognition[J]. IEEE transactions on pattern analysis and machine intelligence, 2015, 37(9): 1904-1916.

Digital Library

[12]

Wang W, Xie E, Song X, Efficient and accurate arbitrary-shaped text detection with pixel aggregation network[C]//ICCV 2019: Proceedings of the IEEE/CVF International Conference on Computer Vision. Piscataway: IEEE, 2019: 8440-8449.

[13]

Howard A G, Zhu M, Chen B, MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications[J]. 2017.

[14]

Chollet F. Xception: Deep Learning with Depthwise Separable Convolutions[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition, New York: IEEE Press, 2017: 1800-1807.

[15]

Zhuang T, Zhang Z, Huang Y, Neuron-level Structured Pruning using Polarization Regularizer[C]// NeurIPS 2020. 2020.

[16]

Everingham M, Gool L V, Williams C, The Pascal Visual Object Classes (VOC) Challenge[J]. International Journal of Computer Vision, 2010, 88(2):303-338.

Digital Library

Index Terms

Lightweight Improved Based on YOLOv4 Object Detection Algorithm
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Object detection

Recommendations

Object detection method based on lightweight YOLOv4 and attention mechanism in security scenes
Abstract
Object detection methods based on deep learning generally suffer from problems such as large size and complex structure, which lead to poor performance of mobile robots in security scenes. It must create a trade-off between speed and accuracy. To ...
SlimYOLOv4: lightweight object detector based on YOLOv4
Abstract
Object detection is a valuable but challenging technology in computer vision research. Although existing methods could attain satisfactory results on high-performance computers, but the huge number of network parameters brings great operating ...
Lightweight target detection algorithm based on YOLOv4
Abstract
Aiming at the problem that the model parameters of YOLOv4 algorithm are large and difficult to deploy in edge computing devices, a lightweight target detection algorithm (Light-YOLOv4) is proposed based on YOLOv4 algorithm. The algorithm uses the ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

AIPR '22: Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition

September 2022

1221 pages

ISBN:9781450396899

DOI:10.1145/3573942

Copyright © 2022 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 16 May 2023

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

AIPR 2022

AIPR 2022: 2022 5th International Conference on Artificial Intelligence and Pattern Recognition

September 23 - 25, 2022

Xiamen, China

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
33
Total Downloads

Downloads (Last 12 months)7
Downloads (Last 6 weeks)2

Reflects downloads up to 01 Mar 2025

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Figures

Tables

Media

View Table of Conten