skip to main content
10.1145/3573942.3574027acmotherconferencesArticle/Chapter ViewAbstractPublication PagesaiprConference Proceedingsconference-collections
research-article

Lightweight Improved Based on YOLOv4 Object Detection Algorithm

Published: 16 May 2023 Publication History

Abstract

To address the problem that the existing object detection network models are large in size and complex in operation and cannot satisfy both detection speed and accuracy under the limited resources and small size platform. Based on YOLOv4 as the benchmark network, a lightweight object detection model LW-YOLO is proposed. Firstly, the backbone feature extraction network is replaced with MobileNetv1, while the number of feature fusion network parameters is significantly reduced by the depth separable convolutional module. Then the BN layer coefficients are used as scaling factors for the importance of the convolutional channels, the scaling factors are sparse using polarization regularization, the errors before and after pruning are reconstructed using least squares and channel weighting methods. The appropriate pruning thresholds are obtained by minimizing the reconstructed errors, the channels with small scaling factor values are eliminated to achieve the lightweight. The experimental results on the VOC (Visual Object Classes) dataset show that the detection accuracy of LW-YOLO is 87.00%, and the FPS(Frames Per Second ) reaches 48.89, which is better than the original YOLOv4 algorithm. It also significantly reduces the number of parameters, computation, and model size, which is more suitable for application in resource-poor embedded mobile devices.

References

[1]
Gupta S, Girshick R, Arbeláez, Pablo Learning rich features from RGB-D images for object detection and segmentation[C]//European conference on computer vision. Springer, Cham, 2014: 345-360.
[2]
Girshick R. Fast R-CNN[C]//Proceedings of the IEEE International Conference on Computer Vision. 2015: 1440-1448.
[3]
Ren S, He K, Girshick R, Faster R-CNN: towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 39(6): 1137-1149.
[4]
Zhang YP, Wu JW, Ma ZG, Compression and implementation of neural network models based on YOLOv3[J]. Micro- and nanoelectronics and smart manufacturing, 2020, 2(01): 79-84.
[5]
Huang J, Rathod V, Sun C, Speed/accuracy trade-offs for modern convolutional object detectors[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. IEEE, 2017: 7310-7311.
[6]
Redmon J, Divvala S, Girshick R, You only look once: unified, real- time object detection[C]//IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 2016: 779-788.
[7]
Zhang ZM, Huo H, Zhao FY. A review of object detection algorithms for deep convolutional neural networks [J]. Small Microcomputer Systems, 2019, 40(9): 1825-1831.
[8]
Li ZM, Peng C, Yu G, Light-Head R-CNN: In defense of two-stage object detector [J]. ArXiv: 1711.07264, 2017.
[9]
Pedoeem J, Huang R. YOLO-LITE:A real-time object detection algorithm optimized for Non-GPU computers [J]. ArXiv:1811.05588, 2018.
[10]
Shao WP, Wang X, Cao ZR, Lightweight convolutional neural network design based on MobileNet and YOLOv3 [J]. Computer Applications, 2020, 40(S1): 8-13.
[11]
He K, Zhang X, Ren S, Spatial pyramid pooling in deep convolutional networks for visual recognition[J]. IEEE transactions on pattern analysis and machine intelligence, 2015, 37(9): 1904-1916.
[12]
Wang W, Xie E, Song X, Efficient and accurate arbitrary-shaped text detection with pixel aggregation network[C]//ICCV 2019: Proceedings of the IEEE/CVF International Conference on Computer Vision. Piscataway: IEEE, 2019: 8440-8449.
[13]
Howard A G, Zhu M, Chen B, MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications[J]. 2017.
[14]
Chollet F. Xception: Deep Learning with Depthwise Separable Convolutions[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition, New York: IEEE Press, 2017: 1800-1807.
[15]
Zhuang T, Zhang Z, Huang Y, Neuron-level Structured Pruning using Polarization Regularizer[C]// NeurIPS 2020. 2020.
[16]
Everingham M, Gool L V, Williams C, The Pascal Visual Object Classes (VOC) Challenge[J]. International Journal of Computer Vision, 2010, 88(2):303-338.

Index Terms

  1. Lightweight Improved Based on YOLOv4 Object Detection Algorithm

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    AIPR '22: Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition
    September 2022
    1221 pages
    ISBN:9781450396899
    DOI:10.1145/3573942
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 16 May 2023

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. MobileNet
    2. Multi-threshold channel pruning
    3. Polarization regularization
    4. YOLOv4

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    AIPR 2022

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 33
      Total Downloads
    • Downloads (Last 12 months)7
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 01 Mar 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media