skip to main content
10.1145/3573428.3573662acmotherconferencesArticle/Chapter ViewAbstractPublication PageseitceConference Proceedingsconference-collections
research-article

A Method of Mask-Wearing Detection Based on Improved YOLOv4

Published:15 March 2023Publication History

ABSTRACT

In this paper, a lightweight network based on improved YOLOv4 is proposed to solve the problems of complex model structure and unsatisfactory performance of detection speed in fast-moving situations. Firstly, the inverted residual blocks (IRB) based on Mobilenetv2 are adopted into the backbone feature extraction network to reduce the complexity of the model structure. Then, the feature fusion network based on the depth-wise separable convolution is applied to minimize model calculations and parameters. Experimental results show that the proposed model has the advantages of high accuracy, fast detection speed, and lightweight, which has satisfied the requirements of real-time detection of mask-wearing in fast-moving situations.

References

  1. CHENG V C C, WONG S C, CHUANG V W M, The role of community-wide wearing of face mask for control of coronavirus disease 2019 (COVID-19) epidemic due to SARS-CoV-2. Journal of Infection, 2020, 81(1), 107-114.Google ScholarGoogle ScholarCross RefCross Ref
  2. GIRSHICK R, DONAHUE J, DARRELL T, Rich feature hierarchies for accurate object detection and semantic segmentatio. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014, 580-587.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. GIRSHICK R. Fast r-cnn. Proceedings of the IEEE International Conference on Computer Vision, 2015, 1440-1448.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. REN S, HE K, GIRSHICK R, Faster r-cnn: Towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2017, 39(6), 1137-1149.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. HE K, GKIOXARI G, DOLLÁR P, Mask r-cnn. Proceedings of the IEEE International Conference on Computer Vision. 2017, 2961-2969.Google ScholarGoogle Scholar
  6. HE K, ZHANG X, REN S, Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE transactions on pattern analysis and machine intelligence, 2015, 37(9), 1904-1916.Google ScholarGoogle Scholar
  7. REDMON J, DIVVALA S, GIRSHICK R, You only look once: Unified, real-time object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016, 779-788.Google ScholarGoogle ScholarCross RefCross Ref
  8. LIU W, ANGUELOV D, ERHAN D, SSD: Single shot multibox detector. Proceedings of the European Conference on Computer Vision, 2016, 21-37.Google ScholarGoogle ScholarCross RefCross Ref
  9. CHAVDA A, DSOUZA J, BADGUJAR S, Multi-stage CNN architecture for face mask detection. Proceedings of the IEEE International Conference for Convergence in Technology, 2021, 1-8.Google ScholarGoogle Scholar
  10. GUPTA P, SHARMA V, VARMA S. A novel algorithm for mask detection and recognizing actions of human. Expert Systems with Applications, 2022, 198, 1-10.Google ScholarGoogle Scholar
  11. WU P, LI H, ZENG N, FMD-Yolo: An efficient face mask detection method for COVID-19 prevention and control in public. Image and Vision Computing, 2022, 117, 104341.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. NAGRATH P, JAIN R, MADAN A, SSDMNV2: A real-time DNN-based face mask detection system using single shot multibox detector and MobileNetV2 [J]. Sustainable cities and society, 2021, 66, 102692.Google ScholarGoogle Scholar
  13. BOCHKOVSKIV A, WANG C Y, LIAO H Y M. Yolov4: Optimal speed and accuracy of object detection. ArXiv Preprint, arXiv:2004.10934, 2020.Google ScholarGoogle Scholar
  14. SANDLER M, HOWARD A, ZHU M, Mobilenetv2: Inverted residuals and linear bottlenecks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018, 4510-4520.Google ScholarGoogle ScholarCross RefCross Ref
  15. HOWARD A G, ZHU M, CHEN B, Mobilenets: Efficient convolutional neural networks for mobile vision applications. ArXiv Preprint, ArXiv:1704.04861, 2017.Google ScholarGoogle Scholar

Index Terms

  1. A Method of Mask-Wearing Detection Based on Improved YOLOv4

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      EITCE '22: Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering
      October 2022
      1999 pages
      ISBN:9781450397148
      DOI:10.1145/3573428

      Copyright © 2022 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 15 March 2023

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited

      Acceptance Rates

      Overall Acceptance Rate508of972submissions,52%
    • Article Metrics

      • Downloads (Last 12 months)12
      • Downloads (Last 6 weeks)0

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format .

    View HTML Format