skip to main content
10.1145/3511176.3511191acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicvipConference Proceedingsconference-collections
research-article

Pedestrian Detection Algorithm Based on ViBe and YOLO

Authors Info & Claims
Published:12 March 2022Publication History

ABSTRACT

As more and more monitoring devices are deployed in various cities around the world, the technology of intelligent analysis and processing of video image data based on the computer is becoming more and more mature. This paper adopts an algorithm based on the combination of traditional ViBe and YOLO algorithm to realize the pedestrian detection of internal personnel in the surveillance video. Firstly, ViBe algorithm is used to detect pedestrians once, and some pedestrian frames are selected. Then the pedestrian frames are sent to YOLO network for secondary detection. The second pedestrian detection based on deep learning uses K-means algorithm to complete the clustering of prior frames, and then uses the CSPDarkNet53 network to extract pedestrian features. In order to improve the ability of YOLO small target detection, SPP-Net structure is added to the YOLO model to improve the accuracy of small target detection. The self-built pedestrian dataset used to train and test on the constructed network. The experimental results show that the detection algorithm based on the combination of ViBe and YOLO optimizes the regression of pedestrian boundary frame improves the positioning accuracy of pedestrians.

References

  1. Ren Shaoqing Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.[J]. IEEE transactions on pattern analysis and machine intelligence, 2017, 39(6): 1137-1149.Google ScholarGoogle Scholar
  2. Qike Shao, Lu Li, Yu Zhou, Shihang Yan. Pedestrain detection in videos based on optimization algorithm using sliding window[J].Journal of Zhejiang University of Technology,2015,43(02):212-216..Google ScholarGoogle Scholar
  3. David G. Lowe. Distinctive Image Features from Scale-Invariant Keypoints[J]. International Journal of Computer Vision, 2004, 60(2): 91-110.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Yangyang YE,Chi ZHANG,Xiaoli HAO.ARPNET: attention region proposal network for 3D object detection[J].Science China(Information Sciences),2019,62(12):44-52.Google ScholarGoogle Scholar
  5. Haar-features training parameters analysis in boosting based machine learning for improved face detection[J]. International Journal of Advanced Technology and Engineering Exploration (IJATEE), 2021, 8(80).Google ScholarGoogle Scholar
  6. Yucong Song. Traffic sign recognition based on HOG feature extraction[J]. Journal of Measurements in Engineering, 2021, 9(3): 142-155.Google ScholarGoogle ScholarCross RefCross Ref
  7. Yifei Geng and Geng Yifei and Lu Xiaobo. Vehicle and Driver Detection on Highway Based on Cascade R-CNN[J]. Journal of Physics: Conference Series, 2020, 1575(1) : 012017-.Google ScholarGoogle ScholarCross RefCross Ref
  8. Le Zhang and Jinsong Wang and Zhiyong An. Vehicle recognition algorithm based on Haar-like features and improved Adaboost classifier[J]. Journal of Ambient Intelligence and Humanized Computing, 2021, : 1-9.Google ScholarGoogle ScholarCross RefCross Ref
  9. Jiandong Zhao Detection of crowdedness in bus compartments based on ResNet algorithm and video images[J]. Multimedia Tools and Applications, 2021, : 1-28.Google ScholarGoogle Scholar
  10. Okwuashi Onuwa Deep support vector machine for PolSAR image classification[J]. International Journal of Remote Sensing, 2021, 42(17) : 6498-6536.Google ScholarGoogle ScholarCross RefCross Ref
  11. Multi-scale Pedestrian Detection in Thermal Imaging Using Deep Convolutional Neural Network and Adaptive NMS[J]. Journal of Korean Institute of Information Technology, 2018, 16(9) : 85-94.Google ScholarGoogle ScholarCross RefCross Ref
  12. Hosang J , Benenson R , Schiele B . Learning non-maximum suppression[C]// 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE Computer Society, 2017.Google ScholarGoogle Scholar
  13. Qiaokang Liang Automatic Basketball Detection in Sport Video Based on R-FCN and Soft-NMS[C]. , 2019.Google ScholarGoogle Scholar
  14. Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. Faster r-cnn: Towards real-time object detection with region proposal networks. Proc. Advances in Neural Information Processing Systems, 2015.Google ScholarGoogle Scholar
  15. Fan Yang, Wongun Choi, and Yuanqing Lin. Exploit all the layers: Fast and accurate cnn object detector with scale dependent pooling and cascaded rejection classifiers. Proc. IEEE Conference on Computer Vision and Pattern Recognition, 2016.Google ScholarGoogle ScholarCross RefCross Ref
  16. Tianrui Liu, Mohamed Elmikaty , and Tania Stathaki. Sam-rcnn: Scale-aware multi-resolution multi-channel pedestrian detection. Proc. British Machine Vision Conference, 2018.Google ScholarGoogle Scholar
  17. Mengmeng Xu, Yancheng Bai, Sally Sisi Qu, and Bernard Ghanem. Semantic part rcnn for real-world pedestrian detection. Proc. IEEE Conference on Computer Vision and Pattern Recognition Workshop, 2019.Google ScholarGoogle Scholar
  18. Cao Jiale From Handcrafted to Deep Features for Pedestrian Detection: A Survey.[J]. IEEE transactions on pattern analysis and machine intelligence, 2021, PP.Google ScholarGoogle Scholar
  19. Chen Hui and Sun Shuai. U-YOLO: higher precision YOLOv4[C]. , 2021.Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    ICVIP '21: Proceedings of the 2021 5th International Conference on Video and Image Processing
    December 2021
    219 pages
    ISBN:9781450385893
    DOI:10.1145/3511176

    Copyright © 2021 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 12 March 2022

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited
  • Article Metrics

    • Downloads (Last 12 months)19
    • Downloads (Last 6 weeks)2

    Other Metrics

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format