skip to main content
10.1145/3386164.3389086acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiscsicConference Proceedingsconference-collections
research-article

Real-time Head Detection for Automated Passenger Counting in Embedded Systems

Authors Info & Claims
Published:06 June 2020Publication History

ABSTRACT

Head detection is a key problem for automated passenger counting systems. In recent decades, considerable effort has been expended to develop an accurate and reliable head detector. However, head detection is still a challenging task because of problems caused by variations in pose and occlusions. Recently, general object detection algorithms based on convolutional neural networks (CNNs), such as Faster R-CNN, SSD and YOLO, have been successful. However, these algorithms require the use of a Graphics Processing Unit (GPU) for real-time performance. In this study, we focused on developing real-time head detection in an embedded system. Starting with the Tiny-YOLOv3 network, we applied the following strategies to achieve real-time performance in a non-GPU environment. First, we reduced the input image size to 224x224. Second, we added an extra yolo layer to detect smaller heads. Third, we removed batch normalization. Finally, we conducted depthwise separable convolution rather than traditional convolution. Three public datasets, HollywoodHeads, SCUT_HEAD, and CrowdHuman, were exploited to train and test the proposed network, and Average Precision (AP) at Intersection over Unit (IoU) = 0.5 were used to evaluate the tests. Experimental results showed that the proposed network perform better and faster than Tiny-YOLOv3.

References

  1. G. Howard, M. Zhu, B. Chen, D. Kalenichenko, W. Wang, T. Weyand, M. Andreetto, and H. Adam (2017). Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861.Google ScholarGoogle Scholar
  2. D. G. Lowe (1999). Object recognition from local scale-invariant features. IEEE International Conference on Computer vision, 1150--1157.Google ScholarGoogle ScholarCross RefCross Ref
  3. D. Peng, Z. Sun, Z. Chen, Z. Cai, L. Xie, and L. Jin (2018). Detecting Heads using Feature Refine Net and Cascaded Multi-scale Architecture. IEEE International Conference on Pattern Recognition. 2528--2533.Conference Name:ACM Woodstock conference Conference Short Name:WOODSTOCK'18Google ScholarGoogle ScholarCross RefCross Ref
  4. G. Chen, X. Cai, H. Han, S. Shan, and X. Chen (2018). HeadNet: pedestrian head detection utilizing body in context. IEEE International Conference on Automatic Face & Gesture Recognition, 556--563.Google ScholarGoogle ScholarCross RefCross Ref
  5. J. Redmon and A. Farhadi (2017). YOLO9000: better, faster, stronger. Proceedings of the IEEE conference on computer vision and pattern recognition, 7263--7271.Google ScholarGoogle ScholarCross RefCross Ref
  6. J. Redmon and A. Farhadi (2018). Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767.Google ScholarGoogle Scholar
  7. J. Redmon, S. Divvala, R. Girshick, and A. Farhadi (2016). You only look once: unified, real-time object detection. Proceedings of the IEEE conference on computer vision and pattern recognition, 779--788.Google ScholarGoogle ScholarCross RefCross Ref
  8. K. He, X. Zhang, S. Ren, and J. Sun (2016). Deep residual learning for image recognition. Proceedings of the IEEE conference on computer vision and pattern recognition, 770--778.Google ScholarGoogle ScholarCross RefCross Ref
  9. M. Saqib, S. D. Khan, N. Sharma, and M. Blumenstein (2018). Person head detection in multiple scales using deep convolutional neural networks. International Joint Conference on Neural Networks, 1--7.Google ScholarGoogle ScholarCross RefCross Ref
  10. N. Dalal and B. Triggs (2005). Histograms of oriented gradients for human detection. IEEE Conference on Computer Vision and Pattern Recognition, 886--893.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. R. Girshick (2015). Fast R-CNN. IEEE International Conference on Computer Vision, 1440--1448.Google ScholarGoogle Scholar
  12. R. Girshick, J. Donahue, T. Darrell, and J. Malik (2014). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE conference on computer vision and pattern recognition, 580--587.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. R. Huang, J. Pedoeem, and C. Chen (2018). YOLO-LITE: A Real-Time Object Detection Algorithm Optimized for Non-GPU Computers. IEEE International Conference on Big Data (Big Data), 2503--2510.Google ScholarGoogle Scholar
  14. S. Ren, K. He, R. Girshick, and J. Sun (2015). Faster R-CNN:Towards real-time object detection with region proposal networks. Advances in neural information processing systems, 91--99.Google ScholarGoogle Scholar
  15. S. Shao, Z. Zhao, B. Li, T. Xiao, G. Yu, X. Zhang, and J. Sun (2018). CrowdHuman: a benchmark for detecting human in a crowd. arXiv preprint arXiv:1805.00123.Google ScholarGoogle Scholar
  16. T. Ahonen, A. Hadid, and M. Pietikainen (2006). Face description with local binary patterns: Application to face recognition. IEEE transactions on pattern analysis and machine intelligence, 28(12), 2037--2041.Google ScholarGoogle Scholar
  17. T. H. Vu, A. Osokin, and I. Laptev (2015). Context-aware CNNs for person head detection. IEEE International Conference on Computer Vision. 2893--2901.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.-Y. Fu, and A. C. Berg (2016). SSD: Single shot multibox detector. European conference on computer vision, 21--37.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Real-time Head Detection for Automated Passenger Counting in Embedded Systems

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      ISCSIC 2019: Proceedings of the 2019 3rd International Symposium on Computer Science and Intelligent Control
      September 2019
      397 pages
      ISBN:9781450376617
      DOI:10.1145/3386164

      Copyright © 2019 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 6 June 2020

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited

      Acceptance Rates

      ISCSIC 2019 Paper Acceptance Rate77of152submissions,51%Overall Acceptance Rate192of401submissions,48%
    • Article Metrics

      • Downloads (Last 12 months)11
      • Downloads (Last 6 weeks)0

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader