DOI: 10.1145/3430199.3430201
research-article

People Counting Based on Multi-scale Region Adaptive Segmentation and Depth Neural Network

Published: 21 December 2020

ABSTRACT

People counting from surveillance cameras underpins important tasks such as crowd behavior analysis, optimal resource allocation, and public security. To address the low accuracy of people counting methods based on object detection, this paper proposes a people counting method based on multi-scale region adaptive segmentation and a deep neural network. The idea originates from an analysis of multi-scale objects, which found that detection accuracy improves when the sizes of the multi-scale objects match the sizes of the multi-scale anchors. In this method, K-means is first used to cluster the detection results of a Faster-RCNN model. The image is then segmented adaptively according to the clustering results. Finally, the Faster-RCNN model is used to detect people in the segmented images. Experimental results show that the method achieves an average accuracy of 45.78% on the Mall dataset, about 3.59% higher than Faster-RCNN.
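The abstract does not say which features are clustered or how the segmentation regions are derived, so the following is only a minimal sketch of the cluster-then-segment step under stated assumptions. It assumes first-pass detections are given as `(x1, y1, x2, y2)` boxes, clusters them by box height with a plain 1-D K-means, and takes the padded bounding region of each cluster as a crop to pass to the detector again. The names `kmeans_1d` and `regions_from_clusters`, the `margin` parameter, and the example boxes are all hypothetical; the Faster-RCNN passes themselves are not shown.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2), an assumed box format


def kmeans_1d(values: List[float], k: int, iters: int = 50) -> List[int]:
    """Plain Lloyd's K-means on scalars; returns one cluster label per value."""
    if not values:
        return []
    ordered = sorted(values)
    n = len(ordered)
    # Deterministic initialisation: spread the initial centres over the sorted values.
    centres = [ordered[(2 * i + 1) * n // (2 * k)] for i in range(k)]
    labels: List[int] = []
    for _ in range(iters):
        # Assignment step: each value goes to its nearest centre.
        new_labels = [min(range(k), key=lambda c: abs(v - centres[c])) for v in values]
        if new_labels == labels:
            break  # assignments are stable, so the centres will not move either
        labels = new_labels
        # Update step: each centre becomes the mean of its members.
        for c in range(k):
            members = [v for v, lab in zip(values, labels) if lab == c]
            if members:
                centres[c] = sum(members) / len(members)
    return labels


def regions_from_clusters(boxes: List[Box], labels: List[int], k: int,
                          margin: float, width: float, height: float) -> List[Box]:
    """One crop region per non-empty cluster: the padded union of its boxes,
    clipped to the image bounds."""
    regions = []
    for c in range(k):
        members = [b for b, lab in zip(boxes, labels) if lab == c]
        if not members:
            continue
        regions.append((
            max(0.0, min(b[0] for b in members) - margin),
            max(0.0, min(b[1] for b in members) - margin),
            min(width, max(b[2] for b in members) + margin),
            min(height, max(b[3] for b in members) + margin),
        ))
    return regions


# Hypothetical first-pass detections: three small (distant) and two large (near) people.
boxes = [
    (100.0, 50.0, 110.0, 70.0),
    (200.0, 60.0, 212.0, 84.0),
    (300.0, 55.0, 311.0, 77.0),
    (150.0, 300.0, 200.0, 400.0),
    (400.0, 280.0, 455.0, 390.0),
]
heights = [y2 - y1 for (_, y1, _, y2) in boxes]
labels = kmeans_1d(heights, k=2)
regions = regions_from_clusters(boxes, labels, k=2, margin=10.0, width=640.0, height=480.0)
print(labels)
print(regions)
```

In this sketch each returned region could be cropped and rescaled before the second detection pass, so that the people inside it fall closer to the detector's anchor sizes. Whether the paper rescales, overlaps, or merges regions in this way is not stated in the abstract.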


    • Published in

      AIPR '20: Proceedings of the 2020 3rd International Conference on Artificial Intelligence and Pattern Recognition
      June 2020
      250 pages
      ISBN:9781450375511
      DOI:10.1145/3430199

      Copyright © 2020 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

      Publisher

      Association for Computing Machinery

      New York, NY, United States


      Qualifiers

      • research-article
      • Research
      • Refereed limited
