People Counting Based on Multi-scale Region Adaptive Segmentation and Depth Neural Network

Authors:
Feng Min, Hubei Province Key Laboratory of Intelligent Robot, Wuhan Institute of Technology, CHN
Yansong Wang, Hubei Province Key Laboratory of Intelligent Robot, Wuhan Institute of Technology, CHN
Sicheng Zhu, Hubei Province Key Laboratory of Intelligent Robot, Wuhan Institute of Technology, CHN

AIPR '20: Proceedings of the 2020 3rd International Conference on Artificial Intelligence and Pattern Recognition, June 2020, Pages 79–83, https://doi.org/10.1145/3430199.3430201

Published: 21 December 2020

ABSTRACT

People counting from surveillance cameras underpins important tasks such as crowd behavior analysis, optimal resource allocation, and public security. To address the low accuracy of people counting methods based on object detection, this paper proposes a people counting method based on multi-scale region adaptive segmentation and a deep neural network. The idea originates from an analysis of multi-scale objects, which found that detection accuracy improves when the sizes of the multi-scale objects match the sizes of the multi-scale anchors. In this method, K-means is first used to cluster the detection results of a Faster-RCNN model. The image is then segmented adaptively according to the clustering results. Finally, the Faster-RCNN model is run on the segmented images. Experimental results show that the average accuracy of this method on the Mall dataset is 45.78%, about 3.59% higher than that of Faster-RCNN.
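The cluster-then-segment step described above can be illustrated with a minimal sketch. The abstract does not say which feature of the detections is clustered or how the regions are cut, so the choices here are assumptions made only for illustration: boxes are clustered by their pixel height with a plain 1-D K-means, and each cluster yields one padded rectangle that could be cropped and passed back to the detector. The names `kmeans_1d`, `adaptive_regions`, and `margin` are hypothetical, and the Faster-RCNN passes themselves are not shown; the boxes are taken as given.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2) in pixels


def kmeans_1d(values: List[float], k: int, iters: int = 50) -> List[int]:
    """Plain Lloyd's k-means on scalars; returns one cluster label per value."""
    if not 1 <= k <= len(values):
        raise ValueError("k must be between 1 and len(values)")
    ordered = sorted(values)
    # Deterministic initialisation: spread the centres over the sorted values.
    centres = [ordered[(2 * i + 1) * len(ordered) // (2 * k)] for i in range(k)]
    labels = None
    for _ in range(iters):
        # Assignment step: nearest centre for every value.
        new_labels = [
            min(range(k), key=lambda c: abs(v - centres[c])) for v in values
        ]
        # Update step: move each centre to the mean of its members.
        for c in range(k):
            members = [v for v, lab in zip(values, new_labels) if lab == c]
            if members:  # an empty cluster keeps its previous centre
                centres[c] = sum(members) / len(members)
        if new_labels == labels:
            break
        labels = new_labels
    return labels


def adaptive_regions(
    boxes: List[Box], k: int, image_size: Tuple[int, int], margin: float = 0.5
) -> List[Box]:
    """Assumed scheme: one padded crop rectangle per box-height cluster."""
    width, height = image_size
    labels = kmeans_1d([y2 - y1 for (_, y1, _, y2) in boxes], k)
    regions = []
    for c in range(k):
        group = [b for b, lab in zip(boxes, labels) if lab == c]
        if not group:
            continue
        # Pad by a fraction of the cluster's mean box height, clipped to the image.
        pad = margin * sum(b[3] - b[1] for b in group) / len(group)
        regions.append((
            max(0.0, min(b[0] for b in group) - pad),
            max(0.0, min(b[1] for b in group) - pad),
            min(float(width), max(b[2] for b in group) + pad),
            min(float(height), max(b[3] for b in group) + pad),
        ))
    return regions
```

In a full pipeline each returned rectangle would be cropped from the frame, possibly rescaled so that the people in it are closer to the detector's anchor sizes, and detected again; how the per-region detections are merged into a single count is left out of this sketch.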

Index Terms

Computing methodologies > Machine learning > Machine learning algorithms

Published in

AIPR '20: Proceedings of the 2020 3rd International Conference on Artificial Intelligence and Pattern Recognition, June 2020, 250 pages
ISBN: 9781450375511
DOI: 10.1145/3430199

Copyright © 2020 ACM
Publisher: Association for Computing Machinery, New York, NY, United States
Author Tags: Faster-RCNN, K-means algorithm, Multi-scale, Object detection, People counting
Qualifiers: research-article, Research, Refereed limited