short-paper

Deep People Counting in Extremely Dense Crowds

Authors:

Chuan Wang,

Hua Zhang,

Liang Yang,

Si Liu,

Xiaochun CaoAuthors Info & Claims

MM '15: Proceedings of the 23rd ACM international conference on Multimedia

Pages 1299 - 1302

https://doi.org/10.1145/2733373.2806337

Published: 13 October 2015 Publication History

Get Access

Abstract

People counting in extremely dense crowds is an important step for video surveillance and anomaly warning. The problem becomes especially more challenging due to the lack of training samples, severe occlusions, cluttered scenes and variation of perspective. Existing methods either resort to auxiliary human and face detectors or surrogate by estimating the density of crowds. Most of them rely on hand-crafted features, such as SIFT, HOG etc, and thus are prone to fail when density grows or the training sample is scarce. In this paper we propose an end-to-end deep convolutional neural networks (CNN) regression model for counting people of images in extremely dense crowds. Our method has following characteristics. Firstly, it is a deep model built on CNN to automatically learn effective features for counting. Besides, to weaken influence of background like buildings and trees, we purposely enrich the training data with expanded negative samples whose ground truth counting is set as zero. With these negative samples, the robustness can be enhanced. Extensive experimental results show that our method achieves superior performance than the state-of-the-arts in term of the mean and variance of absolute difference.

References

[1]

O. a. Arandjelovic. Crowd detection from still images. In BMVC, pages 53.1--53, 2008.

Crossref

Google Scholar

[2]

P. F. Felzenszwalb, R. B. Girshick, D. A. McAllester, and D. Ramanan. Object detection with discriminatively trained part-based models. TPAMI, 32(9):1627--1645, 2010.

Digital Library

Google Scholar

[3]

W. Ge and R. Collins. Marked point processes for crowd counting. In CVPR 2009, pages 2913--2920, 2009.

Crossref

Google Scholar

[4]

R. B. Girshick, J. Donahue, T. Darrell, and J. Malik. Rich feature hierarchies for accurate object detection and semantic segmentation. In CVPR 2014, pages 580--587, 2014.

Digital Library

Google Scholar

[5]

K. Grauman and T. Darrell. The pyramid match kernel: Discriminative classification with sets of image features. In ICCV 2005, pages 1458--1465, 2005.

Digital Library

Google Scholar

[6]

H. Idrees, I. Saleemi, C. Seibert, and M. Shah. Multi-source multi-scale counting in extremely dense crowd images. In CVPR 2013, pages 2547--2554, 2013.

Digital Library

Google Scholar

[7]

K. Kang and X. Wang. Fully convolutional neural networks for crowd segmentation. CoRR, abs/1411.4464, 2014.

Google Scholar

[8]

A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In NIPS 2012, pages 1097--1105, 2012.

Google Scholar

[9]

S. Liu, X. Liang, L. Liu, X. Shen, J. Yang, C. Xu, L. Lin, X. Cao, and S. Yan. Matching-cnn meets KNN: quasi-parametric human parsing. CoRR, abs/1504.01220, 2015.

Google Scholar

[10]

J. Sivic and A. Zisserman. Video Google: A text retrieval approach to object matching in videos. In ICCV, pages 1470--1477, 2003.

Digital Library

Google Scholar

Cited By

View all

Guo MChen BYan ZWang YYe Q(2025)Virtual Classification: Modulating Domain-Specific Knowledge for Multidomain Crowd CountingIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2024.335036336:2(2958-2972)Online publication date: Feb-2025
https://doi.org/10.1109/TNNLS.2024.3350363
Gao JHuang ZLei YShan HWang JWang FZhang J(2025)Deep Rank-Consistent Pyramid Model for Enhanced Crowd CountingIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.333677436:1(299-312)Online publication date: Jan-2025
https://doi.org/10.1109/TNNLS.2023.3336774
Wang SWu WLi YXu YLyu Y(2025)MIANet: Bridging the Gap in Crowd Density Estimation With Thermal and RGB InteractionIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2024.347829226:1(254-267)Online publication date: Jan-2025
https://doi.org/10.1109/TITS.2024.3478292
Show More Cited By

Index Terms

Deep People Counting in Extremely Dense Crowds
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
      2. Computer vision tasks
        Scene understanding

Recommendations

A Reliable People Counting System via Multiple Cameras

Reliable and real-time people counting is crucial in many applications. Most previous works can only count moving people from a single camera, which cannot count still people or can fail badly when there is a crowd (i.e., heavy occlusion occurs). In ...
Deep People Counting with Faster R-CNN and Correlation Tracking
ICIMCS'16: Proceedings of the International Conference on Internet Multimedia Computing and Service

Crowd counting is a key problem for many computer vision tasks while most existing methods try to count people based on regression with hand-crafted features. Recently, the fast development of deep learning has resulted in many promising detectors of ...
Real-time people counting for indoor scenes

People counting in indoor environment is a challenging task due to the coexistence of moving crowds with stationary crowds, recurrent occlusions and complex background information. The performance of existing crowd counting methods drops significantly ...

Comments

Information & Contributors

Information

Published In

MM '15: Proceedings of the 23rd ACM international conference on Multimedia

October 2015

1402 pages

ISBN:9781450334594

DOI:10.1145/2733373

General Chairs:
Xiaofang Zhou
The University of Queensland, Australia
,
Alan F. Smeaton
Dublin City University, Ireland
,
Qi Tian
The University of Texas at San Antonio, USA
,
Program Chairs:
Dick C.A. Bulterman
FXPAL, USA
,
Heng Tao Shen
The University of Queensland, Australia
,
Ketan Mayer-Patel
The University of North Carolina, USA
,
Shuicheng Yan
National University of Singapore, Singapore

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 October 2015

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Short-paper

Funding Sources

100 Talents Programme of The Chinese Academy of Sciences
National Natural Science Foundation of China
National Training Programs of Innovation and Entrepreneurship for Undergraduates
the Young Scholars by the Tianjin University of Commerce

Conference

MM '15

Sponsor:

SIGMM

MM '15: ACM Multimedia Conference

October 26 - 30, 2015

Brisbane, Australia

Acceptance Rates

MM '15 Paper Acceptance Rate 56 of 252 submissions, 22%;

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

289
Total Citations
View Citations
1,796
Total Downloads

Downloads (Last 12 months)99
Downloads (Last 6 weeks)14

Reflects downloads up to 17 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Guo MChen BYan ZWang YYe Q(2025)Virtual Classification: Modulating Domain-Specific Knowledge for Multidomain Crowd CountingIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2024.335036336:2(2958-2972)Online publication date: Feb-2025
https://doi.org/10.1109/TNNLS.2024.3350363
Gao JHuang ZLei YShan HWang JWang FZhang J(2025)Deep Rank-Consistent Pyramid Model for Enhanced Crowd CountingIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.333677436:1(299-312)Online publication date: Jan-2025
https://doi.org/10.1109/TNNLS.2023.3336774
Wang SWu WLi YXu YLyu Y(2025)MIANet: Bridging the Gap in Crowd Density Estimation With Thermal and RGB InteractionIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2024.347829226:1(254-267)Online publication date: Jan-2025
https://doi.org/10.1109/TITS.2024.3478292
Wang MZhou XChen Y(2025)A comprehensive survey of crowd density estimation and countingIET Image Processing10.1049/ipr2.1332819:1Online publication date: 27-Jan-2025
https://doi.org/10.1049/ipr2.13328
Hassani SMustapha SLi JMousavi MDackermann U(2025)Next-generation coupled structure-human sensing technology: Enhanced pedestrian-bridge interaction analysis using data fusion and machine learningInformation Fusion10.1016/j.inffus.2025.102983118(102983)Online publication date: Jun-2025
https://doi.org/10.1016/j.inffus.2025.102983
Alharbey RBanjar ASaid YAtri MAbid M(2024)A Human Face Detector for Big Data Analysis of Pilgrim Flow Rates in Hajj and UmrahEngineering, Technology & Applied Science Research10.48084/etasr.666814:1(12861-12868)Online publication date: 8-Feb-2024
https://doi.org/10.48084/etasr.6668
Ding GLiu LChen ZChen CCai JKankanhalli MPrabhakaran BBoll SSubramanian RZheng LSingh VCesar PXie LXu D(2024)Domain-Agnostic Crowd Counting via Uncertainty-Guided Style Diversity AugmentationProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3681310(1642-1651)Online publication date: 28-Oct-2024
https://dl.acm.org/doi/10.1145/3664647.3681310
Zhu JZhao WYao LHe YHu MZhang XWang SLi TLu H(2024)Confusion Region Mining for Crowd CountingIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.331102035:12(18039-18051)Online publication date: Dec-2024
https://doi.org/10.1109/TNNLS.2023.3311020
Chen ZZhang SZheng XZhao XKong Y(2024)Crowd Counting Based on Multiscale Spatial Guided Perception Aggregation NetworkIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.330434835:12(17465-17478)Online publication date: Dec-2024
https://doi.org/10.1109/TNNLS.2023.3304348
Yi JPang YZhou WZhao MZheng F(2024)A Perspective-Embedded Scale-Selection Network for Crowd Counting in Public TransportationIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2023.332800025:5(3420-3432)Online publication date: May-2024
https://doi.org/10.1109/TITS.2023.3328000
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Abstract

References

Cited By

Index Terms

Recommendations

A Reliable People Counting System via Multiple Cameras

Deep People Counting with Faster R-CNN and Correlation Tracking

Real-time people counting for indoor scenes

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Funding Sources

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Share

Share this Publication link

Share on social media

Affiliations