Mask-based generative adversarial networking for crowd counting

Guoxiu Duan; Aichun Zhu; Lu Zhao; Xiaomei Zhu; Fangqiang Hu; Xinjie Guan

doi:10.1117/1.JEI.30.4.043027

28 August 2021 Mask-based generative adversarial networking for crowd counting

Guoxiu Duan, Aichun Zhu, Lu Zhao, Xiaomei Zhu, Fangqiang Hu, Xinjie Guan

Author Affiliations +

Journal of Electronic Imaging, Vol. 30, Issue 4, 043027 (August 2021). https://doi.org/10.1117/1.JEI.30.4.043027

Abstract

Crowd counting is still a challenging task due to the variability of the distance scale, crowd occlusion, and complex background information. However, the deep convolution neural network has been proved to be effective in solving these problems. By loading input images, the network generates predicted density maps, and the average absolute error between the predicted density maps and given ground truth (GT) maps is a solid standard for evaluating the quality of the network. We propose a mask-based generative adversarial network (MBGAN) structure to generate accurate predicted density maps. The network consists of two parts: the generator and the discriminator. In the generator, we embed a fundamental feature extracting module, multiple level dilated convolution blocks, a predicted mask, and shortcut connection operations. The discriminator is mainly used to distinguish whether the density map comes from the generator or the GT and urges the generator to produce the density map that can confuse itself. The training of the proposed MBGAN model is through the joint action of density loss and adversarial loss. In the training strategy, we use the cross training of the generator and discriminator. Through experiments on five available datasets, the MBGAN achieved state-of-the-art performances that outperform other advanced methods.

Citation Download Citation

Guoxiu Duan, Aichun Zhu, Lu Zhao, Xiaomei Zhu, Fangqiang Hu, and Xinjie Guan "Mask-based generative adversarial networking for crowd counting," Journal of Electronic Imaging 30(4), 043027 (28 August 2021). https://doi.org/10.1117/1.JEI.30.4.043027

Received: 17 March 2021; Accepted: 11 August 2021; Published: 28 August 2021

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available

Members: $24.00

Non-members: $28.00 ADD TO CART

JOURNAL ARTICLE
16 PAGES

DOWNLOAD PAPER SAVE TO MY LIBRARY

GET CITATION

RIGHTS & PERMISSIONS

Get copyright permission Get copyright permission on Copyright Marketplace

KEYWORDS

Convolution

Gallium nitride

Head

Performance modeling

Network architectures

Neural networks

Image quality

Show All Keywords

Keywords/Phrases

Search In:

Publication Years