9 January 2021 Adaptive spatial scale person reidentification
Shengyu Pei, Xinyu Fan, Xiaoping Fan, Yongzhou Li
Abstract

Person reidentification (ReID) requires discriminative features of the entire pedestrian image to cope with inaccurate person bounding box detection, background confusion, and occlusion. Many recent person ReID methods attempt to learn features of the entire pedestrian image through part-based feature representations, but this approach is often too time consuming. Person ReID relies on discriminative pedestrian features, and features at different spatial scales are discriminative to differing degrees. We propose an adaptive spatial scale person ReID network model based on a residual neural network (ResNet) with instance-batch normalization. Through experimental visualizations, the pedestrian features extracted by four ResNet layers are analyzed, and two layers with discriminative features are selected. After passing through an adaptive dimension adjustment module, the features at different spatial scales are merged by an aggregation layer. To learn spatial channel correlations effectively and to prevent overfitting, a multilayer distribution normalization processing module is designed, which allows the person ReID network to be trained and evaluated end to end. Compared with other methods, this network model achieved strong performance on four public person ReID datasets and outperforms most current methods.
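The aggregation idea in the abstract (take feature maps from two backbone layers, bring them to a common dimension, and merge them into one descriptor) can be illustrated with a minimal sketch. This is not the authors' implementation. The toy feature-map sizes, the random projection weights standing in for a learned dimension-adjustment layer, and the plain L2 normalization standing in for the paper's multilayer distribution normalization module are all assumptions, and every function name below is hypothetical.

```python
import math
import random


def global_avg_pool(fmap):
    """Collapse a C x H x W feature map (nested lists) into a C-vector."""
    return [sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in fmap]


def make_projection(in_dim, out_dim, rng):
    """Hypothetical stand-in for a learned dimension-adjustment layer:
    a random out_dim x in_dim weight matrix (assumption, not from the paper)."""
    scale = 1.0 / math.sqrt(in_dim)
    return [[rng.uniform(-scale, scale) for _ in range(in_dim)]
            for _ in range(out_dim)]


def project(weights, vec):
    """Matrix-vector product: map vec to the common embedding dimension."""
    return [sum(w * v for w, v in zip(row, vec)) for row in weights]


def l2_normalize(vec, eps=1e-12):
    """Plain L2 normalization (assumed stand-in for the paper's normalization)."""
    norm = math.sqrt(sum(v * v for v in vec)) + eps
    return [v / norm for v in vec]


def aggregate_scales(fmaps, projections):
    """Pool each scale, project it to a shared dimension, normalize it,
    then concatenate all scales into a single unit-length descriptor."""
    parts = []
    for fmap, weights in zip(fmaps, projections):
        pooled = global_avg_pool(fmap)
        parts.extend(l2_normalize(project(weights, pooled)))
    return l2_normalize(parts)


def random_fmap(channels, height, width, rng):
    """Toy feature map; a real one would come from a backbone layer."""
    return [[[rng.random() for _ in range(width)] for _ in range(height)]
            for _ in range(channels)]


rng = random.Random(0)
# Toy stand-ins for two selected layers: the deeper one has more channels
# and a smaller spatial size (sizes are illustrative assumptions).
shallow = random_fmap(8, 8, 4, rng)
deep = random_fmap(16, 4, 2, rng)
projections = [make_projection(8, 6, rng), make_projection(16, 6, rng)]

descriptor = aggregate_scales([shallow, deep], projections)
print(len(descriptor))  # 12: two scales, each projected to 6 dimensions
```

Normalizing each scale before concatenation keeps one layer from dominating the merged descriptor purely because of its activation magnitude. A trained model would learn the projections rather than sample them.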

© 2021 SPIE and IS&T 1017-9909/2021/$28.00
Shengyu Pei, Xinyu Fan, Xiaoping Fan, and Yongzhou Li "Adaptive spatial scale person reidentification," Journal of Electronic Imaging 30(1), 013001 (9 January 2021). https://doi.org/10.1117/1.JEI.30.1.013001
Received: 12 August 2020; Accepted: 16 December 2020; Published: 9 January 2021
KEYWORDS
Visualization
Performance modeling
Data modeling
Cameras
Feature extraction
Lithium
Fluctuations and noise
