Abstract
Person re-identification, having attracted much attention in the multimedia community, is still challenged by the accuracy and the robustness, as the images for the verification contain such variations as light, pose, noise and ambiguity etc. Such practical challenges require relatively robust and accurate feature learning technologies. We introduced a novel deep neural network with PF-BP(Particle Filter-Back Propagation) to achieve relatively global and robust performances of person re-identification. The local optima in the deep networks themselves are still the main difficulty in the learning, in despite of several advanced approaches. A novel neural network learning, or PF-BP, was first proposed to solve the local optima problem in the non-convex objective function of the deep networks. When considering final deep network to learn using BP, the overall neural network with the particle filter will behave as the PF-BP neural network. Also, a max-min value searching was proposed by considering two assumptions about shapes of the non-convex objective function to learn on. Finally, a salience learning based on the deep neural network with PF-BP was proposed to achieve an advanced person re-identification. We test our neural network learning with particle filter aimed to the non-convex optimization problem, and then evaluate the performances of the proposed system in a person re-identification scenario. Experimental results demonstrate that the corresponding performances of the proposed deep network have promising discriminative capability in comparison with other ones.
Similar content being viewed by others
References
Bengio Y (2009) Learning deep architectures for AI. In: Foundations and Trends in Machine Learning, vol 2, pp 1–127
Bottou L, LeCun Y (2004) Large scale online learning. Proceedings NIPS, 2004
Choe G et al (2016) An advanced association of particle filtering and kernel based object tracking. Multimedia Tools and Applications, 74(18)
Choe G et al (2016) Combined Salience based Person Re-identification. Multimedia Tools and Applications, 75(18)
Cui J et al (2013) Tracking generic human motion via fusion of low-and high-dimensional approaches. IEEE Trans. on Systems, Man and Cybernetics 43(4)
Dahl G et al (2010) Phone recognition with the mean-covariance restricted Boltzmann machine. Proc NIPS 23:469–477
Dahl G et al (2011) Context-dependent DBN-HMMs in large vocabulary continuous speech recognition. Proceedings ICASSP
Dahl G et al (2012) Context-dependent, pre-trained deep neural networks for large voca-bulary speech recognition. IEEE Trans Audio Speech Language Proc 20(1):30–42
Ding S, Lin L, Wang G, Chao H (2015) Deep feature learning with relative distance comparison for person re-identification. Pattern Recogn 18:2993–3003
Farenzena M, Bazzani L, Perina A, Murino V, Cristani M (2010) Person re-identification by symmetry-driven accumulation of local features. In: CVPR. IEEE, pp 2360-2367
Gao Y et al (2017) Semi-supervised sparse representation based classification for face recognition with insufficient labeled samples. IEEE Trans. on Image Processing 26(5)
Gray D, Tao H (2008) Viewpoint in variant pedestrian recognition with an ensemble of localized features. In: ECCV. Springer, pp 262-275
Hinton G et al (2012) Deep Neural Networks for Acoustic Modeling in Speech Recogni-tion. IEEE Signal Proc Mag 29(6):82–97
Krizhevsky A, Sutskever I, Hinton G (2012) Imagenet classification with deep convolutional neural networks. In: NIPS, pp 1097-1105
Layne R, Hospedales T, Gong S (2012) Towards person identification and re-identification with attributes. In: ECCV. Springer, pp 402-412
Li W, Zhao R, Wang X (2013) Human re-identification with transferred metric learning. In: ACCV. Springer, pp 31-44
Li W, Zhao R, Xiao T, Wang X (2014) Deep reid:Deep filter pairing neural network for person re-identification. In: CVPR, pp 152-159
Li Y et al (2018) Large-scale remote sensing image retrieval by deep hashing neural networks. IEEE Trans. on Geoscience and Remote Sensing 56(2)
Li Z, Chang S, Liang F, Huang T, Cao L, Smith J (2013) Learning locally-adaptive decision functions for person verification. In: CVPR. IEEE, pp 3610-3617
Li Z et al (2015) Weakly supervised deep metric learning for community-contributed image retrieval. IEEE Trans. on Image Processing 17(11)
Li Z et al (2017) Weakly supervised deep matrix factorization for social image understanding. IEEE Trans. on Image Processing 26(1)
Lin L, Liu X, Zhu S (2010) Layered graph matching with composite cluster sampling. IEEE Trans Pattern Anal Mach Intell 32(8):1426–1442
Lin L, Wu T, Porway J, Xu Z (2009) A stochastic graph grammar for compositional object representation and recognition. Pattern Recogn 42(7):1297–1307
Lin L, Zhang R, Duan X (2015) Adaptive scene category discovery with generative learning and compositional sampling. IEEE Trans Circuits Syst Video Technol 25(2):251–260
Liu H, Ma B, Qin L, Pang J, Zhang C, Huang Q (2015) Set-label modeling anddeep metric learning on person re-identification. Neurocomputing 151:1283–1292
Liu L et al (2016) Recognizing complex activities by a probabilistic interval-based model. National Conference on Artificial Intelligence(AAAI)
Liu W et al (2017) Multiview dimension reduction via Hessian multiset canonical correlations. Information Fusion. https://doi.org/10.1016/j.inffus.2017.09.001
Liu Y et al (2012) Fusion of low-and high-dimensional approaches by trackers sampling for generic human motion tracking. ICPR pp 898–901
Liu Y et al (2016) From action to activity: sensor-baesd activity recognition. Neurocomputing 181(12)
Ma J et al (2015) Non-rigid visible and infrared face registration via regularized Gaussian fields criterion. Pattern Recogn 48(3)
Martens J (2010) Deep learning with Hessian-free optimization. Proceedings ICML
Mohamed A et al (2010) Investigation of full-sequence training of deep belief networks for speech recognition. Proc. Interspeech
Mohamed A et al (2012) Acoustic Modeling Using Deep Belief Networks. IEEE Trans. Audio, Speech, Language Proc. 20(1)
Ranzato M, Boureau Y, LeCun Y (2007) Sparse Feature Learning for Deep Belief Networks, Proceedings NIPS
Rifai S et al (2011) Contractive autoencoders: Explicit invariance during feature extrac-tion. Proceedings ICML, pp 833–840
Seide F et al (2011) Feature engineering in context-dependent deep neural networks for conversational speech transcription. Proceedings ASRU, pp 24–29
Seide F, Li G, Yu D (2011) Conversational Speech Transcription Using Context-Dependent Deep Neural Networks. Interspeech, pp 437–440
Vincent P et al (2010) Stacked denoising autoencoders: Leaning useful representations in a deep network with a local denoising criterion. J Machine Learning Research 11:3371–3408
Vinyals O, Povey D (2012) Krylov Subspace Descent for Deep Learning. Proceedings AISTAT
Wang J, Leung T, Rosenberg C, Wang J, Philbin J, Chen B, Wu Y et al (2014) Learning fine-grained image similarity with deep ranking. In: CVPR
Wang X, Doretto G, Sebastian T, Rittscher J, Tu P (2007) Shape and appearance context modeling. In: ICCV. IEEE, pp 1-8
Wang Z et al (2016) Zero-shot person re-identification via cross-view consistency. IEEE Trans. on Multimedia 18(2)
Wang Z et al (2017) Statistical inference of Gaussian-Laplace distribution for person verification. ACM Multimedia(MM) pp 1607–1617
Yang X et al (2017) Canonical correlation analysis networks for two-view image recognition. Inf Sci 385-386:338–352
Yu D, Deng L, Li G, Seide F (2011) Discriminative pre-training of deep neural networks U.S. Patent Filing
Zhao R, Ouyang W, Wang X (2013) Unsupervised salience learning for person re-identification. In: CVPR. pp 4321–4328
Zhao Z et al (2017) Remarkable local resampling based on particle filter for visual tracking. Multimedia Tools and Applications, 76(1)
Author information
Authors and Affiliations
Corresponding authors
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Choe, G., Choe, C., Wang, T. et al. Deep learning with particle filter for person re-identification. Multimed Tools Appl 78, 6607–6636 (2019). https://doi.org/10.1007/s11042-018-6415-5
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-018-6415-5