Abstract
Recognizing the same person across disjoint camera views, known as person re-identification, is an important and challenging problem in computer vision. Recent progress in person re-identification stems from new visual features and models that handle cross-view differences. Existing appearance models rely on conventional visual features, e.g., color histograms, the Scale-Invariant Feature Transform (SIFT), and the Histogram of Oriented Gradients (HOG). In this paper, we propose a new appearance-based method that uses the generative information of local image features and their encoding. In this paradigm, local image features that capture the color and structural cues of human images are first extracted. A Gaussian Mixture Model (GMM) is then learned to approximate the generation process of these features, providing a relatively comprehensive statistical representation. Finally, discriminative feature maps are obtained by computing the Free Energy Score Space (FESS) of the GMM. The resulting feature maps are concatenated and encoded into a fixed-length feature vector for person re-identification. Our approach demonstrates promising performance on challenging datasets. It is also practical: its computational cost is low at both training and test time, and a GMM trained on images captured under different imaging conditions can be applied to other images without significant loss in performance.
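The encoding step can be illustrated with a minimal sketch; it is not the authors' implementation. It assumes a diagonal-covariance GMM whose parameters are already known (here set by hand, whereas in practice they would be learned, e.g., by EM), and it uses the exact posterior over components. Under those assumptions the variational free energy of a set of local features splits into an entropy term, a prior term, and a likelihood term per mixture component, giving a fixed-length vector of size 3K. The function names, the synthetic features, and the ordering of the terms are all hypothetical; the paper's spatial feature maps and final encoding are not described in the abstract and are not reproduced here.

```python
import math
import random


def log_gaussian_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian evaluated at x."""
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )


def free_energy_encoding(features, weights, means, variances):
    """Encode a set of local features as 3K free-energy terms.

    Layout (an assumption of this sketch): K entropy terms, then K prior
    terms, then K likelihood terms, each averaged over the features.
    With the exact posterior, the terms sum to the average negative
    log-likelihood of the features under the GMM.
    """
    k_count = len(weights)
    entropy = [0.0] * k_count
    prior = [0.0] * k_count
    likelihood = [0.0] * k_count
    for x in features:
        log_w = [math.log(w) for w in weights]
        log_n = [log_gaussian_diag(x, m, v) for m, v in zip(means, variances)]
        log_joint = [a + b for a, b in zip(log_w, log_n)]
        # Log-sum-exp yields log p(x) without numerical underflow.
        top = max(log_joint)
        log_px = top + math.log(sum(math.exp(lj - top) for lj in log_joint))
        for k in range(k_count):
            log_q = log_joint[k] - log_px  # log posterior q(k | x)
            q = math.exp(log_q)
            entropy[k] += q * log_q
            prior[k] -= q * log_w[k]
            likelihood[k] -= q * log_n[k]
    n = len(features)
    return [t / n for t in entropy + prior + likelihood]


# Hypothetical usage: 2-D synthetic "local features" and a 2-component GMM.
random.seed(0)
demo_features = [[random.gauss(0, 1), random.gauss(0, 1)] for _ in range(100)]
demo_vector = free_energy_encoding(
    demo_features,
    weights=[0.4, 0.6],
    means=[[0.0, 0.0], [1.0, -1.0]],
    variances=[[1.0, 1.0], [0.5, 2.0]],
)
print(len(demo_vector))  # vector length is 3 * K regardless of feature count
```

Because the vector length depends only on the number of mixture components, images with different numbers of local features map to vectors of the same size, which is what makes the representation usable with a standard classifier or distance metric.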
Acknowledgments
This research was partially supported by funding from China 2011CB302203, NSFC 61375019, and NSFC 61273285.
Cite this article
Zhao, Y., Zhao, X., Luo, R. et al. Person Re-identification by encoding free energy feature maps. Multimed Tools Appl 75, 4795–4813 (2016). https://doi.org/10.1007/s11042-015-2503-y
DOI: https://doi.org/10.1007/s11042-015-2503-y