Abstract
Person re-identification (re-id) plays an important role in video surveillance and forensics applications. In many cases, person re-id should be conducted between video clips, i.e., given a query pedestrian video from one camera, the re-id system should retrieve the video clips containing the same person from other cameras. However, person re-id between videos, which we call video-based person re-id, has not been well studied. In this paper, we propose a visual-appearance-level and spatial-temporal-level dictionary learning (VSDL) approach for video-based person re-id. Specifically, we first employ two kinds of models to represent each walking cycle in the video, i.e., visual-appearance features of all frames within the walking cycle, and a spatial-temporal feature vector. By separately learning a visual-appearance-level dictionary and a spatial-temporal-level dictionary from two kinds of representations, each walking cycle can be represented as a coding coefficient. To enhance the discriminative ability of the obtained coding coefficients, we design a representation coefficient discriminant term for VSDL. Experiments on the public iLIDS-VID and PRID 2011 datasets demonstrate the effectiveness of VSDL.
Similar content being viewed by others
References
Ahmed E, Jones M, Marks TK (2015) An improved deep learning architecture for person re-identification. In: IEEE conference on CVPR, pp 3908–3916
Baltieri D, Vezzani R, Cucchiara R (2013) Learning articulated body models for people re-identification. In: ACM conference on multimedia, pp 557–560
Chen D, Yuan Z, Hua G, Zheng N, Wang J (2015) Similarity learning on an explicit polynomial kernel feature map for person re-identification. In: IEEE conference on CVPR
Chen J, Wang Y, Tang YY (2015) Person re-identification by exploiting spatio-temporal cues and multi-view metric learning. IEEE Signal Process Lett 23(7):998–1002
Farenzena M, Bazzani L, Perina A, Murino V, Cristani M (2010) Person re-identification by symmetry-driven accumulation of local features. In: IEEE conference on CVPR, pp 2360–2367
Gangeh MJ, Ghodsi A, Kamel MS (2013) Kernelized supervised dictionary learning. IEEE Trans Signal Process 61(19):4753–4767
Gu S, Zhang L, Zuo W, Feng X (2014) Projective dictionary pair learning for pattern classification. In: NIPS, pp 793–801
Guillaumin M, Verbeek J, Schmid C (2009) Is that you? Metric learning approaches for face identification. In: IEEE conference on ICCV, pp 498–505
Hirzer M, Beleznai C, Roth PM, Bischof H (2011) Person re-identification by descriptive and discriminative classification. In: Proceedings of 17th Scandinavian conference on image analysis (SCIA) 2011, Ystad, Sweden, pp 91–102
Hirzer M, Roth PM, Bischof H (2012) Person re-identification by efficient impostor-based metric learning. In: IEEE conference on AVSS, pp 203–208
Hirzer M, Roth PM, Köstinger M, Bischof H (2012) Relaxed pairwise learned metric for person re-identification. In: ECCV, pp 780–793
Huang DA, Wang YCF (2013) Coupled dictionary and feature space learning with applications to cross-domain image synthesis and recognition. In: IEEE conference on ICCV, pp 2496–2503
Jiang Z, Lin Z, Davis LS (2013) Label consistent k-svd: learning a discriminative dictionary for recognition. IEEE Trans Pattern Anal Mach Intell 35(11):2651–2664
Jing XY, Zhu X, Wu F, You X, Liu Q, Yue D, Hu R, Xu B (2015) Super-resolution person re-identification with semi-coupled low-rank discriminant dictionary learning. In: IEEE conference on CVPR, pp 695–704
Kostinger M, Hirzer M, Wohlhart P, Roth PM, Bischof H (2012) Large scale metric learning from equivalence constraints. In: IEEE conference on CVPR, pp 2288–2295
Li A, Liu L, Wang K, Liu S, Yan S (2015) Clothing attributes assisted person re-identification. IEEE Trans Circuits Syst Video Technol 25(5):869–878
Li S, Shao M, Fu Y (2015) Cross-view projective dictionary learning for person re-identification. In: IJCAI, pp 2155–2161
Li W, Wang X (2013) Locally aligned feature transforms across views. In: IEEE conference on CVPR, pp 3594–3601
Lisanti G, Masi I, Bagdanov A, Del Bimbo A (2015) Person re-identification by iterative re-weighted sparse ranking. IEEE Trans Pattern Anal Mach Intell 37(8):1629–1642
Liu C, Loy CC, Gong S, Wang G (2013) Pop: person re-identification post-rank optimisation. In: IEEE conference on ICCV, pp 441–448
Liu H, Ma B, Qin L, Pang J, Zhang C, Huang Q (2015) Set-label modeling and deep metric learning on person re-identification. Neurocomputing 151:1283–1292
Liu K, Ma B, Zhang W, Huang R (2015) A spatio-temporal appearance representation for viceo-based pedestrian re-identification. In: ICCV, pp 3810–3818
Liu X, Song M, Tao D, Zhou X, Chen C, Bu J (2014) Semi-supervised coupled dictionary learning for person re-identification. In: IEEE conference on CVPR, pp 3550–3557
Ma AJ, Yuen PC, Li J (2013) Domain transfer support vector ranking for person re-identification without target camera label information. In: IEEE conference on ICCV, pp 3567–3574
Mignon A, Jurie F (2012) Pcca: A new approach for distance learning from sparse pairwise constraints. In: IEEE conference on CVPR, pp 2666–2672
Pedagadi S, Orwell J, Velastin S, Boghossian B (2013) Local fisher discriminant analysis for pedestrian re-identification. In: IEEE conference on CVPR, pp 3318–3325
Prosser B, Zheng WS, Gong S, Xiang T, Mary Q (2010) Person re-identification by support vector ranking. In: BMVC, pp 1–11
Qiu Q, Ni J, Chellappa R (2014) Dictionary-based domain adaptation methods for the re-identification of faces. In: Person re-Identification. Springer, London, pp 269–285
Tao D, Jin L, Wang Y, Yuan Y, Li X (2013) Person re-identification by regularized smoothing kiss metric learning. IEEE Trans Circuits Syst Video Technol 23(10):1675–1685
Wang T, Gong S, Zhu X, Wang S (2014) Person re-identification by video ranking. In: ECCV, pp 688–703
Wang T, Gong S, Zhu X, Wang S (2016) Person re-identification by discriminative selection in video ranking. IEEE Trans Pattern Anal Mach Intell. https://doi.org/10.1109/TPAMI.2016.2522418
Wang Z, Hu R, Liang C, Jiang J, Sun K, Leng Q, Huang B (2015) Person re-identification using data-driven metric adaptation. In: MultiMedia modeling—21st international conference, MMM 2015, Sydney, NSW, Australia, 5–7 Jan 2015, proceedings, part II, pp 195–207
Wang Z, Hu R, Liang C, Yu Y, Jiang J, Ye M, Chen J, Leng Q (2016) Zero-shot person re-identification via cross-view consistency. IEEE Trans Multimed 18(2):260–272
Wang Z, Hu R, Liang C, Yu Y, Jiang J, Ye M, Chen J, Leng Q (2017) Noise robust face image super-resolution through smooth sparse representation. IEEE Trans Cybern. https://doi.org/10.1109/TCYB.2016.2594184
Wang Z, Hu R, Yu Y, Jiang J, Liang C, Wang J (2016) Scale-adaptive low-resolution person re-identification via learning a discriminating surface. In: Proceedings of the twenty-fifth international joint conference on artificial intelligence, IJCAI 2016, New York, NY, 9–15 July 2016, pp 2669–2675
Weinberger K, Saul L (2009) Distance metric learning for large margin nearest neighbor classification. J Mach Learn Res 10:207–244
Wu Z, Li Y, Radke RJ (2015) Viewpoint invariant human re-identification in camera networks using pose priors and subject-discriminative features. IEEE Trans Pattern Anal Mach Intell 37(5):1095–1108
Yang J, Wright J, Huang TS, Ma Y (2010) Image super-resolution via sparse representation. IEEE Trans Image Process 19(11):2861–2873
Yang M, Zhang D, Feng X (2016) Fisher discrimination dictionary learning for sparse representation. In: IEEE conference on ICCV, pp 543–550
Yang M, Zhang L, Feng X, Zhang D (2011) Fisher discrimination dictionary learning for sparse representation. In: ICCV, pp 543–550
Yang Y, Yang J, Yan J, Liao S, Yi D, Li SZ (2014) Salient color names for person re-identification. In: ECCV, pp 536–551
You J, Wu A, Li X, Zheng WS (2016) Top-push video-based person re-identification. arXiv preprint arXiv:1604.08683
Zhang H, Cao X, Ho JKL, Chow TWS (2017) Object-level video advertising: an optimization framework. IEEE Trans Ind Inform 13(2):520–531
Zhao R, Ouyang W, Wang X (2013) Person re-identification by salience matching. In: IEEE conference on ICCV, pp 2528–2535
Zhao R, Ouyang W, Wang X (2013) Unsupervised salience learning for person re-identification. In: IEEE conference on CVPR, pp 3586–3593
Zhao R, Ouyang W, Wang X (2014) Learning mid-level filters for person re-identification. In: IEEE conference on CVPR, pp 144–151
Zheng L, Shen L, Tian L, Wang S, Wang J, Bu J, Tian Q (2015) Scalable person re-identification: a benchmark. In: IEEE conference on ICCV, pp 1116–1124
Zheng WS, Gong S, Xiang T (2013) Reidentification by relative distance comparison. IEEE Trans Pattern Anal Mach Intell 35(3):653–668
Zhu X, Jing XY, Wu F, Feng H (2016) Video-based person re-identification by simultaneously learning intra-video and inter-video distance metrics. In: IJCAI, pp 3552–3559
Zhuang Y, Wang YF, Wu F, Zhang Y, Lu W (2013) Supervised coupled dictionary learning with group structures for multi-modal retrieval. In: AAAI, pp 1070–1076
Acknowledgements
The authors would like to thank the editors and anonymous reviewers for their constructive comments and suggestions. This work was supported by the National Key Research and Development Program of China under Grant No. 2017YFB0202001, NSFC Key Project of General Technology Fundamental Research United Fund No. U1736211, the National Nature Science Foundation of China under Grant Nos. 61672208, U1504611, 41571417, the Science and Technique Development Program of Henan under Grant Nos. 172102210186, 182102311066, the Medical Education Research Project of Henan No. Wjlx2016095.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that there is no conflict of interests regarding the publication of this paper.
Rights and permissions
About this article
Cite this article
Zhu, X., Jing, XY., Ma, F. et al. Simultaneous visual-appearance-level and spatial-temporal-level dictionary learning for video-based person re-identification. Neural Comput & Applic 31, 7303–7315 (2019). https://doi.org/10.1007/s00521-018-3529-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-018-3529-7