Abstract
Searching for specific persons in surveillance videos captured by different cameras, known as person re-identification, is a key yet under-addressed challenge. Difficulties arise from large variations in human appearance across poses and from differences between the camera views involved, both of which make low-level descriptor representations unreliable. In this paper, we propose a novel Sparse Representations based Distributed Attribute Learning model (SRDAL) that encodes targets into semantic topics. Compared with other models such as ELF, our model achieves the best performance by imposing semantic restrictions on the generation of human-specific attributes and by employing a deep convolutional neural network to generate features, without supervision, for the attribute learning model. Experimental results show that our method achieves state-of-the-art performance.
















Acknowledgements
This research is supported by the Natural Science Foundation of China (Nos. 61602215 and 61672268), the Science Foundation of Jiangsu Province (Nos. BK20150527 and BE2015137), the Science Foundation of Zhenjiang City (No. SH2014017), the Scientific Research Funds for Senior Talents of Jiangsu University (No. 15JDG180), the China State Scholarship Fund (No. 201608320098), and the International Postdoctoral Exchange Fellowship Program (No. 201653).
Cite this article
Cheng, K., Hui, K., Zhan, Y. et al. Sparse representations based distributed attribute learning for person re-identification. Multimed Tools Appl 76, 25015–25037 (2017). https://doi.org/10.1007/s11042-017-4967-4
DOI: https://doi.org/10.1007/s11042-017-4967-4