Abstract
Attributes are expected to narrow down the semantic gap between low-level visual features and high-level semantic meanings. Such superiority motivates us to explore pedestrian attributes which has became a critical problem to boost image understanding and improve the performance of pedestrian detection, retrieval, re-identification, etc. Based on the PETA dateset, we manually relabel two subset VIPeR and PRID as our experimental dataset. Moreover, we proposed an evaluation protocol for researchers to evaluate pedestrian attribute classification algorithms. In this paper, we utilized two baseline methods to to demonstrate the performance of the attribute in pedestrian detection. The first one directly uses color and texture features to train Support Vector Machine (SVM) classification while the other one uses DSIFT (Dense SIFT) with Bag-of-Visual-Words (BoVW) to train SVM classification. Finally, we report and discuss the baseline performance on the database following the proposed evaluation protocol.















Similar content being viewed by others
Notes
The Y channel of YCbCr is selected as the luminance channel.
References
Bosch A, Zisserman A, Muoz X (2008) Scene classification using a hybrid generative/discriminative approach. Pattern Anal Machine Intelligence:712–727
Bossard L, Dantone M, Leistner C, Wengert C, Quack T, Van Gool L (2013) Apparel classification with style. In: Asian conference on computer vision, pp 321–335
Bourdev L, Malik J (2009) Poselets: body part detectors trained using 3d human pose annotations. In: Computer vision international conference, pp 1365–1372
Bourdev L, Maji S, Brox T, Malik J (2010) Detecting people using mutually consistent poselet activations. In: European conference on computer vision, pp 168–181
Bourdev L, Maji S, Malik J (2011) Describing people: a poselet-based approach to attribute classification. In: IEEE international conference on computer vision, pp 1543–1550
Chen H, Gallagher A, Girod B (2012) Describing clothing by semantic attributes. In: European conference on computer vision, pp 609–623
Chen B, Shu H, Coatrieux G, Chen G, Sun X, Coatrieux J (2015) Color image analysis by quaternion-type moments. J Math Imaging and Vision:124–144
Csurka G, Dance C, Fan L, Willamowski J, Bray C (2004) Visual categorization with bags of keypoints. In: Workshop on statistical learning in computer vision, European conference on computer vision, pp 1–22
Deng Y, Luo P, Loy CC, Tang X (2014) Pedestrian attribute recognition at far distance
Dorko G, Schmid C (2005) Object class recognition using discriminative local features
Duan K, Parikh D, Crandall D, Grauman K (2012) Discovering localized attributes for fine-grained recognition. In: Computer vision and pattern recognition, pp 3474–3481
Ferrari V, Zisserman A (2007) Learning visual attributes, in advances in neural information processing systems:433–440
Fogel I, Sagi D (1989) Gabor filters as texture discriminator. Biol Cybern:103–113
Fu Y, Hospedales TM, Xiang T, Gong S (2012) Attribute learning for understanding unstructured social activity. In: European conference on computer vision, pp 521–534
Gu B, Sheng VS (2016) A robust regularization path algorithm for -support vector classification. IEEE Trans Neural Networks Learning Systems
Jingyuan C, Xuemeng S, Liqiang N, Xiang W, Hanwang Z, Tat-Seng C (2016) Micro tells macro: predicting the popularity of micro-videos via a transductive model. In: Proceedings of the 2016 ACM Conference on Multimedia Conference, MM 2016, Amsterdam, pp 898–907
Layne R, Hospedales TM, Gong S, Mary Q (2012) Person re-identification by attributes. In: British Machine Vision Conference
Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Computer vision and pattern recognition, pp 2169– 2178
Li A, Liu L, Wang K, Liu S, Yan S (2014) Clothing attributes assisted person re-identification, pp 869–878
Liu J, Kuipers B, Savarese S (2011) Recognizing human actions by attributes. In: Computer vision and pattern recognition, pp 3337–3344
Liu Z, Huang H, He Q, Chiew K, Gao Y (2015) Rare category exploration on linear time complexity. In: Database systems for advanced applications - 20th International Conference, DASFAA 2015, Hanoi, Vietnam, April 20-23, 2015, Proceedings, Part II, pp 37–54
Lowe DG (1999) Object recognition from local scale-invariant features. In: The proceedings of the IEEE international conference, pp 1150–1157
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis:91–110
Luming Z, Yahong H, Yi Y, Mingli S, Shuicheng Y, Qi T (2013) Discovering discriminative graphlets for aerial image categories recognition. IEEE Trans Image Process:5071–5084
Luming Z, Meng W, Richang H, Bao-Cai Y, Xuelong L (2016) Large-scale aerial image categorization using a multitask topological codebook. IEEE Trans Cybernetics:535–545
Luming Z, Yang Y, Meng W, Richang H, Liqiang N, Xuelong L (2016) Detecting densely distributed graph patterns for fine-grained image categorization. IEEE Trans Image Processing:553–565
Maji S, Berg AC, Malik J (2008) Classification using intersection kernel support vector machines is efficient. In: Computer vision and pattern recognition, pp 1–8
Ojala T, Pietikäinen M, Harwood D (1996) A comparative study of texture measures with classification based on featured distributions. Pattern Recog:51–59
Schmid C (2001) Constructing models for content-based image retrieval. In: Computer vision and pattern recognition, p 39
Siddiquie B, Feris RS, Davis LS (2011) Image ranking and retrieval based on multi-attribute queries. In: Computer vision and pattern recognition, pp 801–808
Sivic J, Zisserman A (2003) Video google: a text retrieval approach to object matching in videos. In: Proceedings ninth IEEE international conference on computer vision, p 1470
Song X, Nie L, Zhang L, Liu M, Chua T (2015) Interest inference via structure-constrained multi-source multi-task learning. In: Proceedings of the twenty-fourth international joint conference on artificial intelligence. IJCAI, Buenos Aires, pp 2371–2377
Vaquero D, Feris RS, Tran D, Brown L, Hampapur A, Turk M, et al. (2009) Attribute-based people search in surveillance environments. In: Workshop on applications of computer vision, pp 1–8
Wang S, Joo J, Wang Y, Zhu S-C (2013) Weakly supervised learning for attribute localization in outdoor scenes. In: Computer vision and pattern recognition, pp 3111–3118
Wang W, Yan Y, Zhang L, Hong R, Sebe N (2016) Collaborative sparse coding for multiview action recognition, pp 80–87
Wen X, Shao L, Xue Y, Fang W (2015) A rapid learning algorithm for vehicle classification. Inf Sci:395–406
Xuemeng S, Zhaoyan M, Liqiang N, Yi-Liang Z, Tat-Seng C (2016) Volunteerism tendency prediction via harvesting multiple social networks. ACM Trans Inf Syst:10
Yang M, Yu K (2011) Real-time clothing recognition in surveillance videos. In: IEEE international conference image processing, pp 2937–2940
Zhang L, Song M, Sun L, Liu X, Wang Y, Tao D, Bu J, Chen C (2012) Spatial graphlet matching kernel for recognizing aerial image categories. In: International Conference on Pattern Recognition, pp 2813–2816
Zhang L, Song M, Liu X, Bu J, Chen C (2013) Fast multi-view segment graph kernel for object classification. Signal Process:1597–1607
Zhang L, Gao Y, Xia Y, Dai Q, Li X (2015) A fine-grained image categorization system by cellet-encoded spatial pyramid modeling. IEEE Trans Ind Electron:564–571
Zheng Y, Jeon B, Xu D, Wu QMJ, Zhang H (2015) Image segmentation by generalized hierarchical fuzzy c-means algorithm. J Intell Fuzzy Syst:961–973
Zhenguang L, Kevin C, Qinming H, Hao H, Butian H (2014) Prior-free rare category detection: more effective and efficient solutions. Expert Syst Appl:7691–7706
Acknowledgments
This work was supported in part by the National Natural Science Foundation of China (61502337, 61472275, 61170239, 61303208), the Tianjin Research Program of Application Foundation and Advanced Technology (15JCYBJC162000), and the grant of Elite Scholar Program of Tianjin University (2014XRG-0046).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Zhang, J., Li, FW., Nie, WZ. et al. Visual attribute detction for pedestrian detection. Multimed Tools Appl 78, 26833–26850 (2019). https://doi.org/10.1007/s11042-016-4258-5
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-016-4258-5