Abstract
Automatic object classification in traffic scene videos is an important issue for intelligent visual surveillance with great potential for all kinds of security applications. However, this problem is very challenging for the following reasons. Firstly, regions of interest in videos are of low resolution and limited size due to the capacity of conventional surveillance cameras. Secondly, the intra-class variations are very large due to changes of view angles, lighting conditions, and environments. Thirdly, real-time performance of algorithms is always required for real applications. In this paper, we evaluate the performance of local feature descriptors for automatic object classification in traffic scenes. Image intensity or gradient information is directly used to construct effective feature vectors from regions of interest extracted via motion detection. This strategy has great advantages of efficiency compared to various complicated texture features. We not only analyze and evaluate the performance of different feature descriptors, but also fuse different scales and features to achieve better performance. Numerous experiments are conducted and experimental results demonstrate the efficiency and effectiveness of this strategy with robustness to noise, variance of view angles, lighting conditions, and environments.
Similar content being viewed by others
References
Brown L. View independent vehicle/person classification. In: Proceedings of the ACM 2nd International Workshop on Video Surveillance & Sensor Networks. 2004, 114–123
Rivlin E, Rudzsky M, Goldenberg R, Bogomolov U, Lepchev S. A realtime system for classification of moving objects. In: Proceedings of the 16th International Conference on Pattern Recognition. 2002, 688–691
Grimson W, Stauffer C, Romano R, Lee L. Using adaptive tracking to classify and monitor activities in a site. In: Proceedings of the 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 1998, 22–29
Zhou Q, Aggarwal J. Tracking and classifying moving objects from video. In: Proceedings of IEEE Workshop on Performance Evaluation of Tracking and Surveillance. 2001
Zhang Z, Cai Y, Huang K, Tan T. Real-time moving object classification with automatic scene division. In: Proceedings of the 2007 IEEE International Conference on Image Processing, ICIP’ 07. 2007, 149–152
Tan T, Sullivan G, Baker K. Model-based localisation and recognition of road vehicles. International Journal of Computer Vision, 1998, 27(1): 5–25
Zhang Z, Dong W, Huang K, Tan T. Eda approach for model based localization and recognition of vehicles. In: Proceedings of the 2007 IEEE Conference on Computer Vision and Pattern Recognition. 2007, 1–8
Yan P, Khan S, Shah M. 3D model based object class detection in an arbitrary view. In: Proceedings of the IEEE 11th International Conference on Computer Vision, ICCV’ 07. 2007, 1–6
Everingham M, Gool L, Williams C, Winn J, Zisserman A. The pascal visual object classes (VOC) challenge. International Journal of Computer Vision, 2010, 88(2): 303–338
Han F, Shan Y, Cekander R, Sawhney H, Kumar R. A two-stage approach to people and vehicle detection with hog-based svm. In: Proceedings of Performance Metrics for Intelligent Systems Workshop in Conjunction with the IEEE Safety, Security, and Rescue Robotics Conference. 2006
Viola P, Jones M, Snow D. Detecting pedestrians using patterns of motion and appearance. In: Proceedings of the 9th IEEE International Conference on Computer Vision. 2003, 734–741
Bicego M, Castellani U, Murino V. A hidden markov model approach for appearance-based 3D object recognition. Pattern Recognition Letters, 2005, 26(16): 2588–2599
Zitova B, Flusser J. Image registration methods: a survey. Image and Vision Computing, 2003, 21(11): 977–1000
Lowe D. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 2004, 60(2): 91–110
Belongie S, Malik J, Puzicha J. Shape matching and object recognition using shape contexts. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(4): 509–522
Mikolajczyk K, Schmid C. A performance evaluation of local descriptors. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2005, 27(10): 1615–1630
Gehler P, Nowozin S. On feature combination for multiclass object classification. In: Proccedings of the IEEE 12th International Conference on Computer Vision. 2009, 221–228
Munoz D, Bagnell J, Vandapel N, Hebert M. Contextual classification with functional max-margin markov networks. In: Proccedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, CVPR’ 09. 2009, 975–982
Grzegorzek M, Sav S, O’Connor N, Izquierdo E. Local wavelet features for statistical object classification and localization. IEEE Transactions on Multimedia, 2010, 17(1): 118
Ojala T, Pietikainen M, Maenpaa T. Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(7): 971–987
Lazebnik S, Schmid C, Ponce J. A sparse texture representation using local affine regions. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2005, 27(8): 1265–1278
Burges C. A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery, 1998, 2(2): 121–167
Friedman J, Hastie T, Tibshirani R. Additive logistic regression: a statistical view of boosting. Institute of Mathematical Statistics, 2000, 28(2): 337–407
Zhang Z, Li M, Huang K, Tan T. Boosting local feature descriptors for automatic objects classification in traffic scene surveillance. In: Proccedings of the 19th International Conference on Pattern Recognition, ICPR’ 08. 2008, 1–4
Stauffer C, Grimson W. Adaptive background mixture models for realtime tracking. In: Proccedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR’ 99. 1999
Liu Z, Huang K, Tan T, Wang L. Cast shadow removal with GMM for surface reflectance component. In: Proccedings of the 18th International Conference on Pattern Recognition, ICPR’ 06. 2006, 727–730
Author information
Authors and Affiliations
Corresponding author
Additional information
Zhaoxiang Zhang received his BS in Electronic Science and Technology from the University of Science and Technology of China, Hefei, China, in 2004. He received his PhD in Pattern Recognition and Intelligent Systems from the National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, China, in 2009. In October 2009, he joined the Laboratory of Intelligent Recognition and Image Processing, Beijing Key Laboratory of Digital Media, School of Computer Science and Engineering, Beihang University, Beijing, as a Lecturer. His research interests include computer vision, pattern recognition, image processing, and machine learning.
Yunhong Wang received her BS in Electronic Engineering from Northwestern Polytechnical University, Xi’an, China, in 1989 and her MS and PhD in Electronic Engineering from Nanjing University of Science and Technology, Nanjing, China, in 1995 and 1998, respectively. From 1998 to 2004, she was with the National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, China. Since 2004, she has been a professor with the School of Computer Science and Engineering, Beihang University, Beijing, where she is also the Director of the Laboratory of Intelligent Recognition and Image Processing, Beijing Key Laboratory of Digital Media. Her research interests include biometrics, pattern recognition, computer vision, data fusion, and image processing.
Rights and permissions
About this article
Cite this article
Zhang, Z., Wang, Y. Automatic object classification using motion blob based local feature fusion for traffic scene surveillance. Front. Comput. Sci. 6, 537–546 (2012). https://doi.org/10.1007/s11704-012-1296-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11704-012-1296-7