Abstract
The huge and ever growing volume of traffic video poses a compelling demand for efficient automatic detection of traffic anomaly. In this paper, a new traffic anomaly detection algorithm is introduced. It firstly divides a traffic video into several video cubes in temporal domain, and each video cube is divided into video blocks in spatial domain. Each image block of a video block is described using the local invariant features and the visual codebook approach. Based on the descriptor of the image block, we count the category number of the block (CNB) of a video block. Then, a Gaussian distribution model for estimating the probability of normal traffic with respect to the CNB is learned. The learned Gaussian distribution model is then used to detect the traffic anomaly from the test traffic video. Eventually, the results of all video blocks are fused to achieve the final decision. Experimental results show that the proposed algorithm performs better than two existing algorithms on both the intersection traffic videos and main road traffic videos.
Similar content being viewed by others
References
Adam A, Rivlin E, Shimshoni I, Reinitz D (2008) Robust real-time unusual event detection using multiple fixed-location monitors. IEEE Trans Pattern Anal Mach Intell 30(3):555–560
Aköz Ö, Karsligil ME (2014) Traffic event classification at intersections based on the severity of abnormality. Mach Vis Appl 25(3):613–632
Andrade EL, Blunsden S, Fisher RB (2006) Modelling crowd scenes for event detection. In: 18th International Conference on Pattern Recognition, ICPR 2006. IEEE, Hong Kong, China 175–178
Ballan L, Bertini M, Del Bimbo A, Serra G (2010) Video event classification using string kernels. Multimed Tools Appl 48(1):69–87
Bi S, Han L, Zhong Y, Wang X (2011) All-day traffic states recognition system without vehicle segmentation. J China Univ Posts Telecommun 18(2):1–11
Einicke GA, Malos JT, Reid DC, Hainsworth DW (2009) Riccati equation and EM algorithm convergence for inertial navigation alignment. IEEE Trans Signal Process 57(1):370–375
Guo L, Xiong Y, Huang Q, Li X (2014) Image aesthetic assessment using both hand-crafting and semantic features. Neurocomputng 143:14–26
Helbing D (1997) Fundamentals of traffic flow. Phys Rev E55(3):3735
Huang Q, Chen B, Wang J, Mei T (2014) Personalized video recommendation through graph propagation. ACM Trans Multimed Comput Commun Appl
Huang Q, Yang F, Liu L, Li X (2014) Automatic segmentation of breast lesions for intelligent interaction in ultrasonic computer-aided diagnosis. Inf Sci
Hunt HEM (1991) Modelling of road vehicles for calculation of traffic-induced ground vibration as a random process. J Sound Vib 144(1):41–51
Jin X, Srinivasan D, Cheu RL (2001) Classification of freeway traffic patterns for incident detection using constructive probabilistic neural networks. IEEE Trans Neural Netw 12(5):1173–1187
Kamijo S, Matsushita Y, Ikeuchi K, Sakauchi M (2000) Traffic monitoring and accident detection at intersections. IEEE Trans Intell Transp Syst 1(2):108–118
Ki YK, Lee DY (2007) A traffic accident recording and reporting model at intersections. IEEE Trans Intell Transp Syst 8(2):188–194
Kuncheva LI, Jain LC (1999) Nearest neighbor classifier: simultaneous editing and feature selection. Pattern Recogn Lett 20(11):1149–1156
Lee IJ (2012) An accident detection system on highway through CCTV with calogero-moser system. In: IEEE 18th Asia-Pacific Conference on Communications, APCC2012. IEEE, Jeju Island 522–525
Li Y, Liu W, Huang Q, Li X (2014) Fuzzy bag of words for social image description. Multimed Tools Appl. In press
Li Y, Liu W, Li X, Huang Q, Li X (2013) GA-SIFT: a new scale invariant feature transform for multispectral image using geometric algebra. Inform Sci 281(10):559–572
Li F, Pietro P (2005) A bayesian hierarchical model for learning natural scene categories. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005. IEEE, San Diego, CA, USA, pp 524–531. doi: 10.1109/CVPR.2005.16
Lowe DG (1999) Object recognition from local scale-invariant features. In: The Proceedings of the Seventh IEEE International Conference on Computer Vision, ICCV 1999. IEEE, Kerkyra, Greece, pp 1150–1157. doi:10.1109/ICCV.1999.790410
Mei T, Li L, Hua X, Li S (2012) Imagesense: towards contextual image advertising. ACM Trans Multimed Comput Commun Appl 8(1):6:1–18
Mei T, Tang L, Tang J, Hua X (2013) Near-lossless semantic video summarization and its applications to video analysis. ACM Trans Multimed Comput Commun Appl 9(3):16
Mei T, Yang B, Hua XS, Li S (2011) Contextual video recommendation by multimodal relevance and user feedback. ACM Trans Inf Syst 29(2):1–24
Mo X, Monga V, Bala R, Fan Z (2014) Adaptive sparse representations for video anomaly detection. IEEE Trans Circuits Syst Video Technol 24(4):631–645
Qu X, Kuang Y, Oh E, Jin S (2014) Safety evaluation for expressways: a comparative study for macroscopic and microscopic indicators. Traffic Inj Prev 15(1):89–93
Shewhart WA (1931) Economic control of quality of manufactured product. ASQ Quality Press
Wang J, Yang J, Yu K, Lv F, Huang T, Gong Y (2010) Locality-constrained linear coding for image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2010. IEEE, San Francisco, CA, pp 3360–3367. doi:10.1109/CVPR.2010.5540018
Xu D, Chang SF (2008) Video event recognition using kernel methods with multilevel temporal alignment. IEEE Trans Pattern Anal Mach Intell 30(11):1985–1997
Yang F, Huang Q, Jin L, Liew AWC (2014) Segmentation and recognition of multi-model photo event. Neurocomputing
Yang J, Yu K, Gong Y, Huang T (2009) Linear spatial pyramid matching using sparse coding for image classification. IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2009. IEEE, Miami, FL, USA, pp 1794–1801. doi: 10.1109/CVPR.2009.5206757
Zhang T, Liu S, Xu C, Lu H (2012) Mining semantic context information for intelligent video surveillance of traffic scenes. IEEE Trans Ind Inf 9(1):149–160
Zhang T, Lu H, Li SZ (2009) Learning semantic scene models by object classification and trajectory clustering. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2009. IEEE, Miami, FL, pp:1940–1947. doi:10.1109/CVPR.2009.5206809
Zhou J, Ye M, Ding J, Mao S, Zhang HJ (2012) Rapid and robust traffic accident detection based on orientation map. Opt Eng 51(11):117201–117201
Zhou X, Zhuang X, Yan S, Chang SF, Hasegawa-Johnson M, Huang, TS (2008) Sift-bag kernel for video event analysis. In: Proceedings of the 16th ACM international conference on Multimedia, MM 2008. ACM, New York, USA 229–238
Acknowledgments
This work was supported by National Natural Science Funds of China (Nos. 61401286 and 61372007), Guangdong Provincial Project of Transportation Science and Technology (No. 2012-02-084), Natural Science Funds of Guangdong Province (Nos. S2012010009885 and S2013010012966), Projects of innovative science and technology, Department of Education, Guangdong Province (No.2013KJCX0012), and Natural Science Foundation of SZU (No. 201415).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Li, Y., Liu, W. & Huang, Q. Traffic anomaly detection based on image descriptor in videos. Multimed Tools Appl 75, 2487–2505 (2016). https://doi.org/10.1007/s11042-015-2637-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-015-2637-y