Abstract
Abnormal event detection and localization is a challenging research problem in intelligent video surveillance. The goal is to automatically identify abnormal events in surveillance videos. The main difficulty of this task is that the training video sequences contain only one class, the “normal event”. In recent years, many advanced algorithms have been proposed on the basis of hand-crafted features. Only a few algorithms are based on high-level features, and almost all of them use two-stage learning. In this paper, we propose a novel end-to-end model, named the Deep One-Class (DOC) model, which integrates the one-class Support Vector Machine (SVM) into a Convolutional Neural Network (CNN). Specifically, a robust loss function derived from the one-class SVM is used to optimize the parameters of the model. Compared with hierarchical models, our model not only simplifies the process but also obtains the globally optimal solution of the whole process. In the experiments, we validate our DOC model on a publicly available dataset and compare it with several state-of-the-art methods. The comparison results demonstrate that our model performs well and is effective for abnormal event detection in surveillance videos.
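To make the idea of a loss derived from the one-class SVM more concrete, the following is a minimal sketch of the standard one-class SVM objective (the Schölkopf et al. formulation) evaluated on fixed feature vectors. The abstract does not give the exact DOC loss, so this is not the authors' implementation. The function names, the linear scoring and the assumption that the feature vectors stand in for CNN outputs are all illustrative choices.

```python
def one_class_svm_loss(features, w, rho, nu):
    """Standard one-class SVM objective on a batch of feature vectors.

    Assumption: `features` stands in for CNN outputs; in an end-to-end
    model these would be produced by the network being trained.
    loss = 0.5 * ||w||^2 + (1 / (nu * n)) * sum(max(0, rho - w.x)) - rho
    """
    n = len(features)
    # Linear decision score w . x for every sample
    scores = [sum(wi * xi for wi, xi in zip(w, x)) for x in features]
    # Hinge penalty for samples whose score falls below the offset rho
    hinge = sum(max(0.0, rho - s) for s in scores)
    # L2 regularization on the weight vector
    reg = 0.5 * sum(wi * wi for wi in w)
    return reg + hinge / (nu * n) - rho


def is_abnormal(x, w, rho):
    """Flag a sample as abnormal when its score lies below rho."""
    return sum(wi * xi for wi, xi in zip(w, x)) - rho < 0


# Hypothetical toy data: two "normal" feature vectors
normal = [[1.0, 0.0], [0.0, 1.0]]
w, rho, nu = [1.0, 1.0], 0.5, 0.5
loss = one_class_svm_loss(normal, w, rho, nu)
```

In this sketch only normal samples are needed to evaluate the objective, which matches the one-class setting described above. The parameter nu bounds the fraction of training samples allowed to fall below the margin.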





Acknowledgements
This work is supported by the National Natural Science Foundation of China (grants No. 61672133 and No. 61632007), and the Fundamental Research Funds for the Central Universities (grants No. ZYGX2015J058 and No. ZYGX2014Z007).
Cite this article
Sun, J., Shao, J. & He, C. Abnormal event detection for video surveillance using deep one-class learning. Multimed Tools Appl 78, 3633–3647 (2019). https://doi.org/10.1007/s11042-017-5244-2
DOI: https://doi.org/10.1007/s11042-017-5244-2