Abstract
Since from emergence of deep learning techniques, automatic video surveillance has received researchers’ attention. Such deep learning-based methods improve the accuracy but lead to higher computational complexities than the semi-automatic approach. A unique framework is proposed in this study to bridge the gap between automated and semi-automatic operations to save time and boost accuracy. The main advantage of the proposed model is reducing the computational complexity while improving the overall accuracy using the deep learning approach in video anomaly detection applications. The framework consists of keyframe extraction, robust features extraction, automatic features learning, and classification. Extracting keyframes from the input videos reduces the computational burden during features extraction and classification steps. The novel lightweight keyframe extraction algorithm using the histogram and dynamic thresholding technique is proposed in this paper. This research proposes a revolutionary lightweight keyframe extraction approach based on the histogram and dynamic thresholding technique. Efficient motion tracking among frames is a critical research challenge in video anomaly detection. We propose the novel Modified Spatio-Temporal (MST) approach to extract the interest points as features in this paper. Input frames are normalized and pre-processed using Gaussian filtering first in the proposed MST. Then motion tracking and cuboids are estimated from the pre-processed frames. From 3D cuboids, we applied Discrete Wavelet Transform (DWT) with Principal Component Analysis (PCA) to generate the codebook. This codebook passed as sequential input to Recurrent Neural Network (RNN) using Long Term Short Memory (LSTM) classifier called RNN-LSTM. The novel training and classification model of deep learning is designed to estimate the probability of each class (i.e., anomalous or normal) of the video sequence. The experimental results of the proposed MST-RNN-LSTM model achieved significant improvement in computational and detection efficiencies compared to state-of-art methods. The keyframe extraction algorithm discarded 50% of the frames in an input video sequence, resulting in less computing overhead. The accuracy of the proposed model is improved by 4.5% and the processing time is reduced by 21% compared to state-of-art methods.
Similar content being viewed by others
Data availability
The datasets analysed during the current study are available from the corresponding author on reasonable request.
References
Aköz, Ö, Karsligil, E (2014) Traffic event classification at intersections based on the severity of abnormality Machine Vision and Applications 25. https://doi.org/10.1007/s00138-011-0390-4
Alhayani, B, Kwekha-Rashid, AS, Mahajan, HB et al. (2022) 5G standards for the Industry 4.0 enabled communication systems using artificial intelligence: perspective of smart healthcare system. Appl Nanosci. https://doi.org/10.1007/s13204-021-02152-4.
Bermejo NE, Deniz SO, Bueno GG, Sukthankar R (2011) Violence Detection in Video Using Computer Vision Techniques. In: Real P, Diaz-Pernil D, Molina-Abril H, Berciano A, Kropatsch W (eds) Computer Analysis of Images and Patterns. CAIP 2011. Lecture notes in computer science, vol 6855. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23678-5_39
Deepak, K, Srivathsan, G, Roshan, S, Chandrakala S (2020) Deep multi-view representation learning for video anomaly detection using spatiotemporal autoencoders. Circuits Syst Signal Process. https://doi.org/10.1007/s00034-020-01522-7
Deepak K, Chandrakala S, Mohan CK (2021) Residual spatiotemporal autoencoder for unsupervised video anomaly detection. SIViP 15:215–222. https://doi.org/10.1007/s11760-020-01740-1
Esen, E, Arabaci, MA, Soysal, M (2013) Fight detection in surveillance videos. 2013 11th international workshop on content-based multimedia indexing (CBMI). https://doi.org/10.1109/cbmi.2013.6576569.
Hospedales, T, Li, J, Gong, S, Xiang, T. (2011) Identifying Rare and Subtle Behaviors: A Weakly Supervised Joint Topic Model IEEE Trans Pattern Anal Mach Intell 33. https://doi.org/10.1109/TPAMI.2011.81.
Hu J, Zhu E, Wang S, Liu X, Guo X, Yin J (2019) An efficient and robust unsupervised anomaly detection method using ensemble random projection in surveillance videos. Sensors 19(19):4145. https://doi.org/10.3390/s19194145
Huang, C, Wu, Z, Jie, W, Xu, Y, Jiang, Q, Wang, Y (2021) Abnormal event detection using deep contrastive learning for intelligent video surveillance system. IEEE Trans Indust Inf PP. 1–1. https://doi.org/10.1109/TII.2021.3122801.
Jeong H, Yoo Y, Yi K, Choi J (2014) Two-stage online inference model for traffic pattern analysis and anomaly detection. Mach Vis Appl 25:1501–1517. https://doi.org/10.1007/s00138-014-0629-y
Kamijo, S, Matsushita, Y, Ikeuchi, K, Sakauchi, M (n.d.). Traffic monitoring and accident detection at intersections. Proceedings 199 IEEE/IEEJ/JSAI international conference on intelligent transportation systems (cat. No.99TH8383). https://doi.org/10.1109/itsc.1999.821147.
Khaire P, Kumar P (2022) A semi-supervised deep learning based video anomaly detection framework using RGB-D for surveillance of real-world critical environments. Forensic Sci Int Digit Investig 40:301346. https://doi.org/10.1016/j.fsidi.2022.301346
Kotkar, V. (2018) A comparative analysis of machine learning based anomaly detection techniques in video surveillance. J Eng Appl Sci (JEAS) 12(12SI):9376–9381. https://doi.org/10.36478/jeasci.2017.9376.9381
Kotkar V (2020) Scalable anomaly detection framework in video surveillance using Keyframe extraction and machine learning algorithms. J Adv Res Dyn Control Syst 12:395–408. https://doi.org/10.5373/JARDCS/V12I7/20202020
Lessard, A, Bélisle, F, Bilodeau, G-A, Saunier, N (2016). The CountingApp, or How to Count Vehicles in 500 Hours of Video. https://doi.org/10.1109/CVPRW.2016.198.
LI W-X, Mahadevan V, Vasconcelos N (2013) Anomaly Detection and Localization in Crowded Scenes. IEEE Trans Pattern Anal Mach Intell (TPAMI) 36:18–32. https://doi.org/10.1109/TPAMI.2013.111
Liu, W, Luo, W, Lian, D, Gao, S (2018) Future Frame Prediction for Anomaly Detection - A New Baseline. 6536–6545. https://doi.org/10.1109/CVPR.2018.00684.
Liu, Z, Nie, Y, Long, C, Zhang, Q, Li, G (2021) A Hybrid Video Anomaly Detection Framework via Memory-Augmented Flow Reconstruction and Flow-Guided Frame Prediction. 13568–13577. https://doi.org/10.1109/ICCV48922.2021.01333.
Lu, C, Shi, J, Jia, J (2013) Abnormal event detection at 150 FPS in MATLAB. 2013 IEEE international conference on computer vision. https://doi.org/10.1109/iccv.2013.338.
Luo, W, Liu, W, Lian, D, Tang, J, Duan, L, Peng, X, Gao, S (2019) Video anomaly detection with sparse coding inspired deep neural networks. IEEE transactions on pattern analysis and machine intelligence, 1–1. https://doi.org/10.1109/tpami.2019.2944377.
Luo, W, Liu, W, Lian, D, Gao, S (2021) Future frame prediction network for video anomaly detection. IEEE Trans Pattern Anal Mach Intell PP. 1–1. https://doi.org/10.1109/TPAMI.2021.3129349.
Mabrouk A, Zagrouba E (2018) Abnormal behavior recognition for intelligent video surveillance systems: A review. Exp Syst Appl 91:480–491. https://doi.org/10.1016/j.eswa.2017.09.029
Mahajan HB, Badarla A (2021) Cross-layer protocol for WSN-assisted IoT smart farming applications using nature inspired algorithm. Wirel Pers Commun 121:3125–3149. https://doi.org/10.1007/s11277-021-08866-6
Mahajan HB, Badarla A, Junnarkar AA (2021) CL-IoT: cross-layer internet of things protocol for intelligent manufacturing of smart farming. J Ambient Intell Humaniz Comput 12:7777–7791. https://doi.org/10.1007/s12652-020-02502-0
Mahajan, HB, Rashid, AS, Junnarkar, AA et al. (2022) Integration of Healthcare 4.0 and blockchain into secure cloud-based electronic health records systems. Appl Nanosci https://doi.org/10.1007/s13204-021-02164-0.
Mehrsan JR, Levine M (2013) An on-line, real-time learning method for detecting anomalies in videos using spatio-temporal compositions. Comput Vis Image Underst 117:1436–1452. https://doi.org/10.1016/j.cviu.2013.06.007
Mikhail, A, Kamil, IA, Mahajan, H (2017) Increasing SCADA System Availability by Fault Tolerance Techniques. 2017 International conference on computing, communication, Control Autom (ICCUBEA) https://doi.org/10.1109/iccubea.2017.8463911.
Mikhail, A, Kareem, HH, Mahajan, H (2017) Fault Tolerance to Balance for Messaging Layers in Communication Society. 2017 International conference on computing, communication, Control and Automation (ICCUBEA) https://doi.org/10.1109/iccubea.2017.8463871.
Mo X, Monga V, Bala R, Fan Z (2014) Adaptive Sparse Representations for Video Anomaly Detection. Circuits Syst Vid Technol IEEE Trans 24:631–645. https://doi.org/10.1109/TCSVT.2013.2280061
Mohammadi S, Perina A, Kiani H, Murino V (2016) Angry crowds: detecting violent events in videos. Lect Notes Comput Sci:3–18. https://doi.org/10.1007/978-3-319-46478-7_1
Narasimhan MG (2018) S., S.K. dynamic video anomaly detection and localization using sparse denoising autoencoders. Multimed Tools Appl 77:13173–13195. https://doi.org/10.1007/s11042-017-4940-2
Nasaruddin N, Muchtar K, Afdhal A, Dwiyantoro APJ (2020) Deep anomaly detection through visual attention in surveillance videos. J Big Data 7(1). https://doi.org/10.1186/s40537-020-00365-y
Nawaratne, R, Alahakoon, D, Silva, D, Yu, X (2019) Spatiotemporal anomaly detection using deep learning for real-time video surveillance. IEEE Transactions on Industrial Informatics. PP. 1–1. https://doi.org/10.1109/TII.2019.2938527.
Ngo, D, Toan, D, Nguyen, L (2016) Anomaly detection in video surveillance: A novel approach based on sub-trajectory. 1–4. https://doi.org/10.1109/ELINFOCOM.2016.7562953.
Patino L, Ferryman J (2014) Multiresolution semantic activity characterisation and abnormality discovery in videos. Appl Soft Comput 25:485–495. https://doi.org/10.1016/j.asoc.2014.08.039
Popoola O, Wang K (2012) Video-Based Abnormal Human Behavior Recognition—A Review. IEEE Trans Syst Man Cybern Part C (Appl Rev) 42:865–878. https://doi.org/10.1109/TSMCC.2011.2178594
Ramchandran A, Sangaiah AK (2020) Unsupervised deep learning system for local anomaly event detection in crowded scenes. Multimed Tools Appl 79:35275–35295. https://doi.org/10.1007/s11042-019-7702-5
Shirazi S, Mohammad, Morris, B. (2016) Looking at intersections: a survey of intersection monitoring, Behavior and Safety Analysis of Recent Studies. IEEE Transactions on Intelligent Transportation Systems. PP. 1–21. https://doi.org/10.1109/TITS.2016.2568920.
Sivaraman S, Trivedi M (2013) Looking at Vehicles on the Road: A Survey of Vision-Based Vehicle Detection, Tracking, and Behavior Analysis. Intell Transp Syst, IEEE Trans 14:1773–1795. https://doi.org/10.1109/TITS.2013.2266661
Sodemann, A, Ross, M, Borghetti, B. (2011) A Review of Anomaly Detection in Automated Surveillance. IEEE transactions on systems, man, and cybernetics, part C. 42. https://doi.org/10.1109/TSMCC.2012.2215319.
Srinivasan A, Gnanavel VK (2019) Multiple feature set with feature selection for anomaly search in videos using hybrid classification. Multimed Tools Appl 78:7713–7725. https://doi.org/10.1007/s11042-018-6348-z
Sultani, W, Choi, JY (2010) Abnormal traffic detection using intelligent driver model. 2010 20th international conference on pattern recognition. https://doi.org/10.1109/icpr.2010.88.
Sultani, W, Chen, C, Shah, M (2018) Real-world anomaly detection in surveillance videos. 2018 IEEE/CVF conference on computer vision and pattern recognition. https://doi.org/10.1109/cvpr.2018.00678.
Wang T, Snoussi H (2014) Detection of Abnormal Visual Events via Global Optical Flow Orientation Histogram. Inf Forensic Secur IEEE Trans 9:988–998. https://doi.org/10.1109/TIFS.2014.2315971
Wang, J, Wang, M, Liu, Q, Yin G, Zhang Y (2020) Deep anomaly detection in expressway based on edge computing and deep learning. J Ambient Intell Humaniz Comput. https://doi.org/10.1007/s12652-020-02574-y
Wang W, Chang F, Liu C (2022) Mutuality-oriented reconstruction and prediction hybrid network for video anomaly detection. SIViP 16:1747–1754. https://doi.org/10.1007/s11760-021-02131-w
Wu, C, Shao, S, Tunc, C, Satam, P, Hariri, S (2021) An explainable and efficient deep learning framework for video anomaly detection. Cluster computing, 1–23. Advance online publication. https://doi.org/10.1007/s10586-021-03439-5
Xue Z, Wu W (2020) Anomaly detection by exploiting the tracking trajectory in surveillance videos. SCIENCE CHINA Inf Sci 63:154101. https://doi.org/10.1007/s11432-018-9792-8
Yang B, Cao J, Ni R, Zou L (2018) Anomaly detection in moving crowds through spatiotemporal autoencoding and additional attention. Adv Multimed 2018:1–8. https://doi.org/10.1155/2018/2087574
Yoa, S, Lee, S, Kim, C, Kim, H (2021) Self-Supervised Learning for Anomaly Detection With Dynamic Local Augmentation. IEEE Access. PP. 1–1. https://doi.org/10.1109/ACCESS.2021.3124525.
Yu B, Liu Y, Sun Q (2017) A content-adaptively sparse reconstruction method for abnormal events detection with low-rank property. IEEE Trans Syst Man Cybern Syst 47(4):704–716
Yun, K, Jeong, H, Yi, K, Kim, S, Choi, J (2014) Motion Interaction Field for Accident Detection in Traffic Surveillance Video. Proceed Int Conf Pattern Recognit 3062–3067. https://doi.org/10.1109/ICPR.2014.528.
Zhang, Y, Lu, H, Zhang, L, Ruan, X (2015) Combining motion and appearance cues for anomaly detection. Pattern Recogn 51. https://doi.org/10.1016/j.patcog.2015.09.005
Zhang, Q, Feng, G, Wu, H (2022) Surveillance video anomaly detection via non-local U-net frame prediction. Multimed Tools Appl. https://doi.org/10.1007/s11042-021-11550-3
Funding
No Funding.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
All authors declares that they has no conflict of interest.
Ethical approval
This article does not contain any studies with human participants performed by any of the authors.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Kotkar, V.A., Sucharita, V. Fast anomaly detection in video surveillance system using robust spatiotemporal and deep learning methods. Multimed Tools Appl 82, 34259–34286 (2023). https://doi.org/10.1007/s11042-023-14840-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-14840-0