Abstract
This paper presents a method of detecting abnormal activity in crowd videos while considering the direction of the dominant crowd motion. One main goal of our approach is to be able to run at the edge of the surveillance network close to the surveillance cameras so as to reduce network congestion and decision latency. To capture motion features while considering the direction of dominant crowd direction we propose a generalised shear transform based spatio-temporal region. To detect abnormal activity, an autoencoder based method is adopted considering the requirement for running the method at the network edge. During training, the autoencoder learns motion features for each spatio-temporal region from video frames containing normal activity. While testing, those motion features from each spatio-temporal region that cannot be reconstructed satisfactorily by the autoencoder indicate abnormal activity. This approach allows coarse localisation as well as detection of abnormal activity. The approach demonstrated \(\mathcal {O}(n)\) behaviour with ability to work at higher frame rates by trading off accuracy. The approach has been verified against recent works on standard abnormal activity datasets: UCSD dataset and Subway dataset.
Similar content being viewed by others
References
Adam A, Rivlin E, Shimshoni I, Reinitz D (2008) Robust real-time unusual event detection using multiple fixed-location monitors. IEEE Trans Pattern Anal Mach Intell 30(3):555–560
Afiq AA, et al. (2019) A review on classifying abnormal behavior in crowd scene. J Vis Commun Image Represent 58:285–303
Biswas S, Babu RV (2017) Anomaly detection via short local trajectories. Neurocomputing 242: 63–72
Bouguet JY (2000) Pyramidal implementation of the Lucas Kanade feature tracker description of the algorithm. OpenCV Document, Intel, Microprocessor Research Labs
Bouindour S, Hu R, Snoussi H (2019) Enhanced convolutional neural network for abnormal event detection in video streams. IEEE Int Conf Artif Intell Knowl Eng (AIKE) 172–178
Chalapathy R, Chawla S (2019) Deep learning for anomaly detection: a survey. arXiv:1901.03407
Chaudhry R, Ravichandran A, Hager G, Vidal R (2009) Histograms of oriented optical flow and Binet-Cauchy kernels on nonlinear dynamical systems for the recognition of human actions. IEEE Conf Comput Vis Pattern Recognit 1932–1939
Chen N, Chen Y, Blasch E, Ling H, You Y, Ye X (2017) Enabling smart urban surveillance at the edge. IEEE Int Conf Smart Cloud 109–119. https://doi.org/10.1109/SmartCloud.2017.24
Chong YS, Tay YH (2017) Abnormal event detection in videos using spatiotemporal Autoencoder. Int Symp Neural Netw. 189–196
Cicirelli F, et al. (2018) Edge computing and social internet of things for Large-Scale smart environments development. IEEE Internet Things J 5(4):2557–2571. https://doi.org/10.1109/JIOT.2017.2775739
Cisco Annual Internet Report, 2018–2023 - Whitepaper (2020)
Cisco Visual Networking Index: Complete Forecast Update, 2017–2022 - White Paper (2018)
Colque RVHM, Junior CAC, Schwartz WR (2015) Histograms of optical flow orientation and magnitude to detect anomalous events in videos. SIBGRAPI Conf Graph Patterns Images 126–133
Colque RVHM, Caetano C, de Andrade MTL, Schwartz WR (2017) Histograms of optical flow orientation and magnitude and entropy to detect anomalous events in videos. IEEE Trans Circ Sys Video Tech 27(3):673–682
Colque RM, et al. (2018) Novel anomalous event detection based on human-object interactions. Int Conf Comput Vis Theory Appl 293–300
Fan Y, Wen G, Li D, Qiu S, Levine MD (2018) Video anomaly detection and localization via gaussian mixture fully convolutional variational autoencoder. arXiv:1805.11223
George M, Jose BR, Mathew J, Kokare P (2019) Autoencoder-based abnormal activity detection using parallelepiped spatiotemporal region. IET Comput Vis 13(1):23–30. https://doi.org/10.1049/iet-cvi.2018.5240
Hasan M, Choi J, Neumann J, Roy-Chowdhury AK, Davis L (2016) Learning temporal regularity in video sequences. IEEE Conf Comput Vis Pattern Recognit 733–742. https://doi.org/10.1109/CVPR.2016.86
He Y, Zhao J (2019) Temporal convolutional networks for anomaly detection in time series. J Phys Conf Ser 1213
Howard AG, et al. (2017) Mobilenets: efficient convolutional neural networks for mobile vision applications. arXiv:1704.04861
Hu X, Huang Y, Gao X, Luo L, Duan Q (2019) Squirrel-cage local binary pattern and its application in video anomaly detection. IEEE Trans Inf Forensics Secur 14(4):1007–1022
Khan MUK, Park H, Kyung C (2019) Rejecting motion outliers for efficient crowd anomaly detection. IEEE Trans Inf Forensics Secur 14(2):541–556. https://doi.org/10.1109/TIFS.2018.2856189
Klaser A, Marszalek M, Schmid C (2008) A spatio-temporal descriptor based on 3D-gradients. Br Mach Vis Conf 275:1–10
Lea C, Flynn M, Vidal R, Reiter A, Hager G (2017) Temporal convolutional networks for action segmentation and detection. IEEE Conf Comput Vis Pattern Recognit (CVPR) 1003–1012. https://doi.org/10.1109/CVPR.2017.113
Lei Z, Deng F, Yang X (2019) Spatial temporal balanced generative adversarial autoencoder for anomaly detection. Int Conf Image Video Signal Process 1–7
Leyva R, Sanchez V, Li CT (2017) Video anomaly detection with compact feature sets for online performance. IEEE Trans Image Proc 26(7):3463–3478
Li W, Mahadevan V, Vasconcelos N (2014) Anomaly detection and localization in crowded scenes. IEEE Trans Pattern Anal Mach Intell 36(1):18–32
Liu P, Tao Y, Zhao W, Tang X (2017) Abnormal crowd motion detection using double sparse representation. Neurocomputing 269:3–12
Lloyd K, Marshall D, Moore SC, Rosin PL (2017) Detecting violent and abnormal crowd activity using temporal analysis of grey level co-occurrence matrix (GLCM) based texture measures. Mach Vis Appl 28:361–371
Lu C, Shi J, Wang W, Jia J (2018) Fast abnormal event detection. Int J Comput Vis 1–18
Mabrouk AB, Zagrouba E (2018) Abnormal behavior recognition for intelligent video surveillance systems: a review. Expert Syst Appl 91:480–491
Mahadevan V, Li W, Bhalodia V, Vasconcelos N (2010) Anomaly detection in crowded scenes. Proc IEEE Conf Comput Vis Pattern Recognit (CVPR) 1975–1981
Miraftabzadeh SA, Rad P, Choo KR, Jamshidi M (2018) A privacy-aware architecture at the edge for autonomous real-time identity reidentification in crowds. IEEE Internet Things J 5(4):2936–2946. https://doi.org/10.1109/JIOT.2017.2761801
Nikouei SY, Chen Y, Song S, Xu R, Choi BY, Faughnan T (2018) Real-time human detection as an edge service enabled by a lightweight CNN. IEEE Int Conf Edge Comput 125–129
Nikouei SY, Chen Y, Song S, Xu R, Choi BY, Faughnan T (2018) Intelligent surveillance as an edge network service: from harr-cascade, SVM to a lightweight CNN. arXiv:1805.00331
Quigley PA, et al. (2019) Outcomes of patient-engaged video surveillance on falls and other adverse events. Clin Geriatr Med 35(2):253–263
Rabiee H, Mousavi H, Nabi M, Ravanbakhsh M (2017) Detection and localization of crowd behavior using a novel tracklet-based model. Int J Mach Learn Cybern 1–12
Ravanbakhsh M, Nabi M, Mousavi H, Sangineto E, Sebe N (2018) Plug-and-play CNN for crowd motion analysis: an application in abnormal event detection. IEEE Winter Conf Appl Comput Vis (WACV) 1689–1698
Rumelhart D, Hinton G, Williams R (1986) Learning representations by back-propagating errors. Nature 323:533–536. https://doi.org/10.1038/323533a0
Shi W, Cao J, Zhang Q, Li Y, Xu L (2016) Edge computing: vision and challenges. IEEE Internet Things J 3(5):637–646
Shi X, Chen Z, Wang H, Yeung DY, Wong W, Woo W (2015) Convolutional LSTM network: a machine learning approach for precipitation nowcasting. Int Conf Neural Inf Process Syst 802–810
Sun J, Wang X, Xiong N, Shao J (2018) Learning sparse representation with variational Auto-Encoder for anomaly detection. IEEE Access 6:33353–33361. https://doi.org/10.1109/ACCESS.2018.2848210
Sutskever I, Vinyals O, Le QV (2014) Sequence to sequence learning with neural networks. Int Conf Neural Inf Process Syst 3104–3112
Szegedy C, et al. (2015) Going deeper with convolutions. IEEE Conf Comput Vis Pattern Recognit (CVPR) 1–9
Tsakanikas V, Dagiuklas T (2017) Video surveillance systems-current status and future trends. Comput Electr Eng. https://doi.org/10.1016/j.compeleceng.2017.11.011
Usman M, Jan MA, He X, Chen J (2019) A survey on big multimedia data processing and management in smart cities. ACM Comput Surv 52 (3):54:1–54:29
Wang J, Xu Z (2016) Spatio-temporal texture modelling for real-time crowd anomaly detection. Comput Vis Image Underst 144:177–187
Wang J, Cherian A, Porikli F (2017) Ordered pooling of optical flow sequences for action recognition. IEEE Winter Conf Appl Comput Vis (WACV) 168–176. https://doi.org/10.1109/WACV.2017.26
Wang L, Zhou F, Li Z, Zuo W, Tan H (2018) Abnormal event detection in videos using hybrid spatio-temporal autoencoder. IEEE Int Conf Image Process 2276–2280
Werbos PJ (1990) Backpropagation through time: what it does and how to do it. Proc IEEE 78(10):1550–1560
Xu R, et al. (2018) Real-time human objects tracking for smart surveillance at the edge. IEEE Int Conf Commun 1–6. https://doi.org/10.1109/ICC.2018.8422970
Yang B, Cao J, Ni R, Zou L (2018) Anomaly detection in moving crowds through spatiotemporal autoencoding and additional attention. Adv Multimed 2018:1–8
Yuan Y, Feng Y, Lu X (2017) Statistical hypothesis detector for abnormal event detection in crowded scenes. IEEE Trans Cybern 47(11):3597–3608
Yuan Y, Feng Y, Lu X (2018) Structured dictionary learning for abnormal event detection in crowded scenes. Pattern Recognit 73:99–110
Zhang T, et al. (2015) The design and implementation of a wireless video surveillance system. Annu Int Conf Mob Comput Netw 426–438
Zhou F, Wang L, Li Z, Zuo W, Tan H (2019) Unsupervised learning approach for abnormal event detection in surveillance video by hybrid autoencoder. Neural Process Lett. https://doi.org/10.1007/s11063-019-10113-w
Zitouni MS, Sluzek A, Bhaskar H (2019) Visual analysis of socio-cognitive crowd behaviors for surveillance: a survey and categorization of trends and methods. Eng Appl Artif Intell 82:294–312
Acknowledgments
This work was supported by Technical Education Quality Improvement Programme (TEQIP) Research Seed Money Project (No. TEQIP/PTRA/2017); APJ Abdul Kalam Technological University - Center for Engineering Research & Development (APJAKTU-CERD) Research Seed Money Project (No. KTU/RESEARCH 2/4068/2019).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
George, M., Jose, B.R. & Mathew, J. Abnormal activity detection using shear transformed spatio-temporal regions at the surveillance network edge. Multimed Tools Appl 79, 27511–27532 (2020). https://doi.org/10.1007/s11042-020-09277-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-020-09277-8