Abstract
Anomaly detection in surveillance videos is a challenging and demanding task. Autoencoders trained on segments of normal events are expected to give high reconstruction error for abnormal events than that for normal events. However, the assumption of autoencoders giving high reconstruction error is not always true in practice. Since the autoencoder sometimes offers better generalization, it also reconstructs abnormal events well, leading to slightly degraded performance for anomaly detection. Another issue is that the performance of real-time anomalous activity detection in surveillance videos still needs improvement. To address these issues, we propose an Object-centric and Memory-guided residual spatiotemporal autoencoder (OM-RSTAE) to detect video anomalies. The proposed technique achieved improved results over benchmark datasets, namely UCSD-Ped2, Avenue, ShanghaiTech and UCF-Crime datasets.
Similar content being viewed by others
References
Deepak, K., Chandrakala, S., Mohan, C.K.: Residual spatiotemporal autoencoder for unsupervised video anomaly detection. Signal, Image Video Process 15, 215–222 (2021)
Del Giorno, A., Bagnell, J. A., Hebert, M.: A discriminative framework for anomaly detection in large videos. In: European Conference on Computer Vision. Springer, pp 334–349 (2016)
Dong, F., Zhang, Y., Nie, X.: Dual discriminator generative adversarial network for video anomaly detection. IEEE Access 8, 88170–88176 (2020)
Gong, D., Liu, L., Le, V., et al.: Memorizing normality to detect anomaly: Memory-augmented deep autoencoder for unsupervised anomaly detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp 1705–1714 (2019)
Hasan, M., Choi, J., Neumann, J., et al.: Learning temporal regularity in video sequences. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 733–742 (2016)
Ionescu, R.T., Smeureanu, S., Popescu, M., et al.: Detecting abnormal events in video using narrowed normality clusters. In: 2019 IEEE Winter Conference on Applications of Computer Vision (WACV). IEEE, pp 1951–1960 (2019)
Li, N., Chang, F., Liu, C.: Spatial-temporal cascade autoencoder for video anomaly detection in crowded scenes. IEEE Trans Multimedia 23, 203–215 (2021)
Liu, W., Luo, W., Lian, D., et al.: Future frame prediction for anomaly detection—a new baseline. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 6536–6545 (2018)
Lu, C., Shi, J., Jia, J.: Abnormal event detection at 150 fps in matlab. In: Proceedings of the IEEE International Conference on Computer Vision, pp 2720–2727 (2013)
Luo, W., Liu, W., Gao, S.: A revisit of sparse coding based anomaly detection in stacked RNN framework. In: Proceedings of the IEEE International Conference on Computer Vision, pp 341–349 (2017)
Mahadevan, V., Li, W., Bhalodia, V., et al.: Anomaly detection in crowded scenes. In: 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, IEEE, pp 1975–1981 (2010)
Mehran, R., Oyama, A., Shah, M.: Abnormal crowd behavior detection using social force model. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, pp 935–942 (2009)
Pkulzc, R.V., Neal, W.: Tensorflow detection model zoo. GitHub [Online] (2019). https://github.com/tensor-flow/models/blob/master/research/object.detection/g3doc/detection.model.zoo.md
Ravanbakhsh, M., Nabi, M., Sangineto, E., et al.: Abnormal event detection in videos using generative adversarial nets. In: 2017 IEEE International Conference on Image Processing (ICIP). IEEE, pp 1577–1581 (2017)
Shi, Y., Tian, Y., Wang, Y., et al.: Sequential deep trajectory descriptor for action recognition with three-stream CNN. IEEE Trans. Multimedia 19(7), 1510–1520 (2017)
Sultani, W., Chen, C., Shah, M.: Real-world anomaly detection in surveillance videos. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 6479–6488 (2018)
Tran, H.T., Hogg, D.: Anomaly detection using a convolutional winner-take-all autoencoder. In: Proceedings of the British Machine Vision Conference 2017. British Machine Vision Association (2017)
Tudor Ionescu, R., Smeureanu, S., Alexe, B., et al.: Unmasking the abnormal events in video. In: Proceedings of the IEEE International Conference on Computer Vision, pp 2895–2903 (2017)
Zhao, Y., Deng, B., Shen, C., et al.: Spatio-temporal autoencoder for video anomaly detection. In: Proceedings of the 25th ACM International Conference on Multimedia, pp 1933–1941 (2017)
Zhao, Y., Deng, B., Shen, C., et al.: Spatio-temporal autoencoder for video anomaly detection. In: ACM Multimedia (2017)
Acknowledgements
This work was supported by No. DST/CSRI/2017/131(G) under Cognitive Science Research Initiative (CSRI), Department of Science and Technology, Government of India.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
Authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Chandrakala, S., Shalmiya, P., Srinivas, V. et al. Object-centric and memory-guided network-based normality modeling for video anomaly detection. SIViP 16, 2001–2007 (2022). https://doi.org/10.1007/s11760-022-02161-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-022-02161-y