skip to main content
10.1145/3501409.3501698acmotherconferencesArticle/Chapter ViewAbstractPublication PageseitceConference Proceedingsconference-collections
research-article

3D U-Net for Video Anomaly Detection

Authors Info & Claims
Published:31 December 2021Publication History

ABSTRACT

With the widespread of surveillance video application, anomaly detection in surveillance video is significant for safety maintenance. In this paper, we propose a new abnormal detection framework, which contains an improved future frame prediction model based on GAN network to locate the abnormal events in the video. Unlike the previous U-Net network which is usually used as prediction model in abnormal event detection, 3D U-Net is adopted in our proposed network for predicting future frames which can fuse the spatial and temporal features simultaneously by using 3-dimensional convolution. For normal events, the improved prediction network takes into account the temporal features, resulting in low abnormal scores, while abnormal events have high abnormal scores. Experiments show that this method can get good results on the Avenue, UCSD Ped2 and ShanghaiTech datasets, and the AUC values on these datasets could reach 86.0%, 96.3% and 73.6% respectively.

References

  1. Ji S, Xu W, Yang M, et al. 3D convolutional neural networks for human action recognition[J]. IEEE transactions on pattern analysis and machine intelligence, 2013, 35(1): 221--231.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. D. Abati, A. Porrello, S. Calderara, R. Cucchiara, Latent space autoregression for novelty detection, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019, pp. 481--490.Google ScholarGoogle Scholar
  3. N. Li, F. Chang and C. Liu(2021)Spatial-Temporal Cascade Autoencoder for Video Anomaly Detection in Crowded Scenes. in IEEE Transactions on Multimedia, vol. 23, pp. 203--215.Google ScholarGoogle Scholar
  4. W. Liu, W. Luo, D. Lian and S. Gao, "Future Frame Prediction for Anomaly Detection - A New Baseline," in Proc. IEEE Conf. Comput. Vision Pattern Recognit., 2018, pp. 6536--6545.Google ScholarGoogle Scholar
  5. M. Ravanbakhsh, M. Nabi, E. Sangineto, L. Marcenaro, C. Regazzoni and N. Sebe, "Abnormal event detection in videos using generative adversarial nets," in Proc IEEE Int. Conf. Image Process., 2017, pp. 1577--158.Google ScholarGoogle Scholar
  6. Dalal N, Triggs B. Histograms of Oriented Gradients for Human Detection[C]//Computer Vision and Pattern Recognition, 2005. IEEE Computer Society, 2005:886--893.Google ScholarGoogle Scholar
  7. Kratz L, Nishino K. Anomaly Detection in Extremely Crowded Scenes Using Spatio-Temporal Motion Pattern Models[C]//2009 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 2009: 1446--1453.Google ScholarGoogle Scholar
  8. Zhou X G, Zhang L Q. Abnormal Event Detection Using Recurrent Neural Network[C]//2015 International Conference on Computer Science and Applications (CSA). IEEE, 2015: 222--226.Google ScholarGoogle Scholar
  9. T.-N. Nguyen, J. Meunier, Anomaly detection in video sequence with appearance-motion correspondence, in: Proceedings of the IEEE International Conference on Computer Vision, 2019, pp. 1273--1283.Google ScholarGoogle Scholar
  10. J.R. Medel and A. Savakis, Anomaly detection in video using predictive convolutional long short-term memory networks, 2016, [online] Available: http://arxiv.org/abs/1612.00390.Google ScholarGoogle Scholar
  11. H. Song, C. Sun, X. Wu, M. Chen and Y. Jia, "Learning Normal Patterns via Adversarial Attention-Based Autoencoder for Abnormal Event Detection in Videos," IEEE Transactions on Multimedia, vol. 22, no. 8, pp. 2138--2148, Aug. 2020.Google ScholarGoogle ScholarCross RefCross Ref
  12. W. Luo, W. Liu, D. Lian, J. Tang, L. Duan, X. Peng, S. Gao, Video anomaly detection with sparse coding inspired deep neural networks, IEEE Transactions on Pattern Analysis and Machine Intelligence (99)(2019) 1--1.Google ScholarGoogle Scholar
  13. Y. Lu, K. M. Kumar, S. s. Nabavi and Y. Wang, "Future Frame Prediction Using Convolutional VRNN for Anomaly Detection," in Proc. IEEE International Conference on Advanced Video and Signal Based Surveillance, 2019, pp. 1--8.Google ScholarGoogle Scholar
  14. J. T. Zhou, L. Zhang, Z. Fang, J. Du, X. Peng and Y. Xiao (2020) Attention-Driven Loss for Anomaly Detection in Video Surveillance. IEEE Transactions on Circuits and Systems for Video Technology, vol.30, no. 12, pp. 4639--4647.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. D. Gong et al., "Memorizing Normality to Detect Anomaly: Memory-Augmented Deep Autoencoder for Unsupervised Anomaly Detection," in Proc. IEEE Int. Conf. Comput. Vision, 2019, pp. 1705--1714.Google ScholarGoogle Scholar
  16. W. Luo, W. Liu and S. Gao, "Remembering history with convolutional LSTM for anomaly detection," in Proc. IEEE International Conference on Multimedia and Expo, 2017, pp. 439--444.Google ScholarGoogle Scholar
  17. Yiheng Cai, Jiaqi Liu, Yajun Guo, Shaobin Hu, Shinan Lang, "Video anomaly detection with multi-scale feature and temporal information fusion," Neuro computing, vol. 432, pp. 264--273, 2021.Google ScholarGoogle Scholar
  18. Sklearn documentation, http://scikit-learn.org/stable/Google ScholarGoogle Scholar

Index Terms

  1. 3D U-Net for Video Anomaly Detection

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      EITCE '21: Proceedings of the 2021 5th International Conference on Electronic Information Technology and Computer Engineering
      October 2021
      1723 pages
      ISBN:9781450384322
      DOI:10.1145/3501409

      Copyright © 2021 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 31 December 2021

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited

      Acceptance Rates

      EITCE '21 Paper Acceptance Rate294of531submissions,55%Overall Acceptance Rate508of972submissions,52%
    • Article Metrics

      • Downloads (Last 12 months)24
      • Downloads (Last 6 weeks)6

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader