Skip to main content

Appearance-Motion Fusion Network for Video Anomaly Detection

  • Conference paper
  • First Online:
Pattern Recognition and Computer Vision (PRCV 2021)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 13019))

Included in the following conference series:

  • 2410 Accesses

Abstract

Detection of abnormal events in surveillance video is an important and challenging task, which has received much research interest over the past few years. However, existing methods often only considered appearance information or simply integrated appearance and motion information without considering their underlying relationship. In this paper, we propose an unsupervised anomaly detection approach based on deep auto-encoder, which can effectively exploit the complementarity of both appearance and motion information. Two encoders are used to extract appearance features and motion features from RGB and RGB difference frames, respectively, and then a feature fusion module is employed to fuse appearance and motion features to produce discriminative feature representations of regular events. Finally, the fused features are sent to their corresponding decoders to predict future RGB and RGB differential frames for determining anomaly events according to reconstruction errors. Experiments and ablation studies on some public datasets demonstrate the effectiveness of our approach.

Supported by the Natural Science Foundation of China under grant 61772032.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Chang, Y., Tu, Z., Xie, W., Yuan, J.: Clustering driven deep autoencoder for video anomaly detection. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12360, pp. 329–345. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58555-6_20

    Chapter  Google Scholar 

  2. Dan, X.A., Yan, Y.D., Erb, C., Ns, A : Detecting anomalous events in videos by learning deep representations of appearance and motion. In: Computer Vision and Image Understanding, pp. 117–127. ScienceDirect (2017)

    Google Scholar 

  3. Gong, D., et al.: Memorizing normality to detect anomaly: memory-augmented deep autoencoder for unsupervised anomaly detection. In: 2019 IEEE International Conference on Computer Vision (ICCV) (2019)

    Google Scholar 

  4. Hasan, M., Choi, J., Neumann, J., Roy-Chowdhury, A.K., Davis, L.S.: Learning temporal regularity in video sequences. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016)

    Google Scholar 

  5. Jie, H., Li, S., Gang, S.: Squeeze-and-excitation networks. In: 2018 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2018)

    Google Scholar 

  6. Joze, H.R.V., Shaban, A., Iuzzolino, M.L., Koishida, K.: MMTM: multimodal transfer module for CNN fusion. In: 2020 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2020)

    Google Scholar 

  7. Li, W., Mahadevan, V., Vasconcelos, N.: Anomaly detection and localization in crowded scenes. IEEE Trans. Pattern Anal. Mach. Intell. 36, 18–32 (2013)

    Google Scholar 

  8. Liu, W., Luo, W., Lian, D., Gao, S.: Future frame prediction for anomaly detection - a new baseline. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017)

    Google Scholar 

  9. Lu, C., Shi, J., Jia, J.: Abnormal event detection at 150 FPS in MATLAB. In: 2014 IEEE International Conference on Computer Vision (ICCV) (2014)

    Google Scholar 

  10. Luo, W., Wen, L., Gao, S.: A revisit of sparse coding based anomaly detection in stacked RNN framework. In: 2017 IEEE International Conference on Computer Vision (ICCV) (2017)

    Google Scholar 

  11. Luo, W., Wen, L., Gao, S. Remembering history with convolutional LSTM for anomaly detection. In: 2017 IEEE International Conference on Multimedia and Expo (ICME) (2017)

    Google Scholar 

  12. Park, H., Noh, J., Ham, B.: Learning memory-guided normality for anomaly detection. In: 2020 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2020)

    Google Scholar 

  13. Ronneberger, O., Fischer, P., Brox, T.: U-Net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28

    Chapter  Google Scholar 

  14. Sabokrou, M., Fathy, M., Hoseini, M.: Video anomaly detection and localisation based on the sparsity and reconstruction error of auto-encoder. Electron. Lett. 52, 1122–1124 (2016)

    Article  Google Scholar 

  15. Song, Q.: Deep autoencoding gaussian mixture model for unsupervised anomaly detection. In: 2018 International Conference on Learning Representations (ICLR) (2018)

    Google Scholar 

  16. Yan, S., Smith, J.S., Lu, W., Zhang, B.: Abnormal event detection from videos using a two-stream recurrent variational autoencoder. IEEE Trans. Cogn. Dev. Syst. 12, 30–42 (2020)

    Article  Google Scholar 

  17. Yao, T., Lin, Z., Szab, C., Chen, G., Gla, B., Jian, Y.: Integrating prediction and reconstruction for anomaly detection. Pattern Recogn. Lett. 129, 123–130 (2020)

    Article  Google Scholar 

  18. Paffenroth, R.C., Chong, Z.: Anomaly detection with robust deep autoencoders. In: The 23rd ACM SIGKDD International Conference (2017)

    Google Scholar 

  19. Zhao, Y. Deng, B. Shen, C., Liu, Y., Lu, H., Hua, X.: Spatio-temporal autoencoder for video anomaly detection. In: 2017 ACM International Conference on Multimedia (ACM MM) (2017)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Li, S., Xu, S., Tang, J. (2021). Appearance-Motion Fusion Network for Video Anomaly Detection. In: Ma, H., et al. Pattern Recognition and Computer Vision. PRCV 2021. Lecture Notes in Computer Science(), vol 13019. Springer, Cham. https://doi.org/10.1007/978-3-030-88004-0_43

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-88004-0_43

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-88003-3

  • Online ISBN: 978-3-030-88004-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics