skip to main content
10.1145/3489088.3489126acmotherconferencesArticle/Chapter ViewAbstractPublication Pagesic3inaConference Proceedingsconference-collections
research-article

An Empirical Study of CNN-LSTM on Class Imbalance Datasets for Violence Video Detection

Published:13 February 2022Publication History

ABSTRACT

Violence detection has become an important topic of video surveillance in the last decade. Some studies in violence video detection demonstrated that learned features from Convolution Neural Network (CNN) gives high accuracy compared to handcrafted features. For this reason, we evaluate several CNN architectures to detect violence action in video. This work compares five pretrained networks VGG16, VGG19, ResNet50, Inception V3, and Xception. Then, the extracted features from each frame are forwarded to a long short-term memory (LSTM) network. We evaluate the pretrained networks on class imbalance datasets since violence video detection might suffer from class imbalance. Two public datasets are being used to evaluate the model; hockey fight dataset and violent crowd dataset. Our experiment results show that InceptionV3 achieved better performance in most cases.

References

  1. Mujtaba Asad, Jie Yang, Jiang He, Pourya Shamsolmoali, and Xiangjian He. 2021. Multi-frame feature-fusion-based model for violence detection. The Visual Computer 37(2021), 1415–1431.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Hugo Calderon-Vilca, Kent Cuadros Ramos, Elmer Diaz Quiroz, Jorge Angeles Rojas, René Calderon Vilca, and Alejandro Apaza Tarqui. 2021. The Best Model of Convolutional Neural Networks Combined with LSTM for the Detection of Interpersonal Physical Violence in Videos. In 29th Conference of Open Innovations Association (FRUCT).Google ScholarGoogle Scholar
  3. Qi Fan, Zhe Wang, Dongdong Li, Daqi Gao, and Hongyuan Zha. 2017. Entropy-based fuzzy support vector machine for imbalanced datasets. Knowledge-Based Systems 115 (2017), 87–99.Google ScholarGoogle ScholarCross RefCross Ref
  4. Mariana-Iuliana Georgescu, Radu Tudor Ionescu, and Radu Tudor Ionescu. 2019. Local Learning with Deep and Handcrafted Features for Facial Expression Recognition. IEEE Access 37(2019), 64827–64836.Google ScholarGoogle ScholarCross RefCross Ref
  5. Tal Hassner, Yossi Itcher, and Orit Kliper-Gross. 2012. Violent flows: Real-time detection of violent crowd behavior. In the 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).Google ScholarGoogle ScholarCross RefCross Ref
  6. Amira Ben Mabrouk and Ezzeddine Zagrouba. 2018. Abnormal behavior recognition for intelligent video surveillance systems : a review. Expert Systems With Applications 91 (2018), 480–491.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Loris Nanni, Stefano Ghidoni, and Sheryl Brahnam. 2017. Handcrafted vs Non-Handcrafted Features for computer vision classification. Pattern Recognition 71(2017), 158–172.Google ScholarGoogle ScholarCross RefCross Ref
  8. Enrique Bermejo Nievas, Oscar Deniz Suarez, Gloria Bueno García, and Rahul Sukthankar. 2011. Violence Detection in Video Using Computer Vision Techniques. In International Conference on Computer Analysis of Images and Patterns.Google ScholarGoogle Scholar
  9. Francisco A. Pujol, Higinio Mora, and Maria Luisa Pertegal. 2019. A soft computing approach to violence detection in social media for smart cities. Soft Computing 24(2019), 11007–11017.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Fath U Min Ullah, Amin Ullah, Khan Muhammad, Ijaz Ul Haq, and Sung Wook Baik. 2019. Violence Detection Using Spatiotemporal Features with 3D Convolutional Neural Network. sensors 19, 11 (2019), 1–15.Google ScholarGoogle Scholar
  11. Ting Xiao, Lei Liu, Kai Li, Wenjian Qin, Shaode Yu, and Zhicheng Li. 2018. Comparison of Transferred Deep Neural Networks in Ultrasonic Breast Masses Discrimination. BioMed Research International 2018 (2018), 1–9.Google ScholarGoogle Scholar
  12. Tao Zhang, Wenjing Jia, Baoqing Yang, Jie Yang, Xiangjian He, and Zhonglong Zheng. 2017. MoWLD: a robust motion image descriptor for violence detection. Multimed Tools Appl 76(2017), 1419–1438.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Peipei Zhou, Qinghai Ding, Haibo Luo, and Xinglin Hou. 2018. Violence detection in surveillance video using low-level features. PLOS ONE 13(2018), 1–15.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. An Empirical Study of CNN-LSTM on Class Imbalance Datasets for Violence Video Detection
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          IC3INA '21: Proceedings of the 2021 International Conference on Computer, Control, Informatics and Its Applications
          October 2021
          204 pages
          ISBN:9781450385244
          DOI:10.1145/3489088

          Copyright © 2021 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 13 February 2022

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed limited
        • Article Metrics

          • Downloads (Last 12 months)13
          • Downloads (Last 6 weeks)3

          Other Metrics

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        HTML Format

        View this article in HTML Format .

        View HTML Format