Abstract
Many available object detectors are already used in fire detection, such as Faster RCNN, SSD, YOLO, etc., to localize the fire in images. Although these approaches perform well, they require object-level annotations for training, which are manually labeled and very expensive. In this paper, we propose a method based on the Class Activation Map (CAM) and non-local attention to explore the Weakly Supervised Fire Detection (WSFD) given only image-level annotations. Specifically, we first train a deep neural network with non-local attention as the classifier for identifying fire and non-fire images. Then, we use the classifier to create a CAM for every fire image in the inference stage and finally generate a corresponding bounding box according to each connected domain of the CAM. To evaluate the availability of our method, a benchmark dataset named WS-FireNet is constructed, and comprehensive experiments are performed on the WS-FireNet dataset. The experimental results demonstrate that our approach is effective in image-level supervised fire detection.
Similar content being viewed by others
References
Liau H, Yamini N, Wong Y (2018) Fire ssd: Wide fire modules based single shot detector on edge device. arXiv preprint arXiv:1806.05363
Kang L.-W, Wang I.-S, Chou K.-L, Chen S.-Y, Chang C.-Y (2019) Image-based real-time fire detection using deep learning with data augmentation for vision-based surveillance applications. In: 2019 16th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), pp. 1–4 . IEEE
Barmpoutis P, Dimitropoulos K, Kaza K, Grammalidis N (2019) Fire detection from images using faster r-cnn and multidimensional texture analysis. In: ICASSP 2019-2019 IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 8301–8305, IEEE
Li P, Zhao W (2020) Image fire detection algorithms based on convolutional neural networks. Case Stud Thermal Eng 19:100625
Goyal S, Shagill M, Kaur A, Vohra H, Singh A (2020) A yolo based technique for early forest fire detection. Int J Innov Technol Explor Eng (IJITEE) 9:1357–1362
Qin Y-Y, Cao J-T, Ji X-F (2021) Fire detection method based on depthwise separable convolution and yolov3. Int J Autom Comput 18(2):300–310
Krizhevsky A, Sutskever I, Hinton G.E (2012) Imagenet classification with deep convolutional neural networks. Adv Neural Inf Process Syst 25:1097–1105
Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3431–3440
Su Y, Lin G, Zhu J, Wu, Q (2020) Human interaction learning on 3d skeleton point clouds for video violence recognition. In: European conference on computer vision, Springer, pp 74–90
Dosovitskiy A, Beyer L, Kolesnikov A, Weissenborn D, Zhai X, Unterthiner T0, Dehghani M, Minderer M, Heigold G, Gelly S, et al (2020) An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929
Smahi MI, Hadjila F, Tibermacine C, Benamar A (2021) A deep learning approach for collaborative prediction of web service Qos. SOCA 15(1):5–20
Su Y, Sun R, Lin G, Wu Q (2021) Context decoupling augmentation for weakly supervised semantic segmentation. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 7004–7014
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Su Y, Lin G, Wu Q (2021) Self-supervised 3d skeleton action representation learning with motion consistency and continuity. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 13328–13338
Chen L-C, Zhu Y, Papandreou G, Schroff F, Adam H (2018) Encoder-decoder with atrous separable convolution for semantic image segmentation. In: Proceedings of the European conference on computer vision (ECCV), pp 801–818
Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: Towards real-time object detection with region proposal networks. Adv Neural Inf Process Syst 28:91–99
Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu C-Y, Berg AC (2016) Ssd: Single shot multibox detector. In: European conference on computer vision, Springer, pp 21–37
Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 779–788
Redmon J, Farhadi A (2017) Yolo9000: better, faster, stronger. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7263–7271
Redmon J, Farhadi A (2018) Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767
Bochkovskiy A, Wang C-Y, Liao H-YM (2020) Yolov4: optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934
Everingham M, Van Gool L, Williams CK, Winn J, Zisserman A (2010) The pascal visual object classes (voc) challenge. Int J Comput Vision 88(2):303–338
Bilen H, Vedaldi A (2016) Weakly supervised deep detection networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2846–2854
Tang P, Wang X, Bai X, Liu W (2017) Multiple instance detection network with online instance classifier refinement. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2843–2851
Tang P, Wang X, Bai S, Shen W, Bai X, Liu W, Yuille A (2018) Pcl: Proposal cluster learning for weakly supervised object detection. IEEE Trans Pattern Anal Mach Intell 42(1):176–191
Wei Y, Shen Z, Cheng B, Shi H, Xiong J, Feng J, Huang T (2018) Ts2c: Tight box mining with surrounding segmentation context for weakly supervised object detection. In: Proceedings of the European conference on computer vision (ECCV), pp 434–450
Zeng Z, Liu B, Fu J, Chao H, Zhang L (2019) Wsod2: Learning bottom-up and top-down objectness distillation for weakly-supervised object detection. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 8292–8300
Li X, Kan M, Shan S, Chen X (2019) Weakly supervised object detection with segmentation collaboration. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 9735–9744
Ren Z, Yu Z, Yang X, Liu M-Y, Lee YJ, Schwing AG, Kautz J (2020) Instance-aware, context-focused, and memory-efficient weakly supervised object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 10598–10607
Huang Z, Zou Y, Kumar B, Huang D (2020) Comprehensive attention self-distillation for weakly-supervised object detection. Adv Neural Inf Process Syst 33:16797–16807
Uijlings JR, Van De Sande KE, Gevers T, Smeulders AW (2013) Selective search for object recognition. Int J Comput Vision 104(2):154–171
Zitnick CL, Dollár P (2014) Edge boxes: locating object proposals from edges. In: European conference on computer vision, Springer, pp 391–405
Zhou B, Khosla A, Lapedriza A, Oliva A, Torralba, A (2016) Learning deep features for discriminative localization. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2921–2929
Wang X, Girshick R, Gupta A, He K (2018) Non-local neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7794–7803
Lai L, Chen J, Huang H, Wu Q (2021) Exploring a cam-based approach for weakly supervised fire detection task
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 580–587
Girshick R (2015) Fast r-cnn. In: Proceedings of the IEEE international conference on computer vision, pp 1440–1448
Namozov A, Im Cho Y (2018) An efficient deep learning algorithm for fire and smoke detection with limited data. Adv Electr Comput Eng 18(4):121–128
Sadewa RP, Irawan B, Setianingsih C (2019) Fire detection using image processing techniques with convolutional neural networks. In: 2019 international seminar on research of information technology and intelligent systems (ISRITI), IEEE, pp 290–295
Jadon A, Omama M, Varshney A, Ansari MS, Sharma R (2019) Firenet: a specialized lightweight fire & smoke detection model for real-time iot applications. arXiv preprint arXiv:1905.11922
Valikhujaev Y, Abdusalomov A, Cho YI (2020) Automatic fire and smoke detection method for surveillance systems based on dilated cnns. Atmosphere 11(11):1241
Huang R, Pedoeem J, Chen C (2018) Yolo-lite: a real-time object detection algorithm optimized for non-gpu computers. In: 2018 IEEE international conference on big data (Big Data), IEEE, pp 2503–2510
Dai J, Li Y, He K, Sun J (2016) R-fcn: object detection via region-based fully convolutional networks. Adv Neural Inf Process Syst 29
Celik T (2010) Fast and efficient method for fire detection using image processing. ETRI J 32(6):881–890
Harkat H, Nascimento J, Bernardino A (2020) Fire segmentation using a deeplabv3+ architecture. In: Image and signal processing for remote sensing XXVI, vol 11533. International Society for Optics and Photonics, p 115330
Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7132–7141
Wang Q, Wu B, Zhu P, Li P, Hu Q (2020) Eca-net: Efficient channel attention for deep convolutional neural networks. In: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
Wang F, Jiang M, Qian C, Yang S, Li C, Zhang H, Wang X, Tang X (2017) Residual attention network for image classification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3156–3164
Hu, J., Shen, L., Albanie, S., Sun, G., Vedaldi, A (2018) Gather-excite: Exploiting feature context in convolutional neural networks. Adv Neural Inf Process Syst 31:9401–9411
Woo S, Park J, Lee J-Y, Kweon IS (2018) Cbam: convolutional block attention module. In: Proceedings of the European conference on computer vision (ECCV), pp 3–19
Park J, Woo S, Lee J-Y, Kweon I-S (2018) Bam: Bottleneck attention module. In: British Machine Vision Conference (BMVC). British Machine Vision Association (BMVA)
Roy AG, Navab N, Wachinger C (2018) Concurrent spatial and channel ’squeeze & excitation’in fully convolutional networks. In: International conference on medical image computing and computer-assisted intervention, Springer, pp 421–429
Santoro A, Raposo D, Barrett DG, Malinowski M, Pascanu, R, Battaglia P, Lillicrap T (2017) A simple neural network module for relational reasoning. Adv Neural Inf Process Syst 30:4967–4976
Nair V, Hinton GE (2010) Rectified linear units improve restricted boltzmann machines. In: Icml
Fire-Detection-Image-Dataset. https://github.com/cair/Fire-Detection-Image-Dataset
Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M (2015) Imagenet large scale visual recognition challenge. Int J Comput Vision 115(3):211–252
Everingham M, Eslami S, Van Gool L, Williams CK, Winn J, Zisserman A (2015) The pascal visual object classes challenge: a retrospective. Int J Comput Vision 111(1):98–136
Lin T-Y, Maire M, Belongie S, Hays J, Perona P, Ramanan D, Dollár P, Zitnick CL (2014) Microsoft coco: Common objects in context. In: European conference on computer vision, Springer, pp 740–755
Funding
This work was supported by the National Natural Science Foundation of China (Grant No. 61876208 and 62072186), the Guangdong Basic and Applied Basic Research Foundation (Grant No. 2019B1515130001), the Opening Project of Guangdong Key Laboratory of Big Data Analysis and Processing, and the Opening Project of Ministry of Education Key Laboratory of Big Data and Intelligent Robot (South China University of Technology) (No. 202105).
Author information
Authors and Affiliations
Contributions
Lai and Wang developed the proposed method and drafted the manuscript. Wu and Chen supervised the project, contributed to discussion and analysis, and provided important suggestions for the paper. All authors read and approved the final manuscript.
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare no conflict of interest
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Wang, W., Lai, L., Chen, J. et al. CAM-based non-local attention network for weakly supervised fire detection. SOCA 16, 133–142 (2022). https://doi.org/10.1007/s11761-022-00336-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11761-022-00336-6