CAM-based non-local attention network for weakly supervised fire detection

Wang, Wenjun; Lai, Lvlong; Chen, Jian; Wu, Qingyao

doi:10.1007/s11761-022-00336-6

Original Research Paper
Published: 10 June 2022

Service Oriented Computing and Applications, Volume 16, pages 133–142 (2022)

Wenjun Wang^1,2,3, Lvlong Lai^1,3, Jian Chen^1 & Qingyao Wu^1,4

Abstract

Many available object detectors, such as Faster R-CNN, SSD, and YOLO, are already used in fire detection to localize fire in images. Although these approaches perform well, they require object-level annotations for training, which must be labeled manually and are very expensive to obtain. In this paper, we propose a method based on the Class Activation Map (CAM) and non-local attention to explore Weakly Supervised Fire Detection (WSFD) given only image-level annotations. Specifically, we first train a deep neural network with non-local attention as a classifier that distinguishes fire images from non-fire images. Then, in the inference stage, we use the classifier to create a CAM for every fire image and generate a bounding box for each connected region of the CAM. To evaluate the effectiveness of our method, we construct a benchmark dataset named WS-FireNet and perform comprehensive experiments on it. The experimental results demonstrate that our approach is effective for fire detection under image-level supervision.
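The inference step described in the abstract (compute a CAM, then draw one box per connected region) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function names, the min-max normalization, the 0.5 threshold, and the 4-connectivity rule are all assumptions chosen for illustration, and the tiny hand-written feature maps stand in for the last convolutional features of a trained classifier. The CAM itself follows the standard definition, a sum of feature maps weighted by the classifier weights of the target class.

```python
from collections import deque


def class_activation_map(feature_maps, class_weights):
    """CAM(r, c) = sum_k w_k * f_k(r, c) over the K feature maps."""
    height, width = len(feature_maps[0]), len(feature_maps[0][0])
    cam = [[0.0] * width for _ in range(height)]
    for fmap, weight in zip(feature_maps, class_weights):
        for r in range(height):
            for c in range(width):
                cam[r][c] += weight * fmap[r][c]
    return cam


def normalize(cam):
    """Min-max scale the CAM to [0, 1] (an assumed preprocessing step)."""
    flat = [v for row in cam for v in row]
    lo, hi = min(flat), max(flat)
    span = (hi - lo) or 1.0  # avoid division by zero on a constant map
    return [[(v - lo) / span for v in row] for row in cam]


def boxes_from_cam(cam, threshold=0.5):
    """Return one (row_min, col_min, row_max, col_max) box per
    4-connected region whose activation is >= threshold."""
    height, width = len(cam), len(cam[0])
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for r0 in range(height):
        for c0 in range(width):
            if seen[r0][c0] or cam[r0][c0] < threshold:
                continue
            # Breadth-first flood fill over the region containing (r0, c0).
            seen[r0][c0] = True
            queue = deque([(r0, c0)])
            r_min = r_max = r0
            c_min = c_max = c0
            while queue:
                r, c = queue.popleft()
                r_min, r_max = min(r_min, r), max(r_max, r)
                c_min, c_max = min(c_min, c), max(c_max, c)
                for nr, nc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
                    if (0 <= nr < height and 0 <= nc < width
                            and not seen[nr][nc] and cam[nr][nc] >= threshold):
                        seen[nr][nc] = True
                        queue.append((nr, nc))
            boxes.append((r_min, c_min, r_max, c_max))
    return boxes


# Toy input: two 4x6 feature maps, each responding to a different region.
feature_maps = [
    [[0, 0, 0, 0, 0, 0],
     [0, 1, 1, 0, 0, 0],
     [0, 1, 1, 0, 0, 0],
     [0, 0, 0, 0, 0, 0]],
    [[0, 0, 0, 0, 0, 1],
     [0, 0, 0, 0, 0, 1],
     [0, 0, 0, 0, 0, 0],
     [0, 0, 0, 0, 0, 0]],
]
class_weights = [2.0, 1.5]  # hypothetical "fire" class weights

cam = normalize(class_activation_map(feature_maps, class_weights))
print(boxes_from_cam(cam, threshold=0.5))  # [(0, 5, 1, 5), (1, 1, 2, 2)]
```

In practice the CAM has a much lower resolution than the input image, so it would typically be upsampled to the image size before thresholding, and the box coordinates would be reported in image pixels. Raising the threshold trades recall for tighter boxes: with `threshold=0.9` only the stronger region in the toy example survives.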

Funding

This work was supported by the National Natural Science Foundation of China (Grant Nos. 61876208 and 62072186), the Guangdong Basic and Applied Basic Research Foundation (Grant No. 2019B1515130001), the Opening Project of Guangdong Key Laboratory of Big Data Analysis and Processing, and the Opening Project of Ministry of Education Key Laboratory of Big Data and Intelligent Robot (South China University of Technology) (No. 202105).

Author information

Wenjun Wang and Lvlong Lai have contributed equally to this work.

Authors and Affiliations

School of Software Engineering, South China University of Technology, Guangzhou, China
Wenjun Wang, Lvlong Lai, Jian Chen & Qingyao Wu
School of Data Science and Information Engineering, Guizhou Minzu University, Guiyang, China
Wenjun Wang
Key Laboratory of Big Data and Intelligent Robot, Ministry of Education, Guangzhou, China
Wenjun Wang & Lvlong Lai
Pazhou Lab, Guangzhou, China
Qingyao Wu

Contributions

Lai and Wang developed the proposed method and drafted the manuscript. Wu and Chen supervised the project, contributed to discussion and analysis, and provided important suggestions for the paper. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Jian Chen or Qingyao Wu.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file 1 (pdf 749 KB)

About this article

Cite this article

Wang, W., Lai, L., Chen, J. et al. CAM-based non-local attention network for weakly supervised fire detection. SOCA 16, 133–142 (2022). https://doi.org/10.1007/s11761-022-00336-6

Received: 28 March 2022
Accepted: 30 April 2022
Published: 10 June 2022
Issue Date: June 2022
DOI: https://doi.org/10.1007/s11761-022-00336-6
