A high-precision forest fire smoke detection approach based on ARGNet

https://doi.org/10.1016/j.compag.2022.106874

Highlights

  • A novel object detection network model is proposed for forest fire smoke detection.

  • Build a UAV-IoT system to capture remote sensing images to monitor forest fires.

  • The proposed method can achieve precise positioning of forest fire smoke.

Abstract

The occurrence of forest fires can lead to ecological damage, property loss, and human casualties. Current forest fire smoke detection methods do not sufficiently consider the characteristics of smoke, which is highly transparent and has no clear edges, and their detection accuracy is low, so they cannot meet the needs of complex aerial forest fire smoke detection tasks. In this paper, we propose an Adjacent layer composite network based on a recursive feature pyramid with deconvolution and dilated convolution and global optimal non-maximum suppression (ARGNet) for high-accuracy detection of forest fire smoke. First, the Adjacent layer composite network is proposed to enhance the extraction of smoke features that are highly transparent and have no clear edges, and the SoftPool in it is used to retain more feature information of smoke. Then, a recursive feature pyramid with deconvolution and dilated convolution (RDDFPN) is proposed to fuse shallow visual features and deep semantic information in the channel dimension to improve the accuracy of long-range aerial smoke detection. Finally, global optimal non-maximum suppression (GO-NMS) sets an objective function to globally optimize the selection of anchor boxes, adapting to aerial photography of multiple smoke locations in forest fire scenes. The experimental results show that on the UAV-IoT platform the number of ARGNet parameters is as low as 53.48 M, mAP reaches 79.03%, mAP50 reaches 90.26%, mAP75 reaches 82.35%, FPS reaches 122.5, and GFLOPs reach 55.78. Compared with other mainstream methods, it has the advantages of real-time detection and high accuracy.

Introduction

Forest fires are hazardous and destructive disasters that often cause economic and ecological damage. In recent years, forest fires have been frequent, and governments have incurred enormous management expenditures to cope with sudden forest fires (Stocks and Martell, 2016). Because forest burning leaves forestland bare, forests lose their function of water retention and soil conservation, which further causes other natural disasters, such as floods, droughts, mudslides, landslides, and dust storms (Ren et al., 2011). After forest fires occur, local ecological conditions such as weather, water and soil are thrown out of balance and often take decades to centuries to recover (Bakirci, 2010). At the same time, forest fires can burn rare wildlife or destroy their habitat, posing a challenge to the biodiversity of the Earth's ecology (Suthar and Bhavsar, 2021). In recent years, the prevention and monitoring of forest fires have become a research hotspot for forest fire prevention departments around the world. Detection methods for forest fires are divided into smoke detection and flame detection. Because of the many weeds and trees in the forest environment, flames are easily blocked in the early stages of a fire. When forest fires occur, they often produce a large amount of smoke. Smoke spreads over time, which makes it easier to detect. Therefore, smoke detection is an important tool for forest fire early warning and plays an important role in forest fire monitoring, prevention and control. Obtaining clearly visible images of smoke in the forest and automatically determining where the smoke is located are important for strengthening forest fire prevention and control and speeding up the firefighting response. To obtain clear and visible images of smoke in real time, it is essential to choose the right monitoring tool. Currently, the main tools that can acquire images of smoke in forest fires are observation towers, satellites and drones. The monitoring range of observation towers is limited, they have many blind spots, and their coverage is inadequate. Satellite monitoring has low spatial resolution, is susceptible to weather, cloud cover and orbital cycles, and performs poorly in real time. In contrast, the simple structure, high flight flexibility and low acquisition costs of UAVs make them suitable as image acquisition tools for real-time forest fire monitoring.

However, several pressing issues still need to be addressed in the task of detecting forest fire smoke using UAVs. 1) Forest fires occur in a wide variety of scenes, which may contain many smoke-like objects, such as exposed grey-white rock and backgrounds of similar colour, whose semantic information in the images resembles that of smoke. Conventional feature extraction networks extract too little feature information to distinguish the feature differences among them. 2) Because UAVs must carry out actual aerial inspection, the UAV camera is often kept far from the fire source for safety reasons when capturing forest fire information, and conventional methods do not perform well in long-distance smoke detection. 3) The image captured by a UAV is a top view, and when the UAV is at a high altitude, the captured image may contain multiple forest fires. The conventional matrix NMS method filters each location locally without combining global information, which may cause large deviations in the predicted anchor box locations for UAV-captured forest fire smoke images. It may also cause missed and false detections.

To distinguish the feature differences between smoke and smoke-like objects, Cao et al. proposed an attention-enhanced bidirectional long short-term memory network that explores the spatial and temporal features of image sequences to capture the feature differences between smoke and smoke-like objects (Cao et al., 2019). Hu et al. proposed the Value conversion-Attention mechanism module, which extracts deep colour and texture features of smoke by setting a joint horizontal and vertical weighting strategy to distinguish between smoke and smoke-like objects (Hu et al., 2022). While these methods are simple and effective, they are all designed for ground-based monitoring of smoke. In the overhead views taken by UAVs, the appearance of smoke differs, which poses a difficulty for these methods. To adapt to the characteristics of smoke under overhead UAV photography and to distinguish smoke from smoke-like objects, we propose the Adjacent layer composite network (ALCN), which places two identical ResNet50-vd models in parallel in the feature extraction network. The first ResNet50-vd extracts features and transmits high-level semantic information to the second ResNet50-vd, where it is fused with the low-level semantic information of the second ResNet50-vd to enhance the extraction of the highly transparent, weakly edged features of smoke. In addition, the original MaxPool is replaced with SoftPool, thus preserving more feature information of the smoke image during downsampling.
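
As an illustration of the pooling replacement described above, the following is a minimal SoftPool sketch in PyTorch using the standard softmax-weighted formulation. The kernel size, stride, and avg_pool2d-based implementation are illustrative assumptions rather than the exact configuration used in ARGNet.

```python
# A minimal SoftPool sketch: each activation contributes to the pooled value in
# proportion to exp(x), so strong responses dominate while weaker ones are still
# retained (unlike MaxPool, which discards everything but the maximum).
import torch
import torch.nn.functional as F

def soft_pool2d(x: torch.Tensor, kernel_size: int = 2, stride: int = 2) -> torch.Tensor:
    weights = torch.exp(x)
    pooled = F.avg_pool2d(x * weights, kernel_size, stride)  # mean of x * exp(x) per window
    norm = F.avg_pool2d(weights, kernel_size, stride)        # mean of exp(x) per window
    return pooled / (norm + 1e-6)                            # = sum(x * exp(x)) / sum(exp(x))

# Example: downsample a feature map by 2x while keeping soft contributions
# from every activation in each pooling window.
feat = torch.randn(1, 64, 56, 56)
print(soft_pool2d(feat).shape)  # torch.Size([1, 64, 28, 28])
```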

To improve the detection of smoke at long distances, Jiao et al. proposed an improved UAV aerial forest fire detection method based on YOLOv3, which reduces the downsampling rate to reduce the feature loss of small objects (Jiao et al., 2019). Zhao et al. proposed small-sample smoke detection methods based on target awareness and depthwise convolutions to improve the perception of long-range smoke (Zhao et al., 2021). Although these methods were designed for long-range smoke and achieved certain results, their overall performance needs improvement: they are constrained by the perspective of the convolution kernel and, to some extent, sacrifice close- and medium-range smoke detection. To avoid degrading smoke detection at close and medium distances while improving detection at long distances, we propose the recursive feature pyramid with deconvolution and dilated convolution (RDDFPN). First, edges containing contextual information are added to the FPN module, and deconvolution is used for upsampling to keep the original smoke feature information consistent. Then, dilated convolution is used to downsample and expand the receptive field. Finally, the feature maps processed by RDDFPN are fed back into the backbone for recursive secondary processing to enhance feature fusion, and the loss information returned from object detection is fed back more directly to adjust the backbone parameters, enhancing long-range smoke detection.
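
As a rough illustration of the two operations named above, the sketch below shows a deconvolution-based 2x upsampling block and a dilated-convolution downsampling block in PyTorch. The channel counts, kernel sizes and dilation rate are illustrative assumptions, and the recursive re-entry into the backbone is not reproduced here.

```python
# Minimal building blocks of the kind RDDFPN combines: learnable upsampling via
# transposed convolution and downsampling with an enlarged receptive field via dilation.
import torch
import torch.nn as nn

class DeconvUpsample(nn.Module):
    """Upsample a deep feature map by 2x with a learnable transposed convolution."""
    def __init__(self, in_ch: int, out_ch: int):
        super().__init__()
        self.deconv = nn.ConvTranspose2d(in_ch, out_ch, kernel_size=4, stride=2, padding=1)

    def forward(self, x):
        return self.deconv(x)

class DilatedDownsample(nn.Module):
    """Downsample by 2x while expanding the receptive field through dilation."""
    def __init__(self, in_ch: int, out_ch: int, dilation: int = 2):
        super().__init__()
        self.conv = nn.Conv2d(in_ch, out_ch, kernel_size=3, stride=2,
                              padding=dilation, dilation=dilation)

    def forward(self, x):
        return self.conv(x)

# Fuse a deeper level (C5) into a shallower level (C4), as in a top-down pyramid pathway,
# then downsample the fused map again with the dilated block.
c5 = torch.randn(1, 256, 16, 16)
c4 = torch.randn(1, 256, 32, 32)
p4 = c4 + DeconvUpsample(256, 256)(c5)     # lateral fusion after deconv upsampling
d4 = DilatedDownsample(256, 256)(p4)       # dilated downsampling back to 16x16
print(p4.shape, d4.shape)                  # (1, 256, 32, 32) (1, 256, 16, 16)
```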

To reduce missed and false detections of multiple objects in an image, Huang et al. proposed the R2NMS method, which effectively removes redundant boxes without introducing large-scale false detections by using the less occluded visible parts (Huang et al., 2020). Yan et al. improved the NMS method of Faster RCNN by raising the hard threshold of NMS with linear weighting to select anchor boxes in a weighted NMS (Yan et al., 2021). These methods improve the effectiveness of NMS to some extent when multiple objects are present in an image, but neither is designed for the diffuse nature of forest fire smoke in UAV aerial photography, and both designs tend to incorrectly filter out diffuse smoke, leading to detection errors. To reduce the missed and false detection of smoke in UAV aerial images, we propose global optimal non-maximum suppression (GO-NMS) based on the characteristics of smoke under UAV aerial photography. This method adopts a strategy of globally selecting anchor boxes: it sets an objective function and iterates so that the result approaches the minimum value of the objective function to find the optimal solution. Such an image postprocessing method adapts to the overhead viewing angle and improves the localization accuracy of forest fire smoke detection from UAV aerial photography, thereby reducing missed and false detections.
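
For reference, the sketch below implements the conventional greedy, per-box IoU filtering that GO-NMS is designed to replace; it is written in PyTorch with an assumed box format of (x1, y1, x2, y2). GO-NMS itself instead optimizes a global objective over all candidate anchor boxes by iteration, which is not reproduced here.

```python
# Conventional greedy NMS: boxes are kept one local score maximum at a time, and any
# box overlapping the winner above a threshold is discarded, without global information.
import torch

def greedy_nms(boxes: torch.Tensor, scores: torch.Tensor, iou_thr: float = 0.5):
    """boxes: (N, 4) as (x1, y1, x2, y2); scores: (N,). Returns indices of kept boxes."""
    order = scores.argsort(descending=True)
    keep = []
    while order.numel() > 0:
        i = order[0].item()
        keep.append(i)
        if order.numel() == 1:
            break
        rest = order[1:]
        # Intersection-over-union between the current best box and the remaining boxes.
        x1 = torch.maximum(boxes[i, 0], boxes[rest, 0])
        y1 = torch.maximum(boxes[i, 1], boxes[rest, 1])
        x2 = torch.minimum(boxes[i, 2], boxes[rest, 2])
        y2 = torch.minimum(boxes[i, 3], boxes[rest, 3])
        inter = (x2 - x1).clamp(min=0) * (y2 - y1).clamp(min=0)
        area_i = (boxes[i, 2] - boxes[i, 0]) * (boxes[i, 3] - boxes[i, 1])
        area_r = (boxes[rest, 2] - boxes[rest, 0]) * (boxes[rest, 3] - boxes[rest, 1])
        iou = inter / (area_i + area_r - inter)
        order = rest[iou <= iou_thr]  # keep only boxes that overlap little with the winner
    return torch.tensor(keep, dtype=torch.long)

# Example: two heavily overlapping smoke boxes and one distant box.
boxes = torch.tensor([[0., 0., 10., 10.], [1., 1., 11., 11.], [50., 50., 60., 60.]])
scores = torch.tensor([0.9, 0.8, 0.7])
print(greedy_nms(boxes, scores))  # tensor([0, 2])
```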

The contributions of this paper are summarised as follows.

1) ALCN is proposed to enhance the extraction of the highly transparent, weakly edged features of smoke and thus distinguish it from smoke-like objects. The SoftPool layer in it retains more feature information during feature extraction.

2) RDDFPN is proposed to enhance the feature extraction capability of the network, improve the long-range smoke detection capability and effectively fuse the rich feature information extracted by ALCN.

3) GO-NMS is proposed to set an objective function from a global perspective and select the optimal anchor boxes through multiple rounds of iteration, improving the detection of multiple smoke locations under UAV aerial photography and effectively reducing missed and false detections.

Section snippets

Related work

When forest fires occur, a large amount of smoke is often produced; the smoke is diffuse and covers a wider area than a flame, so smoke detection is the main task in the detection of forest fires (Smith and Dragicevic, 2018). Smoke detection methods fall into three main categories: (1) manual or sensor-based detection methods; (2) image processing methods, including traditional image processing algorithms and deep learning methods; and (3) deep learning combined with UAV methods: UAVs in the sky

Dataset acquisition

The dataset collected in this paper consists of three major parts: a self-cropped ground dataset, a self-cropped UAV aerial photography dataset, and a self-collected UAV aerial photography dataset.

Part 1: For the task of forest fire smoke monitoring, there is no recognized open-source standard dataset for pixel-level smoke object detection. Currently, only a few datasets are publicly available, such as (1) the publicly available dataset from the Computer Vision and Pattern Recognition

Results

This section experimentally verifies the superiority of ARGNet for the UAV aerial photography forest fire smoke detection task. It consists of evaluation metrics, experimental environment and settings, ARGNet performance analysis, analysis of method effects, comparison between different models, ablation experiments, comparison of visualization results and practical application testing.

Discussion

We produced an object detection dataset for aerial photography of forest fire smoke images by UAVs, which covers a variety of features of smoke captured by UAVs at different distances when forest fires occur. Through the comparison and analysis of several sets of experiments, we verify the effectiveness of the proposed ARGNet for aerial forest fire smoke detection. In particular, it better solves the three major problems of confusing smoke with smoke-like objects, low accuracy of long-distance

Conclusion and outlook

With the increase in the value of forestry resources, preventing and controlling forest fires has become increasingly important and challenging. Strengthening forest fire smoke detection and fire prevention management is of great significance to enhance the security of forestry ecological safety. In the research of this paper, we build a UAV-IoT system to facilitate the transmission of forest fire scene images captured by UAVs to the server side for object detection. To improve the accuracy and

Funding

This work was supported by the Changsha Municipal Natural Science Foundation (Grant No. kq2014160); in part by the National Natural Science Foundation of China (Grant No. 61703441); in part by the Key Projects of the Department of Education of Hunan Province (Grant No. 19A511); in part by the Hunan Key Laboratory of Intelligent Logistics Technology (2019TP1015); and in part by the Hunan Provincial Natural Science Foundation (Grant No. 2021JJ31164).

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available because some of the authors did not agree to public release.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

We are grateful to all members of the Forestry Information Research Centre for their advice and assistance in the course of this research. The language of our manuscript has been refined and polished by Elsevier Language Editing Services (Serial number: LE-221159-9DA3850979AC).

References (53)

  • Y. Zhao et al., Fire smoke detection based on target-awareness and depthwise convolutions, Multimedia Tools Appl. (2021)

  • X. Huang et al., NMS by representative region: Towards crowded pedestrian detection by proposal pairing

  • H. Yan et al., A new face detection method based on Faster RCNN, J. Phys.: Conf. Ser. (2021)

  • A.K. Smith, S. Dragicevic, An agent-based model to represent space-time propagation of forest-fire smoke... (2018)

  • H. Sun et al., A joint source channel adaptive communication system design for the fire environment, in: 2017 Chinese Automation Congress (CAC) (2017)

  • J. Fonollosa et al., Chemical sensor systems and associated algorithms for fire detection: A review, Sensors (2018)

  • C.-C. Ho, Machine vision-based real-time early flame and smoke detection, Measur. Sci. Technol. (2009)

  • T. Chen, Y. Yin, S. Huang, et al., The smoke detection for early fire-alarming system base on video...

  • B.U. Töreyin, Y. Dedeoğlu, A.E. Cetin, Wavelet based real-time smoke detection in video, in: 2005 13th...

  • C.Y. Lee, C.T. Lin, C.T. Hong, Spatio-temporal analysis in smoke detection, in: 2009 IEEE International...

  • K. Simonyan, A. Zisserman, Very deep convolutional networks for large-scale image recognition, arXiv preprint... (2014)

  • F. Chollet, Xception: Deep learning with depthwise separable convolutions

  • K. He, X. Zhang, S. Ren, J. Sun, Deep residual learning for image recognition, in: IEEE Conference on Computer... (2016)

  • G. Huang et al., Densely connected convolutional networks

  • J. Sharma et al.

Jialei Zhan and Yaowen Hu contributed equally to this work.
