论文信息 - Wildfire Segmentation Using Deep Vision Transformers - 字舞流文

Wildfire Segmentation Using Deep Vision Transformers

In this paper, we address the problem of forest fires’ early detection and segmentation in order to predict their spread and help with fire fighting. Techniques based on Convolutional Networks are the most used and have proven to be efficient at solving such a problem. However, they remain limited in modeling the long-range relationship between objects in the image, due to the intrinsic locality of convolution operators. In order to overcome this drawback, Transformers, designed for sequence-to-sequence prediction, have emerged as alternative architectures. They have recently been used to determine the global dependencies between input and output sequences using the self-attention mechanism. In this context, we present in this work the very first study, which explores the potential of vision Transformers in the context of forest fire segmentation. Two vision-based Transformers are used, TransUNet and MedT. Thus, we design two frameworks based on the former image Transformers adapted to our complex, non-structured environment, which we evaluate using varying backbones and we optimize for forest fires’ segmentation. Extensive evaluations of both frameworks revealed a performance superior to current methods. The proposed approaches achieved a state-of-the-art performance with an F1-score of 97.7% for TransUNet architecture and 96.0% for MedT architecture. The analysis of the results showed that these models reduce fire pixels mis-classifications thanks to the extraction of both global and local features, which provide finer detection of the fire’s shape.

Moulay A. Akhloufi | Marwa Jmal | Wided Souidène Mseddi | Rabah Attia | Rafik Ghali | Rabah Attia | M. Akhloufi | Rafik Ghali | Marwa Jmal

[1] Michele Volpi,et al. Land cover mapping at very high resolution with rotation equivariant CNNs: towards small yet accurate models , 2018, ISPRS Journal of Photogrammetry and Remote Sensing.

[2] Francisco Herrera,et al. Object Detection Binary Classifiers methodology based on deep learning to identify small objects handled similarly: Application in video surveillance , 2020, Knowl. Based Syst..

[3] S. S. Vinsley,et al. Efficient Flame Detection Based on Static and Dynamic Texture Analysis in Forest Fire Detection , 2018 .

[4] Lin Cao,et al. A Forest Fire Detection System Based on Ensemble Learning , 2021, Forests.

[5] Forrest N. Iandola,et al. SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <1MB model size , 2016, ArXiv.

[6] Isabel F. Trigo,et al. A deep learning approach for mapping and dating burned areas using temporal sequences of satellite images , 2020 .

[7] Yan Wang,et al. TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation , 2021, ArXiv.

[8] Matthieu Cord,et al. Training data-efficient image transformers & distillation through attention , 2020, ICML.

[9] Sébastien Ourselin,et al. Generalised Dice overlap as a deep learning loss function for highly unbalanced segmentations , 2017, DLMIA/ML-CDS@MICCAI.

[10] S. Dimitropoulos. Fighting fire with science , 2019, Nature.

[11] Yueming Hu,et al. Deep Learning Segmentation and Classification for Urban Village Using a Worldview Satellite Image Based on U-Net , 2020, Remote. Sens..

[12] N. Koutsias,et al. Historical background and current developments for mapping burned area from satellite Earth observation , 2019, Remote Sensing of Environment.

[13] Dragica Radosav,et al. Deep Learning and Medical Diagnosis: A Review of Literature , 2018, Multimodal Technol. Interact..

[14] J. Randerson,et al. Global fire emissions estimates during 1997–2016 , 2017 .

[15] Zichen Zhang,et al. U2-Net: Going Deeper with Nested U-Structure for Salient Object Detection , 2020, Pattern Recognit..

[16] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[17] Yi Li,et al. R-FCN: Object Detection via Region-based Fully Convolutional Networks , 2016, NIPS.

[18] Luo Zhong,et al. An Encoder-Decoder Network Based FCN Architecture for Semantic Segmentation , 2020, Wirel. Commun. Mob. Comput..

[19] Hasan Demirel,et al. Fire detection in video sequences using a generic color model , 2009 .

[20] Faisal Saeed,et al. Realtime fire detection using CNN and search space navigation , 2021, Journal of Real-Time Image Processing.

[21] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[22] Vishal M. Patel,et al. Medical Transformer: Gated Axial-Attention for Medical Image Segmentation , 2021, MICCAI.

[23] Gary R. Watmough,et al. Evaluating the capabilities of Sentinel-2 for quantitative estimation of biophysical variables in vegetation , 2013 .

[24] Rune Hylsberg Jacobsen,et al. A cloud detection algorithm for satellite imagery based on deep learning , 2019, Remote Sensing of Environment.

[25] Moulay A. Akhloufi,et al. Computer vision for wildfire research: An evolving image dataset for processing and analysis , 2017 .

[26] Moulay A. Akhloufi,et al. Wildland fires detection and segmentation using deep learning , 2018, Defense + Security.

[27] Jay D. Miller,et al. Quantifying burn severity in a heterogeneous landscape with a relative version of the delta Normalized Burn Ratio (dNBR) , 2007 .

[28] Pu Li,et al. Image fire detection algorithms based on convolutional neural networks , 2020, Case Studies in Thermal Engineering.

[29] D. Roy,et al. The Collection 6 MODIS burned area mapping algorithm and product , 2018, Remote sensing of environment.

[30] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[31] Myle Ott,et al. Scaling Neural Machine Translation , 2018, WMT.

[32] Chang Zhou,et al. CogView: Mastering Text-to-Image Generation via Transformers , 2021, NeurIPS.

[33] Jian Wang,et al. Multi-feature fusion based fast video flame detection , 2010 .

[34] Biswajeet Pradhan,et al. Deep Learning Approaches Applied to Remote Sensing Datasets for Road Extraction: A State-Of-The-Art Review , 2020, Remote. Sens..

[35] Jinfei Wang,et al. A New Model for Transfer Learning-Based Mapping of Burn Severity , 2020, Remote. Sens..

[36] Arnisha Khondaker,et al. Computer vision-based early fire detection using enhanced chromatic segmentation and optical flow analysis technique , 2020, Int. Arab J. Inf. Technol..

[37] Georg Heigold,et al. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale , 2021, ICLR.

[38] Yusuf Huseyin Sahin,et al. EfficientSeg: An Efficient Semantic Segmentation Network , 2020, ArXiv.

[39] A. Bernardino,et al. Fire segmentation using a DeepLabv3+ architecture , 2020, Remote Sensing.

[40] Pavel Zemcík,et al. Fire Segmentation in Still Images , 2020, ACIVS.

[41] D. Tao,et al. A Survey on Visual Transformer , 2020, ArXiv.

[42] Natalia Gimelshein,et al. PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[43] Thomas Brox,et al. U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[44] Moulay A. Akhloufi,et al. Forest fire spread prediction using deep learning , 2021, Defense + Commercial Sensing.

[45] Abhishek Kumar Singh,et al. Video Flame and Smoke Based Fire Detection Algorithms: A Literature Review , 2020, Fire Technology.

[46] Fahad Shahbaz Khan,et al. Transformers in Vision: A Survey , 2021, ACM Comput. Surv..

[47] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[48] Vladimir Sergeevich Bochkov,et al. wUUNET: advanced fully convolutional neural network for multiclass fire segmentation , 2021, Symmetry.

[49] Nicolas Usunier,et al. End-to-End Object Detection with Transformers , 2020, ECCV.

[50] Alessandro Farasin,et al. Double-Step U-Net: A Deep Learning-Based Approach for the Estimation of Wildfire Damage Severity through Sentinel-2 Satellite Data , 2020, Applied Sciences.

[51] Yushi Chen,et al. Spatial-Spectral Transformer for Hyperspectral Image Classification , 2021, Remote. Sens..

[52] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.

[53] George Papandreou,et al. Rethinking Atrous Convolution for Semantic Image Segmentation , 2017, ArXiv.

[54] Carl H. Key,et al. Landscape Assessment (LA) , 2006 .

[55] T. Danaher,et al. A remote sensing approach to mapping fire severity in south-eastern Australia using sentinel 2 and random forest , 2020 .

[56] ByoungChul Ko,et al. Fire detection based on vision sensor and support vector machines , 2009 .

[57] Ali Farhadi,et al. YOLOv3: An Incremental Improvement , 2018, ArXiv.