Comparison of two semantic segmentation databases for smoke detection

Researchers have found strong correlation between warm summer and the frequency and intensity of fires around the world. Climate models due to global warming tells us that average summer temperature will increase drastically in the next few decades entailing an increase of wildfire. Computer vision is a good tools to detect and locate an incipient fire and prevent a rapid spread of fire destroying huge forest areas as in Australia or Brazil. Smoke is the first clue of an incipient fire that can be detected by a camera to warn firemen to act as quickly as possible. Convolutional neural networks and semantic segmentation can achieve this task by giving location and scale of the fire to firemen. In order to efficiently train this type of network architectures, we need a database composed of many images and corresponding masks. The complexity of the smoke in terms of shape, texture, color and intensity is difficult to segment properly. The diversity of smoke types in the image database is crucial for generalizing prediction in real-world circumstances. Numerous research papers proposed new network architectures for segmenting smoke in visible images spectrum and tested the accuracy of the segmentation on their database. Database that, for the most of the time, was not available. This article deals with comparison of a network performances on two smoke databases and highlight the importance of a rich images database in terms of quality rather than quantity.

[1]  Luc Van Gool,et al.  The Pascal Visual Object Classes Challenge: A Retrospective , 2014, International Journal of Computer Vision.

[2]  Subhransu Maji,et al.  Semantic contours from inverse detectors , 2011, 2011 International Conference on Computer Vision.

[3]  Sylvain Paris,et al.  Automatic Portrait Segmentation for Image Stylization , 2016, Comput. Graph. Forum.

[4]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Sanja Fidler,et al.  The Role of Context for Object Detection and Semantic Segmentation in the Wild , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[7]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[8]  Xuelong Li,et al.  Deep Smoke Segmentation , 2018, Neurocomputing.

[9]  Roberto Cipolla,et al.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Sebastian Ramos,et al.  The Cityscapes Dataset for Semantic Urban Scene Understanding , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[12]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[13]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[15]  Giorgio Visani,et al.  Metrics for Multi-Class Classification: an Overview , 2020, ArXiv.

[16]  Noah Snavely,et al.  Material recognition in the wild with the Materials in Context Database , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[18]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[19]  Andreas Geiger,et al.  Vision meets robotics: The KITTI dataset , 2013, Int. J. Robotics Res..

[20]  José García Rodríguez,et al.  A Review on Deep Learning Techniques Applied to Semantic Segmentation , 2017, ArXiv.

[21]  Sanja Fidler,et al.  Detect What You Can: Detecting and Representing Objects Using Holistic Models and Body Parts , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[22]  Roberto Cipolla,et al.  Semantic object classes in video: A high-definition ground truth database , 2009, Pattern Recognit. Lett..