Detecting flooded areas with machine learning techniques: case study of the Selška Sora river flash flood in September 2007

Abstract Floods seem to appear with increased frequency from one year to another. They create great damage to property and in some cases even result in lost lives. However, a quick and effective response by rescue services can greatly reduce the consequences. Machine learning techniques can reduce the time necessary for flood mapping. We test various machine learning methods to find the one with the highest classification accuracy. We also present the most important points for quick and effective machine learning procedures on remote sensing data. First, the data must be prepared correctly. We use satellite images, digital terrain models (DTMs), and the river network. The data in its primary form (e.g., bands of multispectral satellite images or DTMs) is insufficient. We also need certain derived attributes, such as the vegetation index or the slope derived from the DTM. Second, we must select suitable training samples and a suitable machine learning method. This approach to determining floods is presented in a case study of flash floods in the Selška Sora river valley. Machine learning techniques have proven successful in quickly determining flooded areas. The best results are produced by the J48 decision tree algorithm. The success of the ensemble machine learning methods is comparable to the J48 algorithm, while the JRip classification is not as good.

[1]  J. R. Quinlan Learning With Continuous Classes , 1992 .

[2]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[3]  Ian Witten,et al.  Data Mining , 2000 .

[4]  Ahmed Mahmood,et al.  An overview of the International Charter 'Space and Major Disasters' , 2002, IEEE International Geoscience and Remote Sensing Symposium.

[5]  Jay Gao,et al.  Use of normalized difference built-up index in automatically mapping urban areas from TM imagery , 2003 .

[6]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[7]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[8]  T. Gillespie,et al.  Assessment and prediction of natural hazards from satellite imagery , 2007, Progress in physical geography.

[9]  Matjaž Mikoš,et al.  Characteristics of the extreme rainfall event and consequent flash floods in W Slovenia in September 2007 , 2009 .

[10]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[11]  Peter Lamovec,et al.  UPORABA STROJNEGA UČENJA ZA DOLOČITEV POPLAVLJENIH OBMOČIJ - PRIMER POPLAV V SELŠKI DOLINI LETA 2007 APPLICATION OF DATA MINING FOR DETERMINATION OF FLOODED AREAS - SELŠKA VALLEY 2007 FLOODS CASE STUDY , 2010 .

[12]  Frank S. Marzano,et al.  Lessons learned from using COSMO-SkyMed imagery for flood mapping: some case studies , 2012, Remote Sensing.

[13]  Paul D. Bates,et al.  A Change Detection Approach to Flood Mapping in Urban Areas Using TerraSAR-X , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[14]  Saso Dzeroski,et al.  Tree ensembles for predicting structured outputs , 2013, Pattern Recognit..

[15]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .