Learning Spatial Decision Trees for Land Cover Mapping

Given learning samples from a raster dataset, spatial decision tree learning aims to find a decision tree classifier that minimizes classification errors as well as salt-and-pepper noise. The problem has important societal applications such as land cover classification for natural resource management. However, the problem is challenging due to the fact that learning samples show spatial autocorrelation in class labels, instead of being independently identically distributed. Related work relies on local tests (i.e., testing feature information of a location) and cannot adequately model the spatial autocorrelation effect, resulting in salt-and-pepper noise. In contrast, in my recent papers, we proposed a focal-test-based spatial decision tree (FTSDT), in which the tree traversal direction of a sample is based on both local and focal (neighborhood) information. We introduced a focal test approach with adaptive neighborhoods that avoids over-smoothing in wedge-shaped areas. We also conducted computational refinement on the FTSDT training algorithm by reusing focal values across candidate thresholds. Theoretical analysis shows that the refined training algorithm is correct and more scalable. Experiment results on real world datasets show that new FTSDT with adaptive neighborhoods improves classification accuracy, and that our computational refinement significantly reduces training time.

[1]  Geoffrey J. Hay,et al.  Geographic Object-Based Image Analysis (GEOBIA): A new name for a new discipline , 2008 .

[2]  W. Marsden I and J , 2012 .

[3]  Joseph F. Knight,et al.  Influence of Multi-Source and Multi-Temporal Remotely Sensed and Ancillary Data on the Accuracy of Random Forest Classification of Wetlands in Northern Minnesota , 2013, Remote. Sens..

[4]  Chris Rizos,et al.  Manual of Geospatial Science and Technology , 2009 .

[5]  Hassan A. Kingravi,et al.  Detection of blue-white veil areas in dermoscopy images using machine learning techniques , 2006, SPIE Medical Imaging.

[6]  Shashi Shekhar,et al.  Learning spatial decision tree for geographical classification: a summary of results , 2012, SIGSPATIAL/GIS.

[7]  Michelangelo Ceci,et al.  Dealing with spatial autocorrelation when learning predictive clustering trees , 2013, Ecol. Informatics.

[8]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[9]  C. Brodley,et al.  Decision tree classification of land cover from remotely sensed data , 1997 .

[10]  Tao Yuan,et al.  Spatial defect pattern recognition on semiconductor wafers using model-based clustering and Bayesian inference , 2008, Eur. J. Oper. Res..

[11]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[12]  Shashi Shekhar,et al.  Focal-Test-Based Spatial Decision Tree Learning , 2015, IEEE Transactions on Knowledge and Data Engineering.

[13]  Alice Deschamps,et al.  Geospatial data integration for applications in flood prediction and management in the Red River Basin , 2002, IEEE International Geoscience and Remote Sensing Symposium.

[14]  C. Boncelet Image Noise Models , 2009 .

[15]  Shashi Shekhar,et al.  Focal-Test-Based Spatial Decision Tree Learning: A Summary of Results , 2013, 2013 IEEE 13th International Conference on Data Mining.

[16]  L. Anselin Local Indicators of Spatial Association—LISA , 2010 .

[17]  David W. Aha,et al.  Ensembles of Classifiers for Morphological Galaxy Classification , 2001 .

[18]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[19]  C. Westen,et al.  Remote sensing for natural disaster management , 2000 .

[20]  Russell G. Congalton,et al.  Assessing the accuracy of remotely sensed data : principles and practices , 1998 .

[21]  Xiang Li,et al.  A Spatial Entropy‐Based Decision Tree for Classification of Geographical Information , 2006, Trans. GIS.

[22]  Russell G. Congalton,et al.  A review of assessing the accuracy of classifications of remotely sensed data , 1991 .