A Fuzzy Decision Tree for Processing Satellite Images and Landsat Data

Abstract Satellite and airborne images, including Landsat, ASTER, and Hyperspectral data, are widely used in remote sensing and Geo- graphic Information Systems (GIS) to understand natural earth related processes, climate change, and anthropogenic activity. The nature of this type of data is usually multi or hyperspectral with individual spectral bands stored in raster file structures of large size and global coverage. The elevated number of bands (on the order of 200 to 250 bands) requires data processing algorithms capable of extracting information content, removing redundancy. Conventional statistical methods have been devised to reduce dimension- ality however they lack specific processing to handle data diversity. Hence, in this paper we propose a new data analytic technique to classify these complex multidimensional data cubes. Here, we use a well-known database consisting of multi-spectral values of pixels from satellite images, where the classification is associated with the central pixel in each neighborhood. The goal of our proposed approach is to predict this classification based on the given multi-spectral values. To solve this classification problem, we propose an improved decision tree (DT) algorithm based on a fuzzy approach. More particularly, we introduce a new hybrid classification algorithm that utilizes the conventional decision tree algorithm enhanced with the fuzzy approach. We propose an improved data classification algorithm that utilizes the best of a decision tree and multi-criteria classification. To investigate and evaluate the performance of our proposed method against other DT classifiers, a comparative and analytical study is conducted on well-known Landsat data.

[1]  Nabil Belacel,et al.  Multicriteria fuzzy assignment method: a useful tool to assist medical diagnosis , 2001, Artif. Intell. Medicine.

[2]  Nabil Belacel,et al.  Web-integration PROAFTN methodology for acute leukemia diagnosis. , 2005, Telemedicine journal and e-health : the official journal of the American Telemedicine Association.

[3]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[4]  Aiko M. Hormann,et al.  Programs for Machine Learning. Part I , 1962, Inf. Control..

[5]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[6]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[7]  Elisabeth N. Bui,et al.  Spatial data mining for enhanced soil map modelling , 2002, Int. J. Geogr. Inf. Sci..

[8]  Rick L. Lawrence,et al.  Classification of remotely sensed imagery using stochastic gradient boosting as a refinement of classification tree analysis , 2004 .

[9]  Usama M. Fayyad,et al.  Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning , 1993, IJCAI.

[10]  Ian H. Witten,et al.  Data mining - practical machine learning tools and techniques, Second Edition , 2005, The Morgan Kaufmann series in data management systems.

[11]  Nabil Belacel,et al.  Multicriteria assignment method PROAFTN: Methodology and medical application , 2000, Eur. J. Oper. Res..

[12]  J. Ross Quinlan,et al.  Improved Use of Continuous Attributes in C4.5 , 1996, J. Artif. Intell. Res..

[13]  S French,et al.  Multicriteria Methodology for Decision Aiding , 1996 .