Random Forest classification of multisource remote sensing and geographic data

The use of random forests for classification of multisource data is investigated in this paper. Random Forest is a classifier that grows many classification trees. Each tree is trained on a bootstrapped sample of the training data, and at each node the algorithm only searches across a random subset of the variables to determine a split. To classify an input vector in random forest, the vector is submitted as an input to each of the trees in the forest, and the classification is then determined by a majority vote. The experiments presented in the paper were done on a multisource remote sensing and geographic data set. The experimental results obtained with random forests were compared to results obtained by bagging and boosting methods.

[1]  David G. Stork,et al.  Pattern classification, 2nd Edition , 2000 .

[2]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[3]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[4]  Robert C. Holte,et al.  Very Simple Classification Rules Perform Well on Most Commonly Used Datasets , 1993, Machine Learning.

[5]  Ron Kohavi,et al.  The Power of Decision Tables , 1995, ECML.

[6]  Joydeep Ghosh,et al.  Random forests of binary hierarchical classifiers for analysis of hyperspectral data , 2003, IEEE Workshop on Advances in Techniques for Analysis of Remotely Sensed Data, 2003.

[7]  Johannes R. Sveinsson,et al.  Multiple classifiers applied to multisource remote sensing data , 2002, IEEE Trans. Geosci. Remote. Sens..