Random Forest Classifiers

A classification tree represents the space P of posterior probabilities p(y|x) of label y given feature x by a recursive partition of the feature space X, where each partition step is performed by a test on the feature x. Each such test is called a split rule. Since the partition is recursive, the split rules can be arranged into a tree. The split rules are learned by partitioning the training set T recursively in a way that increases the purity of the subsets formed by each split. A set is pure if one of the labels dominates the others in the set, in a sense to be made more precise later on. To each set of the partition is assigned a posterior probability distribution, and p(y|x) for a training or test feature x ∈ X is then defined as the probability distribution associated with the set of the partition that contains x. A popular binary split rule called a 1-rule partitions a subset S ⊆ X × Y into the two sets
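The recursive-partition idea above can be sketched in Python. This is a minimal illustration, not the construction defined in the text: the purity criterion is left unspecified here, so the sketch splits at the mean of a single feature coordinate purely to show the recursive structure, the 1-rule test, and the per-cell posterior p(y|x). The function names (`one_rule`, `grow`, `predict`) are invented for this example.

```python
from collections import Counter

def one_rule(S, j, t):
    """Binary 1-rule: split samples (x, y) on whether x[j] <= t."""
    left = [(x, y) for x, y in S if x[j] <= t]
    right = [(x, y) for x, y in S if x[j] > t]
    return left, right

def posterior(S):
    """Empirical posterior p(y|x) for the cell containing S:
    the relative frequency of each label in S."""
    counts = Counter(y for _, y in S)
    n = len(S)
    return {y: c / n for y, c in counts.items()}

def grow(S, depth=0, max_depth=2):
    """Recursively partition S; each leaf stores the posterior
    distribution of its cell. The split choice (mean of feature 0)
    is a placeholder, not a real purity-based criterion."""
    labels = {y for _, y in S}
    if depth == max_depth or len(labels) == 1:
        return {"leaf": posterior(S)}
    j = 0
    t = sum(x[j] for x, _ in S) / len(S)  # split threshold: the mean
    left, right = one_rule(S, j, t)
    if not left or not right:  # degenerate split; stop here
        return {"leaf": posterior(S)}
    return {"rule": (j, t),
            "left": grow(left, depth + 1, max_depth),
            "right": grow(right, depth + 1, max_depth)}

def predict(tree, x):
    """Walk the split rules down to the cell containing x and
    return that cell's posterior distribution."""
    while "leaf" not in tree:
        j, t = tree["rule"]
        tree = tree["left"] if x[j] <= t else tree["right"]
    return tree["leaf"]

# Tiny training set T: one-dimensional features, two labels.
T = [((0.0,), "a"), ((1.0,), "a"), ((3.0,), "b"), ((4.0,), "b")]
tree = grow(T)
print(predict(tree, (0.5,)))  # posterior of the cell containing x = 0.5
```

Note how p(y|x) for a new feature x is simply the distribution attached to the cell of the partition that contains x, exactly as in the definition above.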