Optimal aggregation of classifiers in statistical learning

Classification can be considered as nonparametric estimation of sets, where the risk is defined via a distance between sets associated with the misclassification error. It is shown that the rates of convergence of classifiers depend on two parameters: the complexity of the class of candidate sets and the margin parameter. The dependence is given explicitly, showing that optimal fast rates approaching $O(n^{-1})$ can be attained, where n is the sample size, and that the proposed classifiers are robust with respect to the margin. The main result of the paper concerns optimal aggregation of classifiers: we suggest a classifier that automatically adapts both to the complexity and to the margin, and attains the optimal fast rates up to a logarithmic factor.
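
As a reading aid, here is a minimal sketch of how the two parameters enter, stated in the notation commonly used for this setting; the symbols $\eta$, $G^*$, $\kappa$, $\rho$ and the constants below are assumptions of this sketch, not quoted from the abstract. Write $\eta(x) = P(Y = 1 \mid X = x)$ for the regression function and $G^* = \{x : \eta(x) \ge 1/2\}$ for the Bayes set. The margin assumption with parameter $\kappa \ge 1$ reads
\[
  P_X\bigl( 0 < |\eta(X) - \tfrac{1}{2}| \le t \bigr) \;\le\; c\, t^{1/(\kappa - 1)}
  \qquad \text{for all } t > 0,
\]
and the complexity assumption bounds the $\varepsilon$-entropy of the class of candidate sets by $A\,\varepsilon^{-\rho}$ for some $\rho > 0$. Under assumptions of this type, the excess risk $R(\hat G_n) - R(G^*)$ of an empirical risk minimizer is of order
\[
  n^{-\kappa/(2\kappa + \rho - 1)},
\]
which approaches the fast rate $n^{-1}$ as $\kappa \to 1$ and $\rho \to 0$.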
