An Information Theoretic Perspective on Multiple Classifier Systems

This paper examines the benefits that information theory can bring to the study of multiple classifier systems. We first discuss the relationship between the mutual information and the classification error of a predictor. We then show how this applies to ensemble systems, deriving a natural expansion of the ensemble mutual information into "accuracy" and "diversity" components. This expansion yields a diversity term naturally, in contrast to previous attempts to define one artificially. The main finding is that diversity in fact exists at multiple orders of correlation, and that pairwise diversity measures can capture only the low-order components.
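As a sketch of the quantities involved (a hedged reconstruction, not the paper's exact statements): the classical bounds of Fano and of Hellman and Raviv tie the probability of error $p_e$ in predicting a target $Y$ from an observation $X$ to the conditional entropy $H(Y \mid X)$,

\[
H(Y \mid X) \;\le\; H(p_e) + p_e \log\bigl(|\mathcal{Y}| - 1\bigr) \quad \text{(Fano)},
\qquad
p_e \;\le\; \tfrac{1}{2}\, H(Y \mid X) \quad \text{(Hellman--Raviv)}.
\]

Since $I(X;Y) = H(Y) - H(Y \mid X)$, raising the mutual information tightens both bounds on the error. For an ensemble with member outputs $X_1, \dots, X_T$, one natural expansion of the joint mutual information, written here with McGill's interaction information $I(\cdot)$ over sets of variables, is

\[
I(X_{1:T}; Y)
\;=\;
\underbrace{\sum_{i=1}^{T} I(X_i; Y)}_{\text{individual relevance (``accuracy'')}}
\;+\;
\underbrace{\sum_{k=2}^{T} \sum_{\substack{S \subseteq \{1,\dots,T\} \\ |S| = k}} I\bigl(\{X_j : j \in S\} \cup \{Y\}\bigr)}_{\text{interaction terms (``diversity'') at orders } 2,\dots,T}.
\]

For $T = 2$ the diversity component is the single pairwise term $I(X_1; X_2 \mid Y) - I(X_1; X_2)$; larger ensembles contribute interaction terms at every order $k > 2$ as well, which is why pairwise diversity measures recover only the low-order part of the sum.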
