1 CLASSIFICATION EFFICIENCIES FOR ROBUST LINEAR DISCRIMINANT ANALYSIS

Linear discriminant analysis is typically carried out using Fisher’s method. This method relies on the sample averages and covariance matrices computed from the different groups constituting the training sample. Since sample averages and covariance matrices are not robust, it has been proposed to use robust estimators of location and covariance instead, yielding a robust version of Fisher’s method. In this paper relative classification efficiencies of the robust procedures with respect to the classical method are computed. Second order influence functions appear to be useful for computing these classification efficiencies. It turns out that, when using an appropriate robust estimator, the loss in classification efficiency at the normal model remains limited. These findings are confirmed by finite sample simulations.

[1]  V. Yohai,et al.  A Fast Algorithm for S-Regression Estimates , 2006 .

[2]  Peter Filzmoser,et al.  Multiple group linear discriminant analysis: robustness and error rate , 2006 .

[3]  Charles E. Heckler,et al.  Applied Multivariate Statistical Analysis , 2005, Technometrics.

[4]  Christophe Croux,et al.  Influence of observations on the misclassification probability in quadratic discriminant analysis , 2005 .

[5]  Mia Hubert,et al.  Fast and robust discriminant analysis , 2004, Comput. Stat. Data Anal..

[6]  J. A. Branco,et al.  Partial influence functions , 2002 .

[7]  C. Croux,et al.  Robust linear discriminant analysis using S‐estimators , 2001 .

[8]  W. Fung,et al.  High Breakdown Estimation for Multiple Populations with Applications to Discriminant Analysis , 2000 .

[9]  Katrien van Driessen,et al.  A Fast Algorithm for the Minimum Covariance Determinant Estimator , 1999, Technometrics.

[10]  Douglas M. Hawkins,et al.  High-Breakdown Linear Discriminant Analysis , 1997 .

[11]  W. Fung The influence of observations on misclassification probability in multiple discriminant analysis , 1996 .

[12]  Twk Fung,et al.  Influence on classification and probability of misclassification , 1995 .

[13]  P. J. Rousseeuw,et al.  Integrating a high-breakdown option into discriminant analysis in exploration geochemistry , 1992 .

[14]  W. Fung Some diagnostic measures in discriminant analysis , 1992 .

[15]  Frank Critchley,et al.  The influence of observations on misclassification probability estimates in linear discriminant analysis , 1991 .

[16]  H. P. Lopuhaä On the relation between S-estimators and M-estimators of multivariate location and covariance , 1989 .

[17]  S. Bull,et al.  The Efficiency of Multinomial Logistic Regression Compared with Multiple Group Discriminant Analysis , 1987 .

[18]  John Law,et al.  Robust Statistics—The Approach Based on Influence Functions , 1986 .

[19]  B. Efron The Efficiency of Logistic Regression Compared to Normal Discriminant Analysis , 1975 .