SUMMARY The influence of observations upon misclassification probability estimates in linear discriminant analysis is examined. For a single observation, an exact expression is given confirming two surprising results of Campbell (1978). It also shows that the influence of an observation is governed by two quantities: (a) the difference between its linear discriminant score and that for its sample mean, and (b) its atypicality estimate for its own population. This is analogous to linear model results where an observation's influence depends upon its residual and its leverage. However, important differences from the regression situation are noted. Contours of this one-at-a-time influence can be superimposed on the plot introduced by Critchley & Ford (1985). Examining the joint influence of several observations -is complicated by the computational burden and by possible masking effects. A quadratic approximation is developed for this problem using Pregibon's (1981) case weights scheme. This approximation has an error of order n-3, where n denotes the assumed common order of the sample sizes. An example is given.
[1]
S. Weisberg,et al.
Residuals and Influence in Regression
,
1982
.
[2]
R. Dennis Cook,et al.
Assessing influence on regression coefficients in generalized linear models
,
1989
.
[3]
N. Campbell,et al.
The Influence Function as an Aid in Outlier Detection in Discriminant Analysis
,
1978
.
[4]
S. Weisberg.
Plots, transformations, and regression
,
1985
.
[5]
F. Critchley.
Influence in principal components analysis
,
1985
.
[6]
D. Pregibon.
Logistic Regression Diagnostics
,
1981
.
[7]
John Aitchison,et al.
Statistical Prediction Analysis
,
1975
.
[8]
T. W. Anderson.
An Introduction to Multivariate Statistical Analysis
,
1959
.
[9]
F. Critchley,et al.
Interval estimation in discrimination: The multivariate normal equal covariance case
,
1985
.