Identification of Outliers

One of the most vexing problems in data analysis is deciding whether to discard some observations because they are inconsistent with the rest of the observations, with the probability distribution assumed to underlie the data, or both. One line of research on this problem is the study of robust statistical procedures (cf. Huber [1981]), primarily procedures for estimating population parameters that are insensitive to the effect of “outliers”, i.e., observations inconsistent with the assumed model of the random process generating the observations. Typically, a robust procedure involves some “trimming” or down-weighting step, in which some fraction of the extreme observations is automatically eliminated or given less weight to guard against the potential effect of outliers.
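As a rough illustration of the "trimming" idea, the sketch below computes a symmetric trimmed mean: it sorts the sample, drops a fixed fraction of observations from each tail, and averages the rest. The function name `trimmed_mean`, the parameter `trim_fraction`, and the sample values are assumptions made for this example; they are not taken from the text or from Huber [1981].

```python
def trimmed_mean(values, trim_fraction=0.1):
    """Symmetric trimmed mean (illustrative sketch, hypothetical helper).

    Drops floor(trim_fraction * n) observations from each tail of the
    sorted sample and returns the mean of the remaining observations.
    """
    if not values:
        raise ValueError("values must be non-empty")
    if not 0 <= trim_fraction < 0.5:
        raise ValueError("trim_fraction must be in [0, 0.5)")

    ordered = sorted(values)
    n = len(ordered)
    k = int(trim_fraction * n)   # number of observations cut from each tail
    kept = ordered[k:n - k]      # never empty, because trim_fraction < 0.5
    return sum(kept) / len(kept)


# Made-up sample: nine readings near 10 and one wild value.
sample = [9.8, 10.1, 10.0, 9.9, 10.2, 10.0, 9.7, 10.3, 10.1, 55.0]

plain_mean = sum(sample) / len(sample)     # pulled upward by 55.0
robust_mean = trimmed_mean(sample, 0.1)    # drops 9.7 and 55.0 before averaging
```

With 10% trimming on ten observations, one value is removed from each end, so the single wild reading no longer affects the estimate. The cost is that one legitimate observation is discarded as well, which is the usual price of trimming automatically rather than testing individual observations.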
