In experimental science measurements are typically repeated only a few times, yielding a sample size n of the order of 3 to 8. One then wants to summarize the measurements by a central value and measure their variability, i.e. estimate location and scale. These estimates should preferably be robust against outliers, as reflected by their small-sample breakdown value. The estimator's stylized empirical influence function should be smooth, monotone increasing for location, and decreasing-increasing for scale. It turns out that location can be estimated robustly for n ≥ 3, whereas for scale n ≥ 4 is needed. Several well-known robust estimators are studied for small n, yielding some surprising results. For instance, the Hodges-Lehmann estimator equals the average when n=4. Also location M-estimators with auxiliary scale are studied, addressing issues like the difference between one-step and fully iterated M-estimators. Simultaneous M-estimators of location and scale ('Huber's Proposal 2') are considered as well, and it turns out that their lack of robustness is already noticeable for such small samples. Recommendations are given as to which estimators to use.
[1]
P. Rousseeuw,et al.
Alternatives to the Median Absolute Deviation
,
1993
.
[2]
Peter J. Rousseeuw,et al.
Unconventional features of positive-breakdown estimators
,
1994
.
[3]
Werner A. Stahel,et al.
Robust Statistics: The Approach Based on Influence Functions
,
1987
.
[4]
P. Rousseeuw,et al.
A class of high-breakdown scale estimators based on subranges
,
1992
.
[5]
Ali S. Hadi,et al.
Finding Groups in Data: An Introduction to Chster Analysis
,
1991
.
[6]
D. F. Andrews,et al.
Robust Estimates of Location: Survey and Advances.
,
1975
.
[7]
Peter J. Rousseeuw,et al.
Robust regression and outlier detection
,
1987
.