Robust Bootstrap with Non Random Weights Based on the Influence Function

Abstract The existence of outliers in a sample is an obvious problem which can become worse when the usual bootstrap is applied, because some resamples may have a higher contamination level than the initial sample. Bootstrapping using robust estimators may be a solution to this problem. However, in many instances, this will not be enough because it can lead to several complications, such as: (i) the breakdown point for the whole procedure may be small even when based on an estimator with a high breakdown point; (ii) mathematical difficulties; (iii) very high computation time. In this paper, we suggest a modification of the bootstrap procedure in order to solve these problems which consists of forming each bootstrap sample by resampling with different probabilities so that the potentially more harmful observations have smaller probabilities of selection. The aim is to protect the whole procedure against a given number of arbitrary outliers. As an illustration, we consider point and interval estimation for the correlation coefficient. We use Monte Carlo methods to compare this method with another robust bootstrap procedure, the winsorized bootstrap, recently suggested by Singh [Singh, K. (1998). Breakdown theory for bootstrap quantiles. Ann. Statist. 26:1719–1732].

[1]  P. Rousseeuw,et al.  A robust version of principal factor analysis , 2000 .

[2]  P. Rousseeuw,et al.  Breakdown Points of Affine Equivariant Estimators of Multivariate Location and Covariance Matrices , 1991 .

[3]  Ana M. Pires,et al.  Influence functions and outlier detection under the common principal components model: A robust approach , 2002 .

[4]  W. R. Buckland,et al.  Outliers in Statistical Data , 1979 .

[5]  R. Zamar,et al.  Bootstrapping robust estimates of regression , 2002 .

[6]  P. Rousseeuw Least Median of Squares Regression , 1984 .

[7]  B. Efron Jackknife‐After‐Bootstrap Standard Errors and Influence Functions , 1992 .

[8]  B. Efron The jackknife, the bootstrap, and other resampling plans , 1987 .

[9]  S. J. Devlin,et al.  Robust estimation and outlier detection with correlation coefficients , 1975 .

[10]  B. Efron,et al.  The Jackknife: The Bootstrap and Other Resampling Plans. , 1983 .

[11]  Debashis Kushary,et al.  Bootstrap Methods and Their Application , 2000, Technometrics.

[12]  Jun Shao,et al.  The Efficiency and Consistency of Approximations to the Jackknife Variance Estimators , 1989 .

[13]  Kesar Singh,et al.  Breakdown theory for bootstrap quantiles , 1998 .

[14]  Arnold J. Stromberg,et al.  Robust covariance estimates based on resampling , 1997 .

[15]  D. Ruppert Robust Statistics: The Approach Based on Influence Functions , 1987 .

[16]  M. Chavance [Jackknife and bootstrap]. , 1992, Revue d'epidemiologie et de sante publique.