A Fast Algorithm for S-Regression Estimates

Equivariant high-breakdown point regression estimates are computationally expensive, and the corresponding algorithms become unfeasible for moderately large number of regressors. One important advance to improve the computational speed of one such estimator is the fast-LTS algorithm. This article proposes an analogous algorithm for computing S-estimates. The new algorithm, that we call “fast-S”, is also based on a “local improvement” step of the resampling initial candidates. This allows for a substantial reduction of the number of candidates required to obtain a good approximation to the optimal solution. We performed a simulation study which shows that S-estimators computed with the fast-S algorithm compare favorably to the LTS-estimators computed with the fast-LTS algorithm.

[1]  F. Hampel A General Qualitative Definition of Robustness , 1971 .

[2]  Location estimators based on linear combinations of modified order statistics , 1976 .

[3]  Frederick R. Forst,et al.  On robust estimation of the location parameter , 1980 .

[4]  Peter J. Rousseeuw,et al.  ROBUST REGRESSION BY MEANS OF S-ESTIMATORS , 1984 .

[5]  P. Rousseeuw Least Median of Squares Regression , 1984 .

[6]  J.M. Steele,et al.  Algorithms and complexity for least median of squares regression , 1986, Discret. Appl. Math..

[7]  V. Yohai HIGH BREAKDOWN-POINT AND HIGH EFFICIENCY ROBUST ESTIMATES FOR REGRESSION , 1987 .

[8]  Peter J. Rousseeuw,et al.  Robust regression and outlier detection , 1987 .

[9]  V. Yohai,et al.  High Breakdown-Point Estimates of Regression by Means of the Minimization of an Efficient Scale , 1988 .

[10]  V. Yohai,et al.  Min-Max Bias Robust Regression. , 1989 .

[11]  P. L. Davies,et al.  The asymptotics of S-estimators in the linear regression model , 1990 .

[12]  D. Pollard,et al.  Cube Root Asymptotics , 1990 .

[13]  P. Rousseeuw,et al.  Least median of squares estimation in power systems , 1991 .

[14]  D. Ruppert Computing S Estimators for Regression and Multivariate Location/Dispersion , 1992 .

[15]  Arnold J. Stromberg,et al.  Computing the Exact Least Median of Squares Estimate and Stability Diagnostics in Multiple Linear Regression , 1993, SIAM J. Sci. Comput..

[16]  David M. Rocke,et al.  Computable Robust Estimation of Multivariate Location and Shape in High Dimension Using Compound Estimators , 1994 .

[17]  The influence functions for the least trimmed squares and the least trimmed absolute deviations estimators , 1994 .

[18]  Douglas M. Hawkins,et al.  The feasible solution algorithm for least trimmed squares regression , 1994 .

[19]  O. Hössjer Rank-Based Estimates in the Linear Model with High Breakdown Point , 1994 .

[20]  David L. Woodruff,et al.  Identification of Outliers in Multivariate Data , 1996 .

[21]  David E. Tyler,et al.  Constrained M-Estimation for Regression , 1996 .

[22]  Katrien van Driessen,et al.  A Fast Algorithm for the Minimum Covariance Determinant Estimator , 1999, Technometrics.

[23]  José Julio Espina Agulló New algorithms for computing the least trimmed squares regression estimator , 2001 .

[24]  Ruben H. Zamar,et al.  Maximum Bias Curves for Robust Regression with Non-elliptical Regressors , 2001 .

[25]  C. Müller,et al.  Breakdown points of trimmed likelihood estimators and related estimators in generalized linear models , 2003 .

[26]  PETER J. ROUSSEEUW,et al.  Computing LTS Regression for Large Data Sets , 2005, Data Mining and Knowledge Discovery.