On robust linear regression with incomplete data

In this paper, we use recently developed methods of very robust regression to extend missing value techniques to data with several outliers. Simulation experiments reveal that imputation can introduce additional outliers if the outliers already in the data are ignored. Combining the forward search algorithm for high breakdown point estimators with the EM algorithm or multiple imputation for missing values can avoid problems of this kind. Some multiple deletion diagnostics for linear regression with incomplete data are also discussed.
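The effect of existing outliers on imputed values can be illustrated with a small simulation. The sketch below is not the procedure of the paper: it does not implement the forward search, the EM algorithm, or multiple imputation. As a stand-in, it assumes a single regressor, imputes each missing response by its fitted conditional mean, and compares an ordinary least squares fit with a simple trimmed least squares fit obtained from random two-point starts followed by concentration steps. All names, the sample size, the outlier pattern, and the trimming level `h` are hypothetical choices made for illustration.

```python
import numpy as np


def fit_line(x, y):
    """Ordinary least squares fit of y = b0 + b1 * x; returns (b0, b1)."""
    X = np.column_stack([np.ones_like(x), x])
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    return beta


def trimmed_fit(x, y, h, n_starts=50, n_steps=20, seed=0):
    """Approximate trimmed least squares fit (illustrative stand-in only).

    From each random two-point start, repeatedly refit on the h cases with
    the smallest squared residuals, then keep the fit whose h smallest
    squared residuals have the lowest sum.
    """
    rng = np.random.default_rng(seed)
    best_beta, best_obj = None, np.inf
    for _ in range(n_starts):
        subset = rng.choice(len(x), size=2, replace=False)
        for _ in range(n_steps):
            beta = fit_line(x[subset], y[subset])
            sq_res = (y - beta[0] - beta[1] * x) ** 2
            subset = np.argsort(sq_res)[:h]
        beta = fit_line(x[subset], y[subset])
        obj = np.sort((y - beta[0] - beta[1] * x) ** 2)[:h].sum()
        if obj < best_obj:
            best_beta, best_obj = beta, obj
    return best_beta


# Simulated data (assumed setup): true line y = 1 + 2x with unit-variance noise.
rng = np.random.default_rng(1)
n, n_out = 60, 8
x = rng.uniform(0, 10, n)
x[:n_out] = rng.uniform(8, 10, n_out)   # outliers placed at high x
y = 1 + 2 * x + rng.normal(0, 1, n)
y[:n_out] -= 30                         # shift the outlying responses downwards

# Cases whose response is missing; the regressor is observed.
x_missing = np.linspace(0, 10, 11)
target = 1 + 2 * x_missing              # conditional mean under the true model

b_ols = fit_line(x, y)                  # ignores the outliers
b_rob = trimmed_fit(x, y, h=45)         # trims the 15 largest squared residuals

imputed_ols = b_ols[0] + b_ols[1] * x_missing
imputed_robust = b_rob[0] + b_rob[1] * x_missing

err_ols = np.mean(np.abs(imputed_ols - target))
err_robust = np.mean(np.abs(imputed_robust - target))
print("mean absolute imputation error, least squares:", err_ols)
print("mean absolute imputation error, trimmed fit:  ", err_robust)
```

In this setup the least squares line is pulled towards the outlying cases, so the values it imputes fall away from the bulk of the data, whereas the trimmed fit imputes values close to the true conditional mean. The trimming level is fixed in advance here; a forward search would instead monitor the fit as the subset grows.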
