Semiparametric Regression Analysis With Missing Response at Random

We develop inference tools in a semiparametric partially linear regression model with missing response data. A class of estimators is defined that includes as special cases a semiparametric regression imputation estimator, a marginal average estimator, and a (marginal) propensity score weighted estimator. We show that any of our class of estimators is asymptotically normal. The three special estimators have the same asymptotic variance. They achieve the semiparametric efficiency bound in the homoscedastic Gaussian case. We show that the jackknife method can be used to consistently estimate the asymptotic variance. Our model and estimators are defined with a view to avoid the curse of dimensionality, which severely limits the applicability of existing methods. The empirical likelihood method is developed. It is shown that when missing responses are imputed using the semiparametric regression method the empirical log-likelihood is asymptotically a scaled chi-squared variable. An adjusted empirical log-likelihood ratio, which is asymptotically standard chisquared, is obtained. Also, a bootstrap empirical log-likelihood ratio is derived and its distribution is used to approximate that of the imputed empirical log-likelihood ratio. A simulation study is conducted to compare the adjusted and bootstrap empirical likelihood with the normal approximation-based method in terms of coverage accuracies and average lengths of confidence intervals. Based on biases and standard errors, a comparison is also made by simulation between the proposed estimators and the related estimators.

[1]  A. Jon Information Bounds for Regression Models with Missing Data , 2000 .

[2]  Petra E. Todd,et al.  Matching As An Econometric Evaluation Estimator , 1998 .

[3]  J. Robins,et al.  Estimation of Regression Coefficients When Some Regressors are not Always Observed , 1994 .

[4]  T. Severini,et al.  Quasi-Likelihood Estimation in Semiparametric Models , 1994 .

[5]  J. Powell,et al.  Semiparametric estimation of censored selection models with a nonparametric selection mechanism , 1993 .

[6]  J. Rice Convergence rates for partially splined models , 1986 .

[7]  J. Robins,et al.  Analysis of semi-parametric regression models with non-ignorable non-response. , 1997, Statistics in medicine.

[8]  R. Gray,et al.  Spline-based tests in survival analysis. , 1994, Biometrics.

[9]  P. Bickel Efficient and Adaptive Estimation for Semiparametric Models , 1993 .

[10]  J. Robins,et al.  Semiparametric Efficiency in Multivariate Regression Models with Missing Data , 1995 .

[11]  J. Stock Nonparametric Policy Analysis , 1989 .

[12]  James L. Powell,et al.  Semiparametric Estimation of Selection Models: Some Empirical Results , 1990 .

[13]  Jun S. Liu,et al.  Sequential Imputations and Bayesian Missing Data Problems , 1994 .

[14]  M. Emond,et al.  Information Bounds for Regression Models with Missing Data , 2000 .

[15]  Clive W. J. Granger,et al.  Semiparametric estimates of the relation between weather and electricity sales , 1986 .

[16]  P. Robinson ROOT-N-CONSISTENT SEMIPARAMETRIC REGRESSION , 1988 .

[17]  J. Peixoto A Property of Well-Formulated Polynomial Regression Models , 1990 .

[18]  F. Yates The analysis of replicated experiments when the field results are incomplete , 1933 .

[19]  Art B. Owen,et al.  Empirical Likelihood for Linear Models , 1991 .

[20]  M. Healy,et al.  Missing Values in Experiments Analysed on Automatic Computers , 1956 .

[21]  J. Robins,et al.  Adjusting for Nonignorable Drop-Out Using Semiparametric Nonresponse Models , 1999 .

[22]  Philip E. Cheng,et al.  Nonparametric Estimation of Mean Functionals with Data Missing at Random , 1994 .

[23]  P. Speckman Kernel smoothing in partial linear models , 1988 .

[24]  Anne Lohrli Chapman and Hall , 1985 .

[25]  J. Heckman,et al.  Matching as an econometric estimator , 1998 .

[26]  Hung Chen,et al.  Convergence Rates for Parametric Components in a Partly Linear Model , 1988 .

[27]  Oliver Linton,et al.  SECOND ORDER APPROXIMATION IN THE PARTIALLY LINEAR REGRESSION MODEL , 1995 .

[28]  Yuichi Kitamura,et al.  An Information-Theoretic Alternative to Generalized Method of Moments Estimation , 1997 .

[29]  A. Owen Empirical Likelihood Ratio Confidence Regions , 1990 .

[30]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[31]  Jack Cuzick,et al.  Semiparametric additive regression , 1992 .

[32]  James M. Robins,et al.  Semiparametric efficient estimation of a conditional density with missing or mismeasured covariates , 1995 .

[33]  J. Robins,et al.  Toward a curse of dimensionality appropriate (CODA) asymptotic theory for semi-parametric models. , 1997, Statistics in medicine.

[34]  J. N. K. Rao,et al.  Empirical Likelihood‐based Inference in Linear Models with Missing Data , 2002 .

[35]  J. Hahn On the Role of the Propensity Score in Efficient Semiparametric Estimation of Average Treatment Effects , 1998 .

[36]  K. Do,et al.  Efficient and Adaptive Estimation for Semiparametric Models. , 1994 .

[37]  D. O. Scharfstein Adjusting for nonignorable dropout using semiparametric nonresponse models (with discussion) , 1999 .

[38]  J. N. K. Rao,et al.  Empirical likelihood for linear regression models under imputation for missing responses , 2001 .

[39]  A. Owen Empirical likelihood ratio confidence intervals for a single functional , 1988 .

[40]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data , 1988 .

[41]  G. Imbens,et al.  Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score , 2002 .

[42]  C. J. Stone,et al.  Optimal Rates of Convergence for Nonparametric Estimators , 1980 .

[43]  Thomas Ed,et al.  SOME THERAPEUTIC APPROACHES TO CHRONIC RENAL INSUFFICIENCY. , 1965 .

[44]  J. Rao On Variance Estimation with Imputed Survey Data , 1996 .

[45]  Nancy E. Heckman,et al.  Spline Smoothing in a Partly Linear Model , 1986 .