Nonparametric curve estimation with missing data: A general empirical process approach

Abstract: A general nonparametric imputation procedure, based on kernel regression, is proposed to estimate point parameters as well as set- and function-indexed parameters when the data are missing at random (MAR). The proposed method works by imputing a specific function of a missing value (and not the missing value itself), where the form of this function is dictated by the parameter of interest. Both single and multiple imputation are considered. The associated empirical processes provide the right tool for studying the uniform convergence properties of the resulting estimators. Our estimators include, as special cases, the imputation estimator of the mean, the estimator of the distribution function proposed by Cheng and Chu [1996. Kernel estimation of distribution functions and quantiles with missing data. Statist. Sinica 6, 63–78], imputation estimators of a marginal density, and imputation estimators of regression functions.
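To make the idea of imputing a function of the missing value concrete, the following is a minimal Python sketch of a single-imputation estimator of E[g(Y)] when the response Y is missing at random given a scalar covariate X. It is an illustration under stated assumptions and not the authors' exact procedure: the Gaussian kernel, the fixed bandwidth `h`, and the names `gaussian_kernel` and `kernel_impute_mean` are hypothetical choices made here. Taking g as the identity gives an imputation estimator of the mean; taking g(y) = 1{y ≤ t} gives an estimator of the distribution function at t.

```python
import numpy as np


def gaussian_kernel(u):
    """Standard Gaussian kernel (an assumed choice; other kernels work too)."""
    return np.exp(-0.5 * u**2) / np.sqrt(2.0 * np.pi)


def kernel_impute_mean(x, y, observed, g, h):
    """Single-imputation estimate of E[g(Y)] with Y missing at random given X.

    x        : covariates, always observed, shape (n,)
    y        : responses, shape (n,); entries with observed == False are ignored
    observed : boolean indicators, True where y is observed
    g        : vectorized function of y chosen to match the target parameter
    h        : kernel bandwidth (assumed fixed and positive)
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    observed = np.asarray(observed, dtype=bool)

    # Evaluate g only on the observed responses.
    g_obs = g(y[observed])

    # Kernel weights between every point and every complete case.
    w = gaussian_kernel((x[:, None] - x[None, observed]) / h)

    # Nadaraya-Watson estimate of E[g(Y) | X = x_i] from the complete cases.
    # Assumes each row has a nonzero weight sum (bandwidth not too small).
    m_hat = (w @ g_obs) / w.sum(axis=1)

    # Keep g(Y_i) where Y_i is observed; impute the regression estimate otherwise.
    filled = m_hat.copy()
    filled[observed] = g_obs
    return filled.mean()


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n = 2000
    x = rng.uniform(0.0, 1.0, n)
    y = 2.0 * x + rng.normal(0.0, 0.5, n)
    # Missingness depends only on x, so the MAR assumption holds by construction.
    observed = rng.uniform(size=n) < 0.4 + 0.5 * x
    y_miss = np.where(observed, y, np.nan)

    print("mean estimate:", kernel_impute_mean(x, y_miss, observed, lambda v: v, 0.1))
    print("F(1.0) estimate:",
          kernel_impute_mean(x, y_miss, observed, lambda v: (v <= 1.0).astype(float), 0.1))
```

The only thing that changes between target parameters in this sketch is the function `g`, which mirrors the abstract's point that the parameter of interest dictates what is imputed.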

[1]  J. Robins,et al.  Estimation of Regression Coefficients When Some Regressors are not Always Observed , 1994 .

[2]  Oliver Linton,et al.  Semiparametric Regression Analysis With Missing Response at Random , 2003 .

[3]  A. Kolmogorov,et al.  ε-entropy and ε-capacity of sets in functional spaces , 1961 .

[4]  M. Lefebvre Applied probability and statistics , 2006 .

[5]  Qiwei Yao,et al.  Set-Indexed Conditional Empirical and Quantile Processes Based on Dependent Data , 2002 .

[6]  D. Pollard Convergence of stochastic processes , 1984 .

[7]  Philip E. Cheng,et al.  Nonparametric Estimation of Mean Functionals with Data Missing at Random , 1994 .

[8]  C. Chu,et al.  Kernel Estimation of Distribution Functions and Quantiles with Missing Data , 1996 .

[9]  George C. Canavos,et al.  Applied Probability and Statistical Methods; Introduction to Mathematical Statistics; Dice, Data and Decisions: Introductory Statistics , 1986 .

[10]  Jon A. Wellner,et al.  Weak Convergence and Empirical Processes: With Applications to Statistics , 1996 .

[11]  S. Geer Applications of empirical process theory , 2000 .

[12]  M. Hazelton Marginal density estimation from incomplete bivariate data , 2000 .

[13]  M. Degroot,et al.  Probability and Statistics , 2021, Examining an Operational Approach to Teaching Probability.

[14]  D. Rubin,et al.  Statistical Analysis with Missing Data , 1988 .

[15]  G. Imbens,et al.  Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score , 2000 .