Higher order influence functions and minimax estimation of nonlinear functionals

We present a theory of point and interval estimation for nonlinear functionals in parametric, semi-, and non-parametric models based on higher order influence functions (Robins (2004), Section 9; Li et al. (2004), Tchetgen et al. (2006), Robins et al. (2007)). Higher order influence functions are higher order U-statistics. Our theory extends the first order semiparametric theory of Bickel et al. (1993) and van der Vaart (1991) by incorporating the theory of higher order scores considered by Pfanzagl (1990), Small and McLeish (1994) and Lindsay and Waterman (1996). The theory reproduces many previous results, produces new non-$\sqrt{n}$ results, and opens up the ability to perform optimal non-$\sqrt{n}$ inference in complex high dimensional models. We present novel rate-optimal point and interval estimators for various functionals of central importance to biostatistics in settings in which estimation at the expected $\sqrt{n}$ rate is not possible, owing to the curse of dimensionality. We also show that our higher order influence functions have a multi-robustness property that extends the double robustness property of first order influence functions described by Robins and Rotnitzky (2001) and van der Laan and Robins (2003).

[1]  S. Portnoy Asymptotic Behavior of Likelihood Methods for Exponential Families when the Number of Parameters Tends to Infinity , 1988 .

[2]  Lie Wang,et al.  Variance Function Estimation in Multivariate Nonparametric Regression , 2006 .

[3]  L. Brown,et al.  Effect of mean on variance function estimation in nonparametric regression , 2008, 0804.0709.

[4]  Aad Van Der Vbart,et al.  ON DIFFERENTIABLE FUNCTIONALS , 1988 .

[5]  P. Bickel Efficient and Adaptive Estimation for Semiparametric Models , 1993 .

[6]  J. Robins,et al.  Adaptive nonparametric confidence sets , 2006, math/0605473.

[7]  J. Pfanzagl Estimation in Semiparametric Models: Some Recent Developments , 1990 .

[8]  S. Dudoit,et al.  Asymptotics of cross-validated risk estimation in estimator selection and performance assessment , 2005 .

[9]  J. Robins,et al.  Robust Inference with Higher Order Inuence Functions : Part II , 2005 .

[10]  Alan J. Lee,et al.  U-Statistics: Theory and Practice , 1990 .

[11]  James L. Powell,et al.  Estimation of semiparametric models , 1994 .

[12]  C. Small,et al.  Hilbert Space Methods in Probability and Statistical Inference , 1994 .

[13]  J. Robins,et al.  Robust inference with higher order influence functions: Part I, Part II , 2005 .

[14]  Lie Wang,et al.  Variance function estimation in multivariate nonparametric regression with fixed design , 2009, J. Multivar. Anal..

[15]  Bruce G. Lindsay,et al.  Projected score methods for approximating conditional scores , 1996 .

[16]  Chris A. J. Klaassen,et al.  Consistent Estimation of the Influence Function of Locally Asymptotically Linear Estimators , 1987 .

[17]  Peter J. Bickel,et al.  INFERENCE FOR SEMIPARAMETRIC MODELS: SOME QUESTIONS AND AN ANSWER , 2001 .

[18]  K. Do,et al.  Efficient and Adaptive Estimation for Semiparametric Models. , 1994 .

[19]  P. Bickel,et al.  Nonparametric estimators which can be "plugged-in" , 2003 .

[20]  P. Massart,et al.  Estimation of Integral Functionals of a Density , 1995 .

[21]  James M. Robins,et al.  Optimal Structural Nested Models for Optimal Sequential Decisions , 2004 .

[22]  Stan Hurn Panel Data Econometrics , 2010 .

[23]  S. Mallat A wavelet tour of signal processing , 1998 .

[24]  J. Robins,et al.  Twicing Kernels and a Small Bias Property of Semiparametric Estimators , 2004 .

[25]  James M. Robins,et al.  Unified Methods for Censored Longitudinal Data and Causality , 2003 .

[26]  P. Bickel,et al.  Achieving Information Bounds in Non and Semiparametric Models , 1990 .

[27]  J. Robins,et al.  Toward a curse of dimensionality appropriate (CODA) asymptotic theory for semi-parametric models. , 1997, Statistics in medicine.

[28]  Q. Shao,et al.  On Parameters of Increasing Dimensions , 2000 .