Doubly robust estimation of generalized partial linear models for longitudinal data with dropouts

We develop a doubly robust estimation of generalized partial linear models for longitudinal data with dropouts. Our method extends the highly efficient aggregate unbiased estimating function approach proposed in Qu et al. (2010) to a doubly robust one in the sense that under missing at random (MAR), our estimator is consistent when either the linear conditional mean condition is satisfied or a model for the dropout process is correctly specified. We begin with a generalized linear model for the marginal mean, and then move forward to a generalized partial linear model, allowing for nonparametric covariate effect by using the regression spline smoothing approximation. We establish the asymptotic theory for the proposed method and use simulation studies to compare its finite sample performance with that of Qu's method, the complete-case generalized estimating equation (GEE) and the inverse-probability weighted GEE. The proposed method is finally illustrated using data from a longitudinal cohort study.

[1]  M. Pepe,et al.  A cautionary note on inference for marginal regression models with longitudinal data and general correlated response data , 1994 .

[2]  Myunghee C. Paik,et al.  The generalized estimating equation approach when data are not missing completely at random , 1997 .

[3]  M. Kenward,et al.  A comparison of multiple imputation and doubly robust estimation for analyses with missing data , 2006 .

[4]  Mark Lunt,et al.  Health Assessment Questionnaire disability progression in early rheumatoid arthritis: Systematic review and analysis of two inception cohorts , 2014, Seminars in arthritis and rheumatism.

[5]  Zhongyi Zhu,et al.  Robust Estimation in Generalized Partial Linear Models for Clustered Data , 2005 .

[6]  J. Robins,et al.  Analysis of semiparametric regression models for repeated outcomes in the presence of missing data , 1995 .

[7]  Zhongyi Zhu,et al.  Robust estimation in generalized semiparametric mixed models for longitudinal data , 2007 .

[8]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[9]  Zhiqiang Tan,et al.  Bounded, efficient and doubly robust estimation with inverse weighting , 2010 .

[10]  Andrew Copas,et al.  Doubly robust generalized estimating equations for longitudinal data , 2009, Statistics in medicine.

[11]  Marie Davidian,et al.  Improved Doubly Robust Estimation When Data Are Monotonely Coarsened, with Application to Longitudinal Studies with Dropout , 2011, Biometrics.

[12]  A. Silman,et al.  The incidence of rheumatoid arthritis in the United Kingdom: results from the Norfolk Arthritis Register. , 1994, British journal of rheumatology.

[13]  Lin Lu,et al.  Highly Efficient Aggregate Unbiased Estimating Functions Approach for Correlated Data With Missing at Random , 2010 .

[14]  Grace Y. Yi,et al.  A functional generalized method of moments approach for longitudinal studies with missing responses and covariate measurement error , 2012, Biometrika.

[15]  J. Robins,et al.  Doubly Robust Estimation in Missing Data and Causal Inference Models , 2005, Biometrics.

[16]  J. Robins,et al.  Improved double-robust estimation in missing data and causal inference models. , 2012, Biometrika.

[17]  Xiao-Hua Zhou,et al.  Generalized Partially Linear Models for Incomplete Longitudinal Data In the Presence of Population‐Level Information , 2013, Biometrics.