Sampling Adjusted Analysis of Dynamic Additive Regression Models for Longitudinal Data

We consider a modelling approach to longitudinal data that aims at estimating flexible covariate effects in a model where the sampling probabilities are modelled explicitly. The joint modelling yields simple estimators that are easy to compute and analyse, even if the sampling of the longitudinal responses interacts with the response level. An incorrect model for the sampling probabilities results in biased estimates. Non-representative sampling occurs, for example, if patients with an extreme development (based on extreme values of the response) are called in for additional examinations and measurements. We allow covariate effects to be time-varying or time-constant. Estimates of covariate effects are obtained by solving martingale equations locally for the cumulative regression functions. Using Aalen's additive model for the sampling probabilities, we obtain simple expressions for the estimators and their asymptotic variances. The asymptotic distributions for the estimators of the non-parametric components as well as the parametric components of the model are derived drawing on general martingale results. Two applications are presented. We consider the growth of cystic fibrosis patients and the prothrombin index for liver cirrhosis patients. The conclusion about the growth of the cystic fibrosis patients is not altered when adjusting for a possible non-representativeness in the sampling, whereas we reach substantively different conclusions about the treatment effect for the liver cirrhosis patients.

[1]  Ian W. McKeague,et al.  A partly parametric additive risk model , 1994 .

[2]  O. Aalen,et al.  Further results on the non-parametric linear regression model in survival analysis. , 1993, Statistics in medicine.

[3]  T. Scheike Nonparametric kernel regression when the regressor follows a counting process , 1996 .

[4]  O. Aalen Nonparametric Inference for a Family of Counting Processes , 1978 .

[5]  P. Speckman Kernel smoothing in partial linear models , 1988 .

[6]  D. Harrington,et al.  Counting Processes and Survival Analysis , 1991 .

[7]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[8]  I. McKeague,et al.  Identification of Nonlinear Time Series from First Order Cumulative Characteristics , 1994 .

[9]  Ian W. McKeague,et al.  Weighted Least Squares Estimation for Aalen's Additive Risk Model , 1991 .

[10]  J. Robins,et al.  Analysis of semiparametric regression models for repeated outcomes in the presence of missing data , 1995 .

[11]  O. Aalen A linear regression model for the analysis of life times. , 1989, Statistics in medicine.

[12]  Ian W. McKeague,et al.  Asymptotic Theory for Weighted Least Squares Estimators in Aalen's Additive Risk Model. , 1987 .

[13]  Niels Keiding,et al.  Statistical Models Based on Counting Processes , 1993 .

[14]  Comparison of non-parametric regression functions through their cumulatives , 2000 .

[15]  R. Tibshirani,et al.  Varying‐Coefficient Models , 1993 .

[16]  Thomas H. Scheike,et al.  Parametric regression for longitudinal data with counting process measurement times , 1994 .