Longitudinal Data with Follow-up Truncated by Death: Match the Analysis Method to Research Aims.

Diverse analysis approaches have been proposed to distinguish data missing due to death from nonresponse, and to summarize trajectories of longitudinal data truncated by death. We demonstrate how these analysis approaches arise from factorizations of the distribution of longitudinal data and survival information. Models are illustrated using cognitive functioning data for older adults. For unconditional models, deaths do not occur, deaths are independent of the longitudinal response, or the unconditional longitudinal response is averaged over the survival distribution. Unconditional models, such as random effects models fit to unbalanced data, may implicitly impute data beyond the time of death. Fully conditional models stratify the longitudinal response trajectory by time of death. Fully conditional models are effective for describing individual trajectories, in terms of either aging (age, or years from baseline) or dying (years from death). Causal models (principal stratification) as currently applied are fully conditional models, since group differences at one timepoint are described for a cohort that will survive past a later timepoint. Partly conditional models summarize the longitudinal response in the dynamic cohort of survivors. Partly conditional models are serial cross-sectional snapshots of the response, reflecting the average response in survivors at a given timepoint rather than individual trajectories. Joint models of survival and longitudinal response describe the evolving health status of the entire cohort. Researchers using longitudinal data should consider which method of accommodating deaths is consistent with research aims, and use analysis methods accordingly.

[1]  Dylan Small,et al.  Defining and Estimating Intervention Effects for Groups that will Develop an Auxiliary Outcome , 2007 .

[2]  J. Williamson,et al.  Patterns of Self‐Rated Health in Older Adults Before and After Sentinel Health Events , 2001, Journal of the American Geriatrics Society.

[3]  Myunghee C. Paik,et al.  The generalized estimating equation approach when data are not missing completely at random , 1997 .

[4]  Xihong Lin,et al.  Semiparametric Modeling of Longitudinal Measurements and Time‐to‐Event Data–A Two‐Stage Regression Calibration Approach , 2008, Biometrics.

[5]  J. Kalbfleisch,et al.  Between- and within-cluster covariate effects in the analysis of clustered data. , 1998, Biometrics.

[6]  G. Robinson That BLUP is a Good Thing: The Estimation of Random Effects , 1991 .

[7]  D. Rubin Multiple imputation for nonresponse in surveys , 1989 .

[8]  J. Ware,et al.  Random-effects models for longitudinal data. , 1982, Biometrics.

[9]  本田 純久 Longitudinal Data , 2003, Encyclopedia of Wireless Networks.

[10]  V. De Gruttola,et al.  Modelling progression of CD4-lymphocyte count and its relationship to survival time. , 1994, Biometrics.

[11]  J. Williamson,et al.  Predicting future years of healthy life for older adults. , 1998, Journal of clinical epidemiology.

[12]  M. Wulfsohn,et al.  A joint model for survival and longitudinal data measured with error. , 1997, Biometrics.

[13]  N. Pedersen,et al.  Population Inference with Mortality and Attrition in Longitudinal Studies on Aging: A Two-Stage Multiple Imputation Method , 2007, Experimental aging research.

[14]  R. Kronmal,et al.  The Cardiovascular Health Study: design and rationale. , 1991, Annals of epidemiology.

[15]  S. Thompson,et al.  A joint analysis of quality of life and survival using a random effect selection model. , 2000, Statistics in medicine.

[16]  E. Mackenzie,et al.  On Estimation of the Survivor Average Causal Effect in Observational Studies When Important Confounders Are Missing Due to Death , 2009, Biometrics.

[17]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[18]  D. Pauler,et al.  An Estimator for Treatment Comparisons among Survivors in Randomized Trials , 2005, Biometrics.

[19]  Roderick J. A. Little,et al.  Modeling the Drop-Out Mechanism in Repeated-Measures Studies , 1995 .

[20]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[21]  浜田賀代子,et al.  老年期痴呆患者のスクリーニングにおけるThe modified mini-mental state (3MS) examination日本語版の有用性 , 1992 .

[22]  T. Raghunathan,et al.  Including deaths when measuring health status over time. , 1995, Medical care.

[23]  B. Psaty,et al.  Vascular events, mortality, and preventive therapy following ischemic stroke in the elderly , 2005, Neurology.

[24]  J. Spertus,et al.  Transforming Self-Rated Health and the SF-36 Scales to Include Death and Improve Interpretability , 2001, Medical care.

[25]  N M Laird,et al.  Missing data in longitudinal studies. , 1988, Statistics in medicine.

[26]  M. Joffe,et al.  A Potential Outcomes Approach to Developmental Toxicity Analyses , 2006, Biometrics.

[27]  Scott L. Zeger,et al.  Marginalized Multilevel Models and Likelihood Inference , 2000 .

[28]  D. Rubin Causal Inference Through Potential Outcomes and Principal Stratification: Application to Studies with “Censoring” Due to Death , 2006, math/0612783.

[29]  Marginalized transition models for longitudinal binary data with ignorable and non‐ignorable drop‐out , 2004, Statistics in medicine.

[30]  Ellen MacKenzie,et al.  Principal Stratification Designs to Estimate Input Data Missing Due to Death , 2007, Biometrics.

[31]  L. Fried,et al.  Factors Associated with Healthy Aging: The Cardiovascular Health Study , 2001, Journal of the American Geriatrics Society.

[32]  Ron Brookmeyer,et al.  Multidimensional Longitudinal Data: Estimating a Treatment Effect from Continuous, Discrete, or Time-to-Event Response Variables , 2000 .

[33]  D. Pauler,et al.  Pattern mixture models for longitudinal quality of life studies in advanced stage disease , 2003, Statistics in medicine.

[34]  D. Scharfstein,et al.  Causal inference for non-mortality outcomes in the presence of death. , 2007, Biostatistics.

[35]  H. Chui,et al.  The Modified Mini-Mental State (3MS) examination. , 1987, The Journal of clinical psychiatry.

[36]  Carole Dufouil,et al.  Analysis of longitudinal studies with death and drop‐out: a case study , 2004, Statistics in medicine.

[37]  J. Robins,et al.  Analysis of semiparametric regression models for repeated outcomes in the presence of missing data , 1995 .

[38]  D. Bennett,et al.  Terminal decline in cognitive function , 2003, Neurology.

[39]  D. Rubin,et al.  Principal Stratification in Causal Inference , 2002, Biometrics.

[40]  J. Williamson,et al.  The aging and dying processes and the health of older adults. , 2002, Journal of clinical epidemiology.

[41]  I. Siegler,et al.  The terminal drop hypothesis: fact or artifact? , 1975, Experimental aging research.

[42]  Patrick J Heagerty,et al.  Directly parameterized regression conditioning on being alive: analysis of longitudinal data truncated by deaths. , 2005, Biostatistics.

[43]  S. Ratcliffe,et al.  Joint Modeling of Longitudinal and Survival Data via a Common Frailty , 2004, Biometrics.

[44]  N M Laird,et al.  Generalized linear mixture models for handling nonignorable dropouts in longitudinal studies. , 2000, Biostatistics.