Time to Diagnosis: Accounting for Differential Endpoint Follow-up in Multi-Cohort Studies

Cox regression is widely used to analyze discrete survival time data. Differential endpoint follow-up across sub-cohorts where distribution of a covariate varies may cause typical estimators to be biased or inefficient. We demonstrate that with Cardiovascular Health Study data for incident type 2 diabetes. Two cohorts with extremely different race distribution have differential follow-up for fasting glucose levels. We study various scenarios of Cox regression. We suggest an alternative approach, Poisson generalized estimating equations with an offset to accommodate the differential follow-up. We use simulations to contrast the methods.

[1]  Wei Pan Extending the Iterative Convex Minorant Algorithm to the Cox Model for Interval-Censored Data , 1999 .

[2]  D. Mozaffarian,et al.  Lifestyle risk factors and new-onset diabetes mellitus in older adults: the cardiovascular health study. , 2009, Archives of internal medicine.

[3]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[4]  V. Vaccarino,et al.  Longitudinal association between depressive symptoms and incident type 2 diabetes mellitus in older adults: the cardiovascular health study. , 2007, Archives of internal medicine.

[5]  R B D'Agostino,et al.  Relation of pooled logistic regression to time dependent Cox regression analysis: the Framingham Heart Study. , 1990, Statistics in medicine.

[6]  T. Holford The analysis of rates and of survivorship using log-linear models. , 1980, Biometrics.

[7]  T. Dawber,et al.  The Framingham Study: The Epidemiology of Atherosclerotic Disease , 1980 .

[8]  R. D'Agostino,et al.  EPIDEMIOLOGICAL BACKGROUND AND DESIGN : THE FRAMINGHAM STUDY , 2022 .

[9]  P. Savage,et al.  Assessing the use of medications in the elderly: methods and initial experience in the Cardiovascular Health Study. The Cardiovascular Health Study Collaborative Research Group. , 1992, Journal of clinical epidemiology.

[10]  I. Hertz-Picciotto,et al.  Validity and efficiency of approximation methods for tied survival times in Cox regression. , 1997, Biometrics.

[11]  L. Fried,et al.  Long‐Term Function in an Older Cohort—The Cardiovascular Health Study All Stars Study , 2009, Journal of the American Geriatrics Society.

[12]  R B D'Agostino,et al.  Comparison of baseline and repeated measure covariate techniques in the Framingham Heart Study. , 1988, Statistics in medicine.

[13]  D. Mozaffarian,et al.  Association Between Adiposity in Midlife and Older Age and Risk of Diabetes in Older Adults , 2010 .

[14]  A. Dreher Modeling Survival Data Extending The Cox Model , 2016 .

[15]  L. Kuller,et al.  The relation of markers of inflammation to the development of glucose disorders in the elderly: the Cardiovascular Health Study. , 2001, Diabetes.

[16]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[17]  Wei Pan,et al.  Extending the Iterative Convex Minorant Algorithm to the Cox Model for Interval Censored Data , 2011 .

[18]  Thomas Lumley,et al.  Longitudinal data analysis for generalized linear models with follow‐up dependent on outcome‐related variables , 2007 .

[19]  Nan M. Laird,et al.  Covariance Analysis of Censored Survival Data Using Log-Linear Analysis Techniques , 1981 .

[20]  R. Kronmal,et al.  The Cardiovascular Health Study: design and rationale. , 1991, Annals of epidemiology.