Integrating latent classes in the Bayesian shared parameter joint model of longitudinal and survival outcomes

Cystic fibrosis is a chronic lung disease requiring frequent lung-function monitoring to track acute respiratory events (pulmonary exacerbations). The association between lung-function trajectory and time-to-first exacerbation can be characterized using joint longitudinal-survival modeling. Joint models specified through the shared parameter framework quantify the strength of association between such outcomes but do not incorporate latent sub-populations reflective of heterogeneous disease progression. Conversely, latent class joint models explicitly postulate the existence of sub-populations but do not directly quantify the strength of association. Furthermore, choosing the optimal number of classes using established metrics like deviance information criterion is computationally intensive in complex models. To overcome these limitations, we integrate latent classes in the shared parameter joint model through a fully Bayesian approach. To choose the optimal number of classes, we construct a mixture model assuming more latent classes than present in the data, thereby asymptotically “emptying” superfluous latent classes, provided the Dirichlet prior on class proportions is sufficiently uninformative. Model properties are evaluated in simulation studies. Application to data from the US Cystic Fibrosis Registry supports the existence of three sub-populations corresponding to lung-function trajectories with high initial forced expiratory volume in 1 s (FEV1), rapid FEV1 decline, and low but steady FEV1 progression. The association between FEV1 and hazard of exacerbation was negative in each class, but magnitude varied.

[1]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[2]  P. Green Reversible jump Markov chain Monte Carlo computation and Bayesian model determination , 1995 .

[3]  M. Escobar,et al.  Bayesian Density Estimation and Inference Using Mixtures , 1995 .

[4]  G. Verbeke,et al.  A Linear Mixed-Effects Model with Heterogeneity in the Random-Effects Population , 1996 .

[5]  D. Thomas,et al.  Simultaneously modelling censored survival data and repeatedly measured covariates: a Gibbs sampling approach. , 1996, Statistics in medicine.

[6]  M. Wulfsohn,et al.  A joint model for survival and longitudinal data measured with error. , 1997, Biometrics.

[7]  M. Stephens Dealing with label switching in mixture models , 2000 .

[8]  C. McCulloch,et al.  Latent Class Models for Joint Analysis of Longitudinal Biomarker and Event Process Data , 2002 .

[9]  Mark D Schluchter,et al.  Jointly modelling the relationship between survival and pulmonary function in cystic fibrosis patients , 2002, Statistics in medicine.

[10]  Joseph G Ibrahim,et al.  Bayesian Approaches to Joint Cure‐Rate and Longitudinal Models with Applications to Cancer Vaccine Trials , 2003, Biometrics.

[11]  Anastasios A. Tsiatis,et al.  Joint Modeling of Longitudinal and Time-to-Event Data : An Overview , 2004 .

[12]  Cécile Proust-Lima,et al.  Estimation of linear mixed models with a mixture of distribution for the random effects , 2005, Comput. Methods Programs Biomed..

[13]  C. Robert,et al.  Deviance information criteria for missing data models , 2006 .

[14]  Cécile Proust-Lima,et al.  Score Test for Conditional Independence Between Longitudinal Outcome and Time to Event Given the Classes in the Joint Latent Class Model , 2010, Biometrics.

[15]  K. Mengersen,et al.  Asymptotic behaviour of the posterior distribution in overfitted mixture models , 2011 .

[16]  Dimitris Rizopoulos,et al.  A Bayesian semiparametric multivariate joint model for multiple longitudinal outcomes and a time‐to‐event , 2011, Statistics in medicine.

[17]  Dimitris Rizopoulos,et al.  Joint Models for Longitudinal and Time-to-Event Data: With Applications in R , 2012 .

[18]  Annalisa V Piccorelli,et al.  Jointly modeling the relationship between longitudinal and survival data subject to left truncation with applications to cystic fibrosis. , 2012, Statistics in medicine.

[19]  Cécile Proust-Lima,et al.  Analysis of multivariate mixed longitudinal data: a flexible latent process approach. , 2012, The British journal of mathematical and statistical psychology.

[20]  Eleni-Rosalina Andrinopoulou,et al.  Joint modeling of two longitudinal outcomes and competing risk data , 2014, Statistics in medicine.

[21]  Cécile Proust-Lima,et al.  Joint latent class models for longitudinal and time-to-event data: A review , 2014, Statistical methods in medical research.

[22]  Benoit Liquet,et al.  Estimation of extended mixed models using latent classes and latent processes: the R package lcmm , 2015, 1503.00890.

[23]  Robin Henderson,et al.  Joint modelling of repeated measurements and time-to-event outcomes: flexible model specification and exact likelihood inference , 2014, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[24]  Graeme L. Hickey,et al.  Joint modelling of time-to-event and multivariate longitudinal outcomes: recent developments and issues , 2016, BMC Medical Research Methodology.

[25]  H. Jacqmin-Gadda,et al.  Joint latent class model for longitudinal data and interval‐censored semi‐competing events: Application to dementia , 2015, Biometrics.

[26]  Scott Sagel,et al.  A comparison of change point models with application to longitudinal lung function measurements in children with cystic fibrosis , 2016, Statistics in medicine.

[27]  E. Lesaffre,et al.  Comparison of Criteria for Choosing the Number of Classes in Bayesian Finite Mixture Models , 2017, PloS one.

[28]  Rhonda Szczesniak,et al.  Use of FEV1 in cystic fibrosis epidemiologic studies and clinical trials: A statistical perspective for the clinical researcher. , 2017, Journal of cystic fibrosis : official journal of the European Cystic Fibrosis Society.

[29]  John Pestian,et al.  Phenotypes of Rapid Cystic Fibrosis Lung Disease Progression during Adolescence and Young Adulthood , 2017, American journal of respiratory and critical care medicine.

[30]  Gertraud Malsiner-Walli,et al.  From here to infinity: sparse finite versus Dirichlet process mixtures in model-based clustering , 2018, Advances in Data Analysis and Classification.