Joint analysis of mixed types of outcomes with latent variables

We propose a joint modeling approach to investigating the observed and latent risk factors of mixed types of outcomes. The proposed model comprises three parts. The first part is an exploratory factor analysis model that summarizes latent factors through multiple observed variables. The second part is a proportional hazards model that examines the observed and latent risk factors of multivariate time-to-event outcomes. The third part is a linear regression model that investigates the determinants of a continuous outcome. We develop a Bayesian approach coupled with MCMC methods to determine the number of latent factors, the association between latent and observed variables, and the important risk factors of different types of outcomes. A modified stochastic search item selection algorithm, which introduces normal-mixture-inverse gamma priors to factor loadings and regression coefficients, is developed for simultaneous model selection and parameter estimation. The proposed method is subjected to simulation studies for empirical performance assessment and then applied to a study concerning the risk factors of type 2 diabetes and the associated complications.

[1]  H. Shin,et al.  Implication of Genetic Variants Near TCF7L2, SLC30A8, HHEX, CDKAL1, CDKN2A/B, IGF2BP2, and FTO in Type 2 Diabetes and Obesity in 6,719 Asians , 2008, Diabetes.

[2]  Jianwen Cai,et al.  Partially Linear Hazard Regression for Multivariate Survival Data , 2007 .

[3]  G. Malsiner‐Walli,et al.  Comparing Spike and Slab Priors for Bayesian Variable Selection , 2016, 1812.07259.

[4]  Xilin Yang,et al.  Metabolic Syndrome Predicts New Onset of Chronic Kidney Disease in 5,829 Patients With Type 2 Diabetes , 2008, Diabetes Care.

[5]  W. Wong,et al.  The calculation of posterior distributions by data augmentation , 1987 .

[6]  Richard D. Gill,et al.  A counting process approach to maximum likelihood estimation in frailty models , 1992 .

[7]  Joseph G. Ibrahim,et al.  A Bayesian justification of Cox's partial likelihood , 2003 .

[8]  D. Dunson,et al.  Sparse Bayesian infinite factor models. , 2011, Biometrika.

[9]  Jason Roy,et al.  Analysis of Multivariate Longitudinal Outcomes With Nonignorable Dropouts and Missing Covariates , 2002 .

[10]  Xinyuan Song,et al.  Bayesian proportional hazards model with latent variables , 2019, Statistical methods in medical research.

[11]  M. Ng,et al.  Phenotype–genotype interactions on renal function in type 2 diabetes: an analysis using structural equation modelling , 2009, Diabetologia.

[12]  Kani Chen,et al.  ANALYSIS OF MULTIVARIATE FAILURE TIME DATA USING MARGINAL PROPORTIONAL HAZARDS MODEL. , 2010, Statistica Sinica.

[13]  Xihong Lin,et al.  Latent Variable Models for Longitudinal Data with Multiple Continuous Outcomes , 2000, Biometrics.

[14]  L. Fahrmeir,et al.  Spike-and-Slab Priors for Function Selection in Structured Additive Regression Models , 2011, 1105.5250.

[15]  Yongdai Kim,et al.  Bayesian analysis for monotone hazard ratio , 2011, Lifetime data analysis.

[16]  T. Berge,et al.  Generic global indentification in factor analysis , 1997 .

[17]  L. Ryan,et al.  Latent variable models with fixed effects. , 1996, Biometrics.

[18]  Joseph G. Ibrahim,et al.  Bayesian frailty models based on box-cox transformed hazards , 2005 .

[19]  W. Gilks,et al.  Adaptive Rejection Metropolis Sampling Within Gibbs Sampling , 1995 .

[20]  S. Skevington,et al.  The World Health Organization's WHOQOL-BREF quality of life assessment: Psychometric properties and results of the international field trial. A Report from the WHOQOL Group , 2004, Quality of Life Research.

[21]  D. Clayton,et al.  Multivariate generalizations of the proportional hazards model , 1985 .

[22]  Xinyuan Song,et al.  Semiparametric Latent Variable Models With Bayesian P-Splines , 2010 .

[23]  Jianqing Fan,et al.  Variable Selection for Cox's proportional Hazards Model and Frailty Model , 2002 .

[24]  Scott M. Berry,et al.  Bayesian Smoothing and Regression Splines for Measurement Error Problems , 2002 .

[25]  Eleni-Rosalina Andrinopoulou,et al.  Improved dynamic predictions from joint models of longitudinal and survival data with time‐varying effects using P‐splines , 2016, Biometrics.

[26]  Zhiliang Ying,et al.  Marginal proportional hazards models for multiple event‐time data , 2001 .

[27]  Dimitris Mavridis,et al.  Stochastic search item selection for factor analytic models. , 2014, The British journal of mathematical and statistical psychology.

[28]  S. Lang,et al.  Bayesian P-Splines , 2004 .

[29]  W. Ledermann On the rank of the reduced correlational matrix in multiple-factor analysis , 1937 .

[30]  D. Lin,et al.  Cox regression analysis of multivariate failure time data: the marginal approach. , 1994, Statistics in medicine.

[31]  L. J. Wei,et al.  Regression analysis of multivariate incomplete failure time data by modeling marginal distributions , 1989 .

[32]  E. George,et al.  Journal of the American Statistical Association is currently published by American Statistical Association. , 2007 .

[33]  J. Berger,et al.  Optimal predictive model selection , 2004, math/0406464.

[34]  A. V. D. Vaart,et al.  BAYESIAN LINEAR REGRESSION WITH SPARSE PRIORS , 2014, 1403.0735.

[35]  Yongdai Kim,et al.  Bayesian Analysis of the Proportional Hazards Model with Time‐Varying Coefficients , 2017 .

[36]  D. Oakes,et al.  Frailty Models and Rank Tests , 1998, Lifetime data analysis.

[37]  Xin-Yuan Song,et al.  Regression Analysis of Additive Hazards Model With Latent Variables , 2015 .