Bayesian variable selection for multistate Markov models with interval‐censored data in an ecological momentary assessment study of smoking cessation

The application of sophisticated analytical methods to intensive longitudinal data, collected with ecological momentary assessments (EMA), has helped researchers better understand smoking behaviors after a quit attempt. Unfortunately, the wealth of information captured with EMAs is typically underutilized in practice. Thus, novel methods are needed to extract this information in exploratory research studies. One of the main objectives of intensive longitudinal data analysis is identifying relations between risk factors and outcomes of interest. Our goal is to develop and apply expectation maximization variable selection for Bayesian multistate Markov models with interval-censored data to generate new insights into the relation between potential risk factors and transitions between smoking states. Through simulation, we demonstrate the effectiveness of our method in identifying associated risk factors and its ability to outperform the LASSO in a special case. Additionally, we use the expectation conditional-maximization algorithm to simplify estimation, a deterministic annealing variant to reduce the algorithm's dependence on starting values, and Louis's method to estimate unknown parameter uncertainty. We then apply our method to intensive longitudinal data collected with EMA to identify risk factors associated with transitions between smoking states after a quit attempt in a cohort of socioeconomically disadvantaged smokers who were interested in quitting.

[1]  Andrew C Titman,et al.  Model diagnostics for multi-state models , 2010, Statistical methods in medical research.

[2]  C. Chatfield Model uncertainty, data mining and statistical inference , 1995 .

[3]  R H Jones,et al.  Multi-state models and diabetic retinopathy. , 1995, Statistics in medicine.

[4]  S. Tiffany,et al.  A systematic review of the relationships between craving and smoking cessation. , 2013, Nicotine & tobacco research : official journal of the Society for Research on Nicotine and Tobacco.

[5]  Minxing Chen,et al.  Predicting quit attempts among homeless smokers seeking cessation treatment: an ecological momentary assessment study. , 2014, Nicotine & tobacco research : official journal of the Society for Research on Nicotine and Tobacco.

[6]  Gary K Grunwald,et al.  Continuous Time Markov Models for Binary Longitudinal Data , 2006, Biometrical journal. Biometrische Zeitschrift.

[7]  Veronika Rockova,et al.  EMVS: The EM Approach to Bayesian Variable Selection , 2014 .

[8]  J Grüger,et al.  The validity of inferences based on incomplete observations in disease state models. , 1991, Biometrics.

[9]  D. Kendzor,et al.  Financial incentives for abstinence among socioeconomically disadvantaged individuals in smoking cessation treatment. , 2015, American journal of public health.

[10]  S Shiffman,et al.  Dynamic effects of self-efficacy on smoking lapse and relapse. , 2000, Health psychology : official journal of the Division of Health Psychology, American Psychological Association.

[11]  B. Tom,et al.  The versatility of multi-state models for the analysis of longitudinal data with unobservable features , 2012, Lifetime Data Analysis.

[12]  New York Dover,et al.  ON THE CONVERGENCE PROPERTIES OF THE EM ALGORITHM , 1983 .

[13]  E. George,et al.  Journal of the American Statistical Association is currently published by American Statistical Association. , 2007 .

[14]  Thomas Kneib,et al.  Structured fusion lasso penalized multi‐state models , 2016, Statistics in medicine.

[15]  T. Louis Finding the Observed Information Matrix When Using the EM Algorithm , 1982 .

[16]  Xiao-Li Meng,et al.  Maximum likelihood estimation via the ECM algorithm: A general framework , 1993 .

[17]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[18]  Naonori Ueda,et al.  Deterministic annealing EM algorithm , 1998, Neural Networks.

[19]  Saul Shiffman,et al.  What can hunger teach us about drug craving? A comparative analysis of the two constructs , 1992 .

[20]  Saul Shiffman,et al.  Remember that? A comparison of real-time versus retrospective recall of smoking lapses. , 1997, Journal of consulting and clinical psychology.

[21]  Momiao Xiong,et al.  Analysis of transtheoretical model of health behavioral changes in a nutrition intervention study—a continuous time Markov chain model with Bayesian approach , 2015, Statistics in medicine.

[22]  R Kay,et al.  A Markov model for analysing cancer markers and disease states in survival studies. , 1986, Biometrics.

[23]  J. Kalbfleisch,et al.  The Analysis of Panel Data under a Markov Assumption , 1985 .

[24]  V T Farewell,et al.  A Pearson‐type goodness‐of‐fit test for stationary and time‐continuous Markov regression models , 2002, Statistics in medicine.

[25]  W. Chan,et al.  Analysis of Longitudinal Multinomial Outcome Data , 2006, Biometrical journal. Biometrische Zeitschrift.

[26]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[27]  Heng Lian,et al.  The Expectation-Maximization approach for Bayesian quantile regression , 2016, Comput. Stat. Data Anal..

[28]  C Combescure,et al.  The analysis of asthma control under a Markov assumption with use of covariates , 2003, Statistics in medicine.

[29]  S Shiffman,et al.  A day at a time: predicting smoking lapse from daily urge. , 1997, Journal of abnormal psychology.

[30]  Runze Li,et al.  Time-varying processes involved in smoking lapse in a randomized trial of smoking cessation therapies. , 2014, Nicotine & tobacco research : official journal of the Society for Research on Nicotine and Tobacco.

[31]  T. Piasecki Relapse to smoking. , 2006, Clinical psychology review.

[32]  Thomas Kneib,et al.  Boosting multi-state models , 2015, Lifetime Data Analysis.

[33]  Robert West,et al.  Predictors of successful and unsuccessful quit attempts among smokers motivated to quit. , 2014, Addictive behaviors.

[34]  Amy Ming-Fang Yen,et al.  A Markov regression random‐effects model for remission of functional disability in patients following a first stroke: A Bayesian approach , 2007, Statistics in medicine.

[35]  Runze Li,et al.  A time-varying effect model for intensive longitudinal data. , 2012, Psychological methods.

[36]  Robert West,et al.  Attempts to quit smoking and relapse: factors associated with success or failure from the ATTEMPT cohort study. , 2009, Addictive behaviors.