Bayesian Sensitivity Analysis for Non-ignorable Missing Data in Longitudinal Studies

The use of Bayesian statistical methods to handle missing data in biomedical studies has become popular in recent years. In this paper, we propose a novel Bayesian sensitivity analysis (BSA) technique that accounts for the influences of missing outcome data on the estimation of treatment effects in longitudinal studies with non-ignorable missing data. The approach uses a pattern-mixture model for the complete data, which is indexed by non-identifiable sensitivity parameters that accounts for the effect of missingness on the observations. We implement the method using the probabilistic programming language Stan, and apply it to data from the Vancouver At Home Study, which is a randomized control trial that provided housing to homeless people with mental illness. We compare the results of BSA to those from an existing Bayesian longitudinal model that ignores the missing data mechanism in the outcome. Furthermore, we demonstrate in a simulation study that when we use a diffuse conservative prior that describes a range of assumptions about the non-ignorable missingness, then BSA credible intervals have greater length and higher coverage rate of the target parameters than existing methods.

[1]  Sander Greenland,et al.  Interval Estimation for Messy Observational Data , 2009, 1010.0306.

[2]  D. Rubin Formalizing Subjective Notions about the Effect of Nonrespondents in Sample Surveys , 1977 .

[3]  Daniel O Scharfstein,et al.  Incorporating prior beliefs about selection bias into the analysis of randomized trials with missing outcomes. , 2003, Biostatistics.

[4]  Stef van Buuren,et al.  Multiple imputation of discrete and continuous data by fully conditional specification , 2007 .

[5]  J. Listing,et al.  A Nonparametric Test for Random Dropouts , 2003 .

[6]  Donald Tomaskovic-Devey,et al.  Organizational Survey Nonresponse , 1994 .

[7]  B. Rannala Identi(cid:142)ability of Parameters in MCMC Bayesian Inference of Phylogeny , 2002 .

[8]  J. Robins,et al.  Adjusting for Nonignorable Drop-Out Using Semiparametric Nonresponse Models , 1999 .

[9]  Stef van Buuren,et al.  MICE: Multivariate Imputation by Chained Equations in R , 2011 .

[10]  J. Frankish,et al.  The Vancouver At Home Study: Overview and Methods of a Housing First Trial Among Individuals Who are Homeless and Living with Mental Illness , 2012 .

[11]  J. Ware,et al.  Applied Longitudinal Analysis , 2004 .

[12]  D. Rubin,et al.  Statistical Analysis with Missing Data , 1988 .

[13]  J. Schafer,et al.  Missing data: our view of the state of the art. , 2002, Psychological methods.

[14]  A. Palepu,et al.  Vancouver At Home: pragmatic randomized trials investigating Housing First for homeless and mentally ill adults , 2013, Trials.

[15]  Todd E. Bodner,et al.  What Improves with Increased Missing Data Imputations? , 2008 .

[16]  R D Gill,et al.  Non-response models for the analysis of non-monotone ignorable missing data. , 1997, Statistics in medicine.

[17]  H. Doll,et al.  The Prevalence of Mental Disorders among the Homeless in Western Countries: Systematic Review and Meta-Regression Analysis , 2008, PLoS medicine.

[18]  R. Cooke,et al.  BAYESIAN SENSITIVITY ANALYSIS , 2001 .

[19]  M. Kenward Selection models for repeated measurements with non-random dropout: an illustration of sensitivity. , 1998, Statistics in medicine.

[20]  Joseph G. Ibrahim,et al.  Missing data methods in longitudinal studies: a review , 2009 .

[21]  T. Raghunathan,et al.  A Bayesian Approach for Clustered Longitudinal Ordinal Outcome With Nonignorable Missing Data , 2006 .

[22]  AN INFLUENCE APPROACH FOR SENSITIVITY ANALYSIS OF NON-RANDOM DROPOUT BASED ON THE COVARIANCE STRUCTURE , 2005 .

[23]  A. Lehman,et al.  Graded response modeling of the Quality of Life Interview , 1999 .

[24]  Juned Siddique,et al.  Using an Approximate Bayesian Bootstrap to multiply impute nonignorable missing data , 2008, Comput. Stat. Data Anal..

[25]  M. Kenward,et al.  Multiple imputation for missing data in epidemiological and clinical research: potential and pitfalls , 2009, BMJ : British Medical Journal.

[26]  Craig K. Enders,et al.  Applied Missing Data Analysis , 2010 .

[27]  Joseph G. Ibrahim,et al.  Missing covariates in generalized linear models when the missing data mechanism is non‐ignorable , 1999 .

[28]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[29]  C. Adair,et al.  The At Home/Chez Soi trial protocol: a pragmatic, multi-site, randomised controlled trial of a Housing First intervention for homeless individuals with mental illness in five Canadian cities , 2011, BMJ Open.

[30]  T. Gulliver,et al.  The State of Homelessness in Canada 2013 , 2013 .

[31]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[32]  D. Rubin Multiple Imputation After 18+ Years , 1996 .

[33]  M. Daniels,et al.  A Note on MAR, Identifying Restrictions, Model Comparison, and Sensitivity Analysis in Pattern Mixture Models with and without Covariates for Incomplete Data , 2011, Biometrics.

[34]  N M Laird,et al.  Missing data in longitudinal studies. , 1988, Statistics in medicine.

[35]  A. Palepu,et al.  Housing First improves subjective quality of life among homeless adults with mental illness: 12-month findings from a randomized controlled trial in Vancouver, British Columbia , 2013, Social Psychiatry and Psychiatric Epidemiology.

[36]  N M Laird,et al.  Mixture models for the joint distribution of repeated measures and event times. , 1997, Statistics in medicine.

[37]  Hui Xie Bayesian inference from incomplete longitudinal data: a simple method to quantify sensitivity to nonignorable dropout. , 2009, Statistics in medicine.

[38]  Geert Molenberghs,et al.  Missing Data in Clinical Studies , 2007 .

[39]  Jiqiang Guo,et al.  Stan: A Probabilistic Programming Language. , 2017, Journal of statistical software.

[40]  Niko A Kaciroti,et al.  Bayesian sensitivity analysis of incomplete data: bridging pattern‐mixture and selection models , 2014, Statistics in medicine.

[41]  Balgobin Nandram,et al.  Hierarchical Bayesian Nonresponse Models for Binary Data From Small Areas With Uncertainty About Ignorability , 2002 .

[42]  Roderick J. A. Little,et al.  The Analysis of Social Science Data with Missing Values , 1989 .

[43]  Trivellore E Raghunathan,et al.  A Bayesian model for longitudinal count data with non‐ignorable dropout , 2008, Journal of the Royal Statistical Society. Series C, Applied statistics.

[44]  D. Hand,et al.  Advising on research methods: A consultant's companion , 2011 .

[45]  Edward L Spitznagel,et al.  Are rates of psychiatric disorders in the homeless population changing? , 2004, American journal of public health.

[46]  N M Laird,et al.  Generalized linear mixture models for handling nonignorable dropouts in longitudinal studies. , 2000, Biostatistics.

[47]  Ken P Kleinman,et al.  Much Ado About Nothing , 2007, The American statistician.

[48]  J. Robins,et al.  A Structural Approach to Selection Bias , 2004, Epidemiology.

[49]  A. Rotnitzky,et al.  Missing Data in Longitudinal Studies: Strategies for Bayesian Modeling and Sensitivity Analysis by DANIELS, M. J. and HOGAN, J. W , 2009 .

[50]  A. Troxel,et al.  AN INDEX OF LOCAL SENSITIVITY TO NONIGNORABILITY , 2004 .

[51]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[52]  P. Diggle,et al.  Testing for random dropouts in repeated measurement data. , 1989, Biometrics.

[53]  G Molenberghs,et al.  Identifying the types of missingness in quality of life data from clinical trials. , 1998, Statistics in medicine.

[54]  R. Little A Test of Missing Completely at Random for Multivariate Data with Missing Values , 1988 .

[55]  D. Kiel,et al.  Lack of an Association Between Insulin‐like Growth Factor‐I and Body Composition, Muscle Strength, Physical Performance or Self‐Reported Mobility Among Older Persons with Functional Limitations , 1998, Journal of the American Geriatrics Society.

[56]  J. Listing,et al.  TESTS IF DROPOUTS ARE MISSED AT RANDOM , 1998 .

[57]  P. Gustafson,et al.  Bayesian sensitivity analysis for unmeasured confounding in observational studies , 2007, Statistics in medicine.

[58]  Mohammad Mahdi Shariati,et al.  Identifiability of parameters and behaviour of MCMC chains: a case study using the reaction norm model. , 2009, Journal of animal breeding and genetics = Zeitschrift fur Tierzuchtung und Zuchtungsbiologie.

[59]  G Molenberghs,et al.  Sensitivity Analysis for Nonrandom Dropout: A Local Influence Approach , 2001, Biometrics.