A Bayesian Vector Autoregressive Model with Nonignorable Missingness in Dependent Variables and Covariates: Development, Evaluation, and Application to Family Processes

Intensive longitudinal designs involving repeated assessments of constructs often face the problems of nonignorable attrition and selected omission of responses on particular occasions. However, time series models, such as vector autoregressive (VAR) models, are often fit to these data without consideration of nonignorable missingness. We introduce a Bayesian model that simultaneously represents the over-time dependencies in multivariate, multiple-subject time series data via a VAR model, and possible ignorable and nonignorable missingness in the data. We provide software code for implementing this model with application to an empirical data set. Moreover, simulation results comparing the joint approach with two-step multiple imputation procedures are included to shed light on the relative strengths and weaknesses of these approaches in practical data analytic scenarios.

[1]  M. Glasser,et al.  Linear Regression Analysis with Missing Observations among the Independent Variables , 1964 .

[2]  B. Gajewski,et al.  Non‐normal path analysis in the presence of measurement error and missing data: a Bayesian analysis of nursing homes' structure and outcomes , 2006, Statistics in medicine.

[3]  J. Heckman Sample Selection Bias as a Specification Error (with an Application to the Estimation of Labor Supply Functions) , 1977 .

[4]  D. Rubin Multiple Imputation After 18+ Years , 1996 .

[5]  N. Bolger,et al.  Intensive Longitudinal Methods: An Introduction to Diary and Experience Sampling Research , 2013 .

[6]  Joseph L. Schafer,et al.  Multiple imputation with PAN. , 2001 .

[7]  Sven L Klijn,et al.  Introducing the fit-criteria assessment plot – A visualisation tool to assist class enumeration in group-based trajectory modelling , 2017, Statistical methods in medical research.

[8]  Sy-Miin Chow,et al.  Methodological Issues and Extensions to the Latent Difference Score Framework 1 , 2018, Longitudinal Multivariate Psychology.

[9]  P. Allison Estimation of Linear Models with Incomplete Data , 1987 .

[10]  R. Little Pattern-Mixture Models for Multivariate Incomplete Data , 1993 .

[11]  J J McArdle,et al.  Latent growth curves within developmental structural equation models. , 1987, Child development.

[12]  Xinyuan Song,et al.  Finite mixture varying coefficient models for analyzing longitudinal heterogenous data , 2012, Statistics in medicine.

[13]  Snigdhansu Chatterjee,et al.  Structural Equation Modeling, A Bayesian Approach , 2008, Technometrics.

[14]  VAR Models in Macroeconomics - New Developments and Applications: Essays in Honor of Christopher A. Sims , 2013 .

[15]  S. Bressler,et al.  Granger Causality: Basic Theory and Application to Neuroscience , 2006, q-bio/0608035.

[16]  Stef van Buuren,et al.  MICE: Multivariate Imputation by Chained Equations in R , 2011 .

[17]  Michael P. Jones Indicator and stratification methods for missing explanatory variables in multiple linear regression , 1996 .

[18]  M. Beck,et al.  Family Violence , 2002 .

[19]  John H. Grych,et al.  Marital conflict and children's adjustment: a cognitive-contextual framework. , 1990, Psychological bulletin.

[20]  Fred C. Schweppe,et al.  Evaluation of likelihood functions for Gaussian signals , 1965, IEEE Trans. Inf. Theory.

[21]  S. Richardson,et al.  Strategy for modelling non-random missing data mechanisms in observational studies using Bayesian methods , 2010 .

[22]  John J. McArdle,et al.  A Simulation Study Comparison of Bayesian Estimation With Conventional Methods for Estimating Unknown Change Points , 2008 .

[23]  Sy-Miin Chow,et al.  (Re)evaluating the Implications of the Autoregressive Latent Trajectory Model Through Likelihood Ratio Tests of Its Initial Conditions , 2017, Multivariate behavioral research.

[24]  Fred C. Schweppe,et al.  Uncertain dynamic systems , 1973 .

[25]  Bradley P Carlin,et al.  Bayesian hierarchical models for network meta-analysis incorporating nonignorable missingness , 2017, Statistical methods in medical research.

[26]  Sy-Miin Chow,et al.  Equivalence and Differences Between Structural Equation Modeling and State-Space Modeling Techniques , 2010 .

[27]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[28]  A. C. Harvey,et al.  Assessing and Modeling the Cyclical Behavior of Rainfall in Northeast Brazil , 1987 .

[29]  Nilam Ram,et al.  Families as Coordinated Symbiotic Systems: Making use of Nonlinear Dynamic Models , 2014 .

[30]  Helmut Ltkepohl,et al.  New Introduction to Multiple Time Series Analysis , 2007 .

[31]  Patrick T. Davies,et al.  Marital conflict and child adjustment: an emotional security hypothesis. , 1994, Psychological bulletin.

[32]  A. Rotnitzky,et al.  Missing Data in Longitudinal Studies: Strategies for Bayesian Modeling and Sensitivity Analysis by DANIELS, M. J. and HOGAN, J. W , 2009 .

[33]  P. Allison Missing data techniques for structural equation modeling. , 2003, Journal of abnormal psychology.

[34]  M J Daniels,et al.  A Bayesian Shrinkage Model for Incomplete Longitudinal Binary Data With Application to the Breast Cancer Prevention Trial , 2010, Journal of the American Statistical Association.

[35]  Sy-Miin Chow,et al.  A Comparison of Bayesian and Frequentist Model Selection Methods for Factor Analysis Models , 2017, Psychological methods.

[36]  Daniel O Scharfstein,et al.  Incorporating prior beliefs about selection bias into the analysis of randomized trials with missing outcomes. , 2003, Biostatistics.

[37]  Karla Hemming,et al.  Bayesian sensitivity models for missing covariates in the analysis of survival data. , 2012, Journal of evaluation in clinical practice.

[38]  Raymond J. Carroll,et al.  Estimation and comparison of changes in the presence of informative right censoring by modeling the censoring process , 1988 .

[39]  C. Robert,et al.  Deviance information criteria for missing data models , 2006 .

[40]  Lauren M. Papp,et al.  Changes in marital conflict and youths' responses across childhood and adolescence: A test of sensitization , 2013, Development and Psychopathology.

[41]  S. Meldrum Progress in Ambulatory Assessment: Computer Assisted Psychological and Psychophysiological Methods in Monitoring and Field Studies , 2002 .

[42]  J. Heckman Sample selection bias as a specification error , 1979 .

[43]  J. Schafer,et al.  Missing data: our view of the state of the art. , 2002, Psychological methods.

[44]  Peter C M Molenaar,et al.  iVAR: A program for imputing missing data in multivariate time series using vector autoregressive models , 2014, Behavior research methods.

[45]  Sabine Schulze,et al.  The Science Of Real Time Data Capture Self Reports In Health Research , 2016 .

[46]  Stef van Buuren,et al.  Flexible Imputation of Missing Data , 2012 .

[47]  Martyn Plummer,et al.  JAGS: A program for analysis of Bayesian graphical models using Gibbs sampling , 2003 .

[48]  Ellen L. Hamaker,et al.  Conditions for the Equivalence of the Autoregressive Latent Trajectory Model and a Latent Growth Curve Model With Autoregressive Disturbances , 2005 .

[49]  Linying Ji,et al.  Handling Missing Data in the Modeling of Intensive Longitudinal Data , 2018, Structural equation modeling : a multidisciplinary journal.

[50]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[51]  M. Kenward,et al.  Informative Drop‐Out in Longitudinal Data Analysis , 1994 .

[52]  Sik-Yum Lee,et al.  Structural equation modelling: A Bayesian approach. , 2007 .

[53]  John R. Nesselroade,et al.  Longitudinal Research in the Study of Behavior and Development , 1979 .

[54]  Roderick J. A. Little,et al.  A Class of Pattern-Mixture Models for Normal Incomplete Data , 1994 .

[55]  Peter C M Molenaar,et al.  Statistical Modeling of the Individual: Rationale and Application of Multivariate Stationary Time Series Analysis , 2005, Multivariate behavioral research.

[56]  S. Richardson,et al.  Strategy for Modelling Nonrandom Missing Data Mechanisms in Observational Studies Using Bayesian Methods , 2012 .

[57]  Sy-Miin Chow,et al.  Age differences in dynamical emotion-cognition linkages. , 2007, Psychology and aging.

[58]  D. Qin Rise of VAR Modelling Approach , 2008 .

[59]  Sy-Miin Chow,et al.  Dynamic Structure of Emotions Among Individuals with Parkinson's Disease , 2004 .

[60]  J. Page,et al.  A methodological framework for capturing relative eyetracking coordinate data to determine gaze patterns and fixations from two or more observers , 2014, Behavior research methods.

[61]  Michael W Browne,et al.  Structural Equation Modeling of Multivariate Time Series , 2007, Multivariate behavioral research.

[62]  H. Stern,et al.  The use of multiple imputation for the analysis of missing data. , 2001, Psychological methods.

[63]  Adam Kapelner,et al.  Prediction with missing data via Bayesian Additive Regression Trees , 2013, ArXiv.

[64]  Martyn Plummer,et al.  Bayesian Graphical Models using MCMC , 2016 .

[65]  Sy-Miin Chow,et al.  Developmental family processes and interparental conflict: patterns of microlevel influences. , 2010, Developmental psychology.

[66]  D. Borsboom,et al.  Deconstructing the construct: A network perspective on psychological phenomena , 2013 .

[67]  Sy-Miin Chow,et al.  Dynamic infant-parent affect coupling during the face-to-face/still-face. , 2010, Emotion.

[68]  John W. Graham,et al.  Missing Data: Analysis and Design , 2012 .

[69]  Andrew Harvey,et al.  Forecasting, Structural Time Series Models and the Kalman Filter , 1990 .

[70]  Eva Ceulemans,et al.  DeCon: A tool to detect emotional concordance in multivariate time series data of emotional responding , 2014, Biological Psychology.

[71]  E. A. Thomas,et al.  Analyses of Parent-Infant Interaction. , 1976 .

[72]  D. Borsboom,et al.  Network analysis: an integrative approach to the structure of psychopathology. , 2013, Annual review of clinical psychology.

[73]  N M Laird,et al.  Model-based approaches to analysing incomplete longitudinal and failure time data. , 1997, Statistics in medicine.

[74]  Anurika Priyanjali De Silva,et al.  A comparison of multiple imputation methods for handling missing values in longitudinal data in the presence of a time-varying covariate with a non-linear association with time: a simulation study , 2017, BMC Medical Research Methodology.

[75]  P. Molenaar,et al.  An Idiographic Examination of Day-to-Day Patterns of Substance Use Craving, Negative Affect, and Tobacco Use Among Young Adults in Recovery , 2013, Multivariate behavioral research.

[76]  R Henderson,et al.  Joint modelling of longitudinal measurements and event time data. , 2000, Biostatistics.

[77]  Boris A. Skorohod,et al.  Diffuse Kalman Filter , 2017 .