A Local Influence Approach Applied to Binary Data from a Psychiatric Study

Recently, a lot of concern has been raised about assumptions needed in order to fit statistical models to incomplete multivariate and longitudinal data. In response, research efforts are being devoted to the development of tools that assess the sensitivity of such models to often strong but always, at least in part, unverifiable assumptions. Many efforts have been devoted to longitudinal data, primarily in the selection model context, although some researchers have expressed interest in the pattern-mixture setting as well. A promising tool, proposed by Verbeke et al. (2001, Biometrics 57, 43-50), is based on local influence (Cook, 1986, Journal of the Royal Statistical Society, Series B 48, 133-169). These authors considered the Diggle and Kenward (1994, Applied Statistics 43, 49-93) model, which is based on a selection model, integrating a linear mixed model for continuous outcomes with logistic regression for dropout. In this article, we show that a similar idea can be developed for multivariate and longitudinal binary data, subject to nonmonotone missingness. We focus on the model proposed by Baker, Rosenberger, and DerSimonian (1992, Statistics in Medicine 11, 643-657). The original model is first extended to allow for (possibly continuous) covariates, whereafter a local influence strategy is developed to support the model-building process. The model is able to deal with nonmonotone missingness but has some limitations as well, stemming from the conditional nature of the model parameters. Some analytical insight is provided into the behavior of the local influence graphs.

[1]  D. Rubin Formalizing Subjective Notions about the Effect of Nonrespondents in Sample Surveys , 1977 .

[2]  S. Weisberg,et al.  Residuals and Influence in Regression , 1982 .

[3]  Emmanuel Lesaffre,et al.  Multiple group logistic discrimination , 1986 .

[4]  M. Lindstrom,et al.  A survey of methods for analyzing clustered binary response data , 1996 .

[5]  G. Molenberghs,et al.  Linear Mixed Models for Longitudinal Data , 2001 .

[6]  W F Rosenberger,et al.  Closed-form estimates for missing counts in two-way contingency tables. , 1992, Statistics in medicine.

[7]  R. Cook Assessment of Local Influence , 1986 .

[8]  J G Ibrahim,et al.  Maximum Likelihood Methods for Cure Rate Models with Missing Covariates , 2001, Biometrics.

[9]  Roderick J. A. Little,et al.  Modeling the Drop-Out Mechanism in Repeated-Measures Studies , 1995 .

[10]  G Molenberghs,et al.  Sensitivity Analysis for Nonrandom Dropout: A Local Influence Approach , 2001, Biometrics.

[11]  D. Rubin Multiple Imputation After 18+ Years , 1996 .

[12]  G. Molenberghs,et al.  Protective Estimation of Longitudinal Categorical Data With Nonrandom Dropout , 1997 .

[13]  Geert Molenberghs,et al.  Likelihood Based Frequentist Inference When Data Are Missing at Random , 1998 .

[14]  M. Kenward Selection models for repeated measurements with non-random dropout: an illustration of sensitivity. , 1998, Statistics in medicine.

[15]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[16]  Stuart G. Baker,et al.  Analyzing a Randomized Cancer Prevention Trial with a Missing Binary Outcome, an Auxiliary Variable, and All-or-None Compliance , 2000 .

[17]  M. Kenward,et al.  Informative Drop‐Out in Longitudinal Data Analysis , 1994 .

[18]  Scott L. Zeger,et al.  A Class of Logistic Regression Models for Multivariate Binary Time Series , 1989 .

[19]  田中 豊 Multivariate Statistical Modelling Based on Generalized Linear Models/Ludwig Fahrmeir,Gerhard Tutz(1994) , 1995 .

[20]  Geert Molenberghs,et al.  A local influence approach to sensitivity analysis of incomplete longitudinal ordinal data , 2001 .

[21]  Geert Molenberghs,et al.  Mastitis in diary cattle: local influence to assess sensitivity of the dropout process , 2001 .

[22]  Geert Molenberghs,et al.  Influence analysis to assess sensitivity of the dropout process , 2001 .

[23]  G. Molenberghs,et al.  Marginal Modeling of Correlated Ordinal Data Using a Multivariate Plackett Distribution , 1994 .

[24]  Donald B. Rubin,et al.  Selection Modeling Versus Mixture Modeling with Nonignorable Nonresponse , 1986 .

[25]  S G Baker The analysis of categorical case-control data subject to nonignorable nonresponse. , 1996, Biometrics.

[26]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[27]  M. Kenward,et al.  The analysis of longitudinal ordinal data with nonrandom drop-out , 1997 .

[28]  G. Molenberghs,et al.  Effect of dropouts in a longitudinal study: an application of a repeated ordinal model. , 1996, Statistics in medicine.

[29]  Geert Molenberghs,et al.  Sensitivity analysis for incomplete contingency tables: the Slovenian plebiscite case , 2001 .

[30]  M. Kenward,et al.  Informative dropout in longitudinal data analysis (with discussion) , 1994 .

[31]  B Rosner,et al.  Multivariate methods in ophthalmology with application to other paired-data situations. , 1984, Biometrics.

[32]  Michael G. Kenward,et al.  Nonrandom Missingness in Categorical Data: Strengt hs and Limitations , 1999 .

[33]  R. Wolfinger,et al.  Generalized linear mixed models a pseudo-likelihood approach , 1993 .

[34]  D. Cox The Analysis of Multivariate Binary Data , 1972 .

[35]  G. Molenberghs,et al.  Marginal modelling of multivariate categorical data. , 1999, Statistics in medicine.

[36]  Missing data perspectives of the fluvoxamine data set: a review. , 1999, Statistics in medicine.

[37]  A. Agresti,et al.  Categorical Data Analysis , 1991, International Encyclopedia of Statistical Science.

[38]  David Draper,et al.  Assessment and Propagation of Model Uncertainty , 2011 .

[39]  Geert Molenberghs,et al.  The Milk Protein Trial: Influence Analysis of the Dropout Process , 2000 .

[40]  J. Ashford,et al.  Multi-variate probit analysis. , 1970, Biometrics.

[41]  J. Ware,et al.  Random-effects models for serial observations with binary response. , 1984, Biometrics.

[42]  S. Burton,et al.  A Review of Fluvoxamine and its Uses in Depression , 1991, International clinical psychopharmacology.

[43]  J. Robins,et al.  Adjusting for Nonignorable Drop-Out Using Semiparametric Nonresponse Models , 1999 .

[44]  G Molenberghs,et al.  An application of maximum likelihood and generalized estimating equations to the analysis of ordinal data from a longitudinal study with cases missing at random. , 1994, Biometrics.