A multiple imputation‐based sensitivity analysis approach for data subject to missing not at random

Missingness mechanism is in theory unverifiable based only on observed data. If there is a suspicion of missing not at random, researchers often perform a sensitivity analysis to evaluate the impact of various missingness mechanisms. In general, sensitivity analysis approaches require a full specification of the relationship between missing values and missingness probabilities. Such relationship can be specified based on a selection model, a pattern-mixture model or a shared parameter model. Under the selection modeling framework, we propose a sensitivity analysis approach using a nonparametric multiple imputation strategy. The proposed approach only requires specifying the correlation coefficient between missing values and selection (response) probabilities under a selection model. The correlation coefficient is a standardized measure and can be used as a natural sensitivity analysis parameter. The sensitivity analysis involves multiple imputations of missing values, yet the sensitivity parameter is only used to select imputing/donor sets. Hence, the proposed approach might be more robust against misspecifications of the sensitivity parameter. For illustration, the proposed approach is applied to incomplete measurements of level of preoperative Hemoglobin A1c, for patients who had high-grade carotid artery stenosisa and were scheduled for surgery. A simulation study is conducted to evaluate the performance of the proposed approach.

[1]  Shihti Yu,et al.  Collinearity and Two-Step Estimation of Sample Selection Models:Problems, Origins, and Remedies , 2000 .

[2]  Qi Long,et al.  Doubly Robust Nonparametric Multiple Imputation for Ignorable Missing Data. , 2012, Statistica Sinica.

[3]  R. Little Pattern-Mixture Models for Multivariate Incomplete Data , 1993 .

[4]  Jeremy MG Taylor,et al.  Partially parametric techniques for multiple imputation , 1996 .

[5]  J. Heckman Shadow prices, market wages, and labor supply , 1974 .

[6]  J. Copas,et al.  Inference for Non‐random Samples , 1997 .

[7]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data: Little/Statistical Analysis with Missing Data , 2002 .

[8]  S. Jolani Dual Imputation Strategies for Analyzing Incomplete Data , 2010 .

[9]  Doubly robust multiple imputation using kernel‐based techniques , 2016, Biometrical journal. Biometrische Zeitschrift.

[10]  Sylvie Chevret,et al.  A multiple imputation approach for MNAR mechanisms compatible with Heckman's model , 2016, Statistics in medicine.

[11]  Thomas R Belin,et al.  Multiple imputation using an iterative hot‐deck with distance‐based donor selection , 2008, Statistics in medicine.

[12]  Wei Zhou,et al.  Prospective neurocognitive evaluation of patients undergoing carotid interventions. , 2012, Journal of vascular surgery.

[13]  Donald B Rubin,et al.  Sensitivity analysis for a partially missing binary outcome in a two‐arm randomized clinical trial , 2014, Statistics in medicine.

[14]  Gary S Collins,et al.  A robust imputation method for missing responses and covariates in sample selection models , 2019, Statistical methods in medical research.

[15]  D. Rubin Formalizing Subjective Notions about the Effect of Nonrespondents in Sample Surveys , 1977 .

[16]  M Chavance,et al.  Sensitivity analysis of incomplete longitudinal data departing from the missing at random assumption: Methodology and application in a clinical trial with drop-outs , 2016, Statistical methods in medical research.

[17]  Xu Yan,et al.  Missing Data Handling Methods in Medical Device Clinical Trials , 2009, Journal of biopharmaceutical statistics.

[18]  J. Heckman Sample selection bias as a specification error , 1979 .