Missing Data Theory

In this first chapter, I accomplish several goals. First, building on my 20+ years of work on missing data analysis, I outline a nomenclature or system for talking about the theory underlying the modern analysis of missing data. I intend for this nomenclature to be in plain English, but nevertheless to be an accurate representation of statistical theory relating to missing data analysis. Second, I describe many of the main components of missing data theory, including the causes or mechanisms of missingness. Two general methods for handling missing data, in particular multiple imputation (MI) and maximum-likelihood (ML) methods, have developed out of the missing data theory I describe here. And as will be clear from reading this book, I fully endorse these methods. For the remainder of this chapter, I challenge some of the commonly held beliefs relating to missing data theory and missing data analysis, and make a case that the MI and ML procedures, which have started to become mainstream in statistical analysis with missing data, are applicable in a much larger range of contexts that typically believed.

[1]  Roderick J. A. Little,et al.  Modeling the Drop-Out Mechanism in Repeated-Measures Studies , 1995 .

[2]  Hakan Demirtas,et al.  Multiple imputation under Bayesianly smoothed pattern‐mixture models for non‐ignorable drop‐out , 2005, Statistics in medicine.

[3]  Joseph L Schafer,et al.  On the performance of random‐coefficient pattern‐mixture models for non‐ignorable drop‐out , 2003, Statistics in medicine.

[4]  John W. Graham,et al.  Evaluating Interventions with Differential Attrition: The Importance of Nonresponse Mechanisms and Use of Follow-up Data , 1993 .

[5]  Michael Windle,et al.  The science of prevention: methodological advances from alcohol and substance abuse research. , 1997 .

[6]  D. Rubin Multiple imputation for nonresponse in surveys , 1989 .

[7]  J. Schafer,et al.  Missing data: our view of the state of the art. , 2002, Psychological methods.

[8]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data: Little/Statistical Analysis with Missing Data , 2002 .

[9]  J. Graham,et al.  Preventing alcohol, marijuana, and cigarette use among adolescents: peer pressure resistance training versus establishing conservative norms. , 1991, Preventive medicine.

[10]  J. Schafer,et al.  A comparison of inclusive and restrictive strategies in modern missing data procedures. , 2001, Psychological methods.

[11]  Roderick J. A. Little,et al.  A Class of Pattern-Mixture Models for Normal Incomplete Data , 1994 .

[12]  D. Hedeker,et al.  Bias reduction with an adjustment for participants' intent to dropout of a randomized controlled clinical trial , 2007, Clinical trials.

[13]  P. Bentler,et al.  Cutoff criteria for fit indexes in covariance structure analysis : Conventional criteria versus new alternatives , 1999 .

[14]  P. Bentler,et al.  Significance Tests and Goodness of Fit in the Analysis of Covariance Structures , 1980 .

[15]  J. Graham,et al.  Missing data analysis: making it work in the real world. , 2009, Annual review of psychology.

[16]  Jonaki Bose,et al.  NONRESPONSE BIAS ANALYSES AT THE NATIONAL CENTER FOR EDUCATION STATISTICS , 2001 .

[17]  P. Bentler,et al.  Comparative fit indexes in structural models. , 1990, Psychological bulletin.

[18]  Craig K Enders,et al.  Missing not at random models for latent growth curve analyses. , 2011, Psychological methods.

[19]  G. Molenberghs,et al.  Linear Mixed Models for Longitudinal Data , 2001 .

[20]  L. Tucker,et al.  A reliability coefficient for maximum likelihood factor analysis , 1973 .

[21]  John W. Graham,et al.  Analysis With Missing Data in Prevention Research , 1997 .

[22]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[23]  T. Cook,et al.  Quasi-experimentation: Design & analysis issues for field settings , 1979 .

[24]  R. Little Pattern-Mixture Models for Multivariate Incomplete Data , 1993 .

[25]  Donald Hedeker,et al.  Application of random-efiects pattern-mixture models for miss-ing data in longitudinal studies , 1997 .

[26]  M. Browne,et al.  Alternative Ways of Assessing Model Fit , 1992 .

[27]  Craig K. Enders,et al.  A Note on the Use of Missing Auxiliary Variables in Full Information Maximum Likelihood-Based Structural Equation Models , 2008 .

[28]  Jacob Cohen Statistical Power Analysis for the Behavioral Sciences , 1969, The SAGE Encyclopedia of Research Design.

[29]  Janice L. Krieger,et al.  Adapting School-Based Substance Use Prevention Curriculum Through Cultural Grounding: A Review and Exemplar of Adaptation Processes for Rural Schools , 2013, American journal of community psychology.

[30]  Donald B. Rubin,et al.  Multiple imputation in mixture models for nonignorable nonresponse with follow-ups , 1993 .

[31]  J. S. Long,et al.  Testing Structural Equation Models , 1993 .

[32]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[33]  J. H. Steiger Statistically based tests for the number of common factors , 1980 .