Maximizing the Usefulness of Data Obtained with Planned Missing Value Patterns: An Application of Maximum Likelihood Procedures.

Researchers often face a dilemma: Should they collect little data and emphasize quality, or much data at the expense of quality? The utility of the 3-form design coupled with maximum likelihood methods for estimation of missing values was evaluated. In 3-form design surveys, four sets of items. X, A, B, and C are administered: Each third of the subjects receives X and one combination of two other item sets - AB, BC, or AC. Variances and covariances were estimated with pairwise deletion, mean replacement, single imputation, multiple imputation, raw data maximum likelihood, multiple-group covariance structure modeling, and Expectation-Maximization (EM) algorithm estimation. The simulation demonstrated that maximum likelihood estimation and multiple imputation methods produce the most efficient and least biased estimates of variances and covariances for normally distributed and slightly skewed data when data are missing completely at random (MCAR). Pairwise deletion provided equally unbiased estimates but was less efficient than ML procedures. Further simulation results demonstrated that nun-maximum likelihood methods break down when data are not missing completely at random. Application of these methods with empirical drug use data resulted in similar covariance matrices for pairwise and EM estimation, however, ML estimation produced better and more efficient regression estimates. Maximum likelihood estimation or multiple imputation procedures. which are now becoming more readily available, are always recommended. In order to maximize the efficiency of the ML parameter estimates, it is recommended that scale items be split across forms rather than being left intact within forms.

[1]  D. Mackinnon,et al.  Mediating mechanisms in a school-based drug prevention program: first-year effects of the Midwestern Prevention Project. , 1991, Health psychology : official journal of the Division of Health Psychology, American Psychological Association.

[2]  J. Graham,et al.  Analysis with missing data in drug prevention research. , 1994, NIDA research monograph.

[3]  P. Allison Estimation of Linear Models with Incomplete Data , 1987 .

[4]  J. Graham,et al.  Convergent and discriminant validity for assessment of skill in resisting a role play alcohol offer. , 1989 .

[5]  J. Graham,et al.  Differential Impact of Three Alcohol Prevention Curricula on Hypothesized Mediating Variables , 1988, Journal of drug education.

[6]  I. Higgins The epidemiology of chronic respiratory disease. , 1973, Preventive medicine.

[7]  J. Graham,et al.  Evaluation of resistance skills training using multitrait-multimethod role play skill assessments , 1987 .

[8]  Peter M. Bentler,et al.  EQS : structural equations program manual , 1989 .

[9]  J. Graham,et al.  Program integrity as a moderator of prevention program effectiveness: results for fifth-grade students in the adolescent alcohol prevention trial. , 1991, Journal of studies on alcohol.

[10]  Bengt Muthén,et al.  On structural equation modeling with data that are not missing completely at random , 1987 .

[11]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[12]  John W. Graham,et al.  Analysis With Missing Data in Prevention Research , 1997 .

[13]  Roderick J. A. Little,et al.  The Analysis of Social Science Data with Missing Values , 1989 .

[14]  J. Goodnight A Tutorial on the SWEEP Operator , 1979 .

[15]  J. Graham,et al.  Preventing alcohol, marijuana, and cigarette use among adolescents: peer pressure resistance training versus establishing conservative norms. , 1991, Preventive medicine.