Estimating Missing Values from the General Social Survey: An Application of Multiple Imputation

Objectives. Most researchers who use survey data must grapple with the problem of how best to handle missing information. This article illustrates multiple imputation, a technique for estimating missing values in a multivariate setting. Methods. I use multiple imputation to estimate missing income data and update a recent study that examines the influence of parents’ standard of living on subjective well-being. Using data from the 1998 General Social Survey, two ordered probit models are estimated; one using complete cases only, and the other replacing missing income data with multiple imputation estimates. Results. The analysis produces two major findings: 1) parents’ standard of living is more important than suggested by the complete cases model, and 2) using multiple imputation can help to reduce standard errors. Conclusions. Multiple imputation allows a researcher to use more of the available data, thereby reducing biases that may occur when observations with missing data are simply deleted.

[1]  Richard A. Easterlin,et al.  Explaining happiness , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[2]  T. Raghunathan,et al.  Multiple Imputation of Family Income and Personal Earnings in the National Health Interview Survey: Methods and Examples , 2008 .

[3]  John Van Hoewyk,et al.  A multivariate technique for multiply imputing missing values using a sequence of regression models , 2001 .

[4]  Michael McBride,et al.  Relative-income effects on subjective well-being in the cross-section , 2001 .

[5]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data: Little/Statistical Analysis with Missing Data , 2002 .

[6]  J. Schafer,et al.  Missing data: our view of the state of the art. , 2002, Psychological methods.

[7]  J L Schafer,et al.  Multiple Imputation for Multivariate Missing-Data Problems: A Data Analyst's Perspective. , 1998, Multivariate behavioral research.

[8]  David E. Booth,et al.  Analysis of Incomplete Multivariate Data , 2000, Technometrics.

[9]  Rafael Di Tella,et al.  Some Uses of Happiness Data in Economics , 2006 .

[10]  D. Kahneman,et al.  Developments in the Measurement of Subjective Well-Being , 2006 .

[11]  B. Frey,et al.  What Can Economists Learn from Happiness Research? , 2001, SSRN Electronic Journal.

[12]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data , 1988 .

[13]  “ Multiple Imputation in Practice : Comparison of Software Packages for Regression Models With Missing Variables , ” , 2002 .

[14]  Trivellore E Raghunathan,et al.  What do we do with missing data? Some options for analysis of incomplete data. , 2004, Annual review of public health.