A contribution on the nature and treatment of missing data in large market surveys

ABSTRACT Nonresponse (or missing data) is often encountered in large-scale surveys. To enable the behavioural analysis of these data sets, statistical treatments are commonly applied to complete or remove these data. However, the correctness of such procedures critically depends on the nature of the underlying missingness generation process. Clearly, the efficacy of applying either case deletion or imputation procedures rests on the unknown missingness generation mechanism. The contribution of this article is twofold. The study is the first to propose a simple sequential method to attempt to identify the form of missingness. Second, the effectiveness of the tests is assessed by generating (experimentally) nine missing data sets by imposed missing completely at random, missing at random and not missing at random processes, with data removed.

[1]  Lena Osterhagen,et al.  Multiple Imputation For Nonresponse In Surveys , 2016 .

[2]  Rolph E. Anderson,et al.  Multivariate Data Analysis (7th ed. , 2009 .

[3]  J. Graham,et al.  Missing data analysis: making it work in the real world. , 2009, Annual review of psychology.

[4]  Patricia Goodson,et al.  Out of sight, not out of mind: strategies for handling missing data. , 2008, American journal of health behavior.

[5]  Vic Hasselblad,et al.  Can one assess whether missing data are missing at random in medical studies? , 2006, Statistical methods in medical research.

[6]  Stefano Tarantola,et al.  Handbook on Constructing Composite Indicators: Methodology and User Guide , 2005 .

[7]  John L.P. Thompson,et al.  Missing data , 2004 .

[8]  Annie Qu,et al.  Testing ignorable missingness in estimating equation approaches for longitudinal data , 2002 .

[9]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data: Little/Statistical Analysis with Missing Data , 2002 .

[10]  D. Bennett How can I deal with missing data in my study? , 2001, Australian and New Zealand journal of public health.

[11]  Roderick J. A. Little,et al.  A test of missing completely at random for generalised estimating equations with missing data , 1999 .

[12]  C. S. Davis,et al.  A test of the missing data mechanism for repeated categorical data. , 1993, Biometrics.

[13]  R. Little Missing-Data Adjustments in Large Surveys , 1988 .

[14]  R. Little A Test of Missing Completely at Random for Multivariate Data with Missing Values , 1988 .

[15]  D. Rubin Multiple imputation for nonresponse in surveys , 1989 .

[16]  Jasper Becker,et al.  Joint Research Centre , 1982, Nature.