Randomized Response, Statistical Disclosure Control and Misclassificatio: a Review

This paper discusses analysis of categorical data which have been misclassified and where misclassification probabilities are known. Fields where this kind of misclassification occurs are randomized response, statistical disclosure control, and classification with known sensitivity and specificity. Estimates of true frequencies are given, and adjustments to the odds ratio are discussed. Moment estimates and maximum Likelihood estimates are compared and it is proved that they are the same in the interior of the parameter space. Since moment estimators are regularly outside the parameter space, special attention is paid to the possibility of boundary solutions. An example is given.

[1]  J. Wolfowitz,et al.  Introduction to the Theory of Statistics. , 1951 .

[2]  S. Warner The Linear Randomized Response Model , 1971 .

[3]  L. Lucy An iterative technique for the rectification of observed distributions , 1974 .

[4]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[5]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[6]  P. Holland,et al.  Discrete Multivariate Analysis. , 1976 .

[7]  H Checkoway,et al.  Bias due to misclassification in the estimation of relative risk. , 1977, American journal of epidemiology.

[8]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[9]  S Greenland,et al.  The effect of misclassification in the presence of covariates. , 1980, American journal of epidemiology.

[10]  Joseph E. Schwartz,et al.  The Neglected Problem of Measurement Error in Categorical Data , 1985 .

[11]  Paul E. Tracy,et al.  Randomized Response: A Method for Sensitive Surveys , 1986 .

[12]  A. Chaudhuri,et al.  Randomized Response: Theory and Techniques , 1987 .

[13]  J. Rice Mathematical Statistics and Data Analysis , 1988 .

[14]  S Greenland,et al.  Variance estimation for epidemiologic effect estimates under misclassification. , 1988, Statistics in medicine.

[15]  Patrick D. Bourke,et al.  Estimating Proportions from Randomized Response Data Using the EM Algorithm , 1988 .

[16]  T T Chen A review of methods for misclassified categorical data in epidemiology. , 1989, Statistics in medicine.

[17]  G. Jasso Review of "International Encyclopedia of Statistical Sciences, edited by Samuel Kotz, Norman L. Johnson, and Campbell B. Read, New York, Wiley, 1982-1988" , 1989 .

[18]  Anthony Y. C. Kuk,et al.  Asking sensitive questions indirectly , 1990 .

[19]  A. Agresti An introduction to categorical data analysis , 1997 .

[20]  G. McLachlan,et al.  The EM algorithm and extensions , 1996 .

[21]  L. Magder,et al.  Logistic regression when the outcome is measured with uncertainty. , 1997, American journal of epidemiology.

[22]  Chris J. Skinner,et al.  Categorical data analysis and misclassification , 1997 .

[23]  A.D.L. Van den Hout,et al.  The Analysis of Data Perturbed by Pram , 1999 .

[24]  J. Hox,et al.  A Comparison of Randomized Response, Computer-Assisted Self-Interview, and Face-to-Face Direct Questioning , 2000 .

[25]  David E. Booth,et al.  Analysis of Incomplete Multivariate Data , 2000, Technometrics.

[26]  Leon Willenborg Optimality models for PRAM , 2000 .

[27]  L. Willenborg,et al.  Elements of Statistical Disclosure Control , 2000 .

[28]  Randomized response: Onderzoek naar regelovertreding. Resultaten ABW, WAO en WW , 2001 .

[29]  M. Rosenberg CATEGORICAL DATA ANALYSIS BY A RANDOMIZED RESPONSE TECHNIQUE FOR STATISTICAL DISCLOSURE CONTROL , 2002 .