Confidentialising Microdata Using Multiple Imputation: Development and Evaluation of a Non-parametric Hierarchical Bayesian Imputation Model for Numerical Data

[1]  D. Rubin The Bayesian Bootstrap , 1981 .

[2]  J. Sethuraman A Constructive Definition of Dirichlet Priors , 1991 .

[3]  M. Escobar Estimating Normal Means with a Dirichlet Process Prior , 1994 .

[4]  Fernando A. Quintana,et al.  Nonparametric Bayesian data analysis , 2004 .

[5]  Purushottam W. Laud,et al.  Bayesian Nonparametric Inference for Random Distributions and Related Functions , 1999 .

[6]  D. Rubin,et al.  Multiple Imputation for Nonresponse in Surveys , 1989 .

[7]  D. Rubin Multiple imputation for nonresponse in surveys , 1989 .

[8]  Xiao-Li Meng,et al.  Multiple-Imputation Inferences with Uncongenial Sources of Input , 1994 .

[9]  Jerry Nedelman,et al.  Book review: “Bayesian Data Analysis,” Second Edition by A. Gelman, J.B. Carlin, H.S. Stern, and D.B. Rubin, Chapman & Hall/CRC, 2004 , 2005, Comput. Stat.

[10]  D J Spiegelhalter,et al.  Flexible random‐effects models using Bayesian semi‐parametric models: applications to institutional comparisons , 2007, Statistics in medicine.

[11]  Jerome P. Reiter,et al.  Estimating Risks of Identification Disclosure in Partially Synthetic Data , 2009, J. Priv. Confidentiality.

[12]  Jörg Drechsler,et al.  Comparing Fully and Partially Synthetic Datasets for Statistical Disclosure Control in the German IAB Establishment Panel , 2008, Trans. Data Priv.

[13]  P. Graham,et al.  Multiply imputed synthetic data: evaluation of Hierarchical Bayesian imputation models , 2009 .

[14]  Jerome P. Reiter,et al.  Satisfying Disclosure Restrictions With Synthetic Data Sets , 2002 .

[15]  S. Greenland Dose‐Response and Trend Analysis in Epidemiology: Alternatives to Categorical Analysis , 1995, Epidemiology.

[16]  R. Sarathy,et al.  A comparison of multiple imputation and data perturbation for masking numerical variables , 2006 .

[17]  Stephen E. Fienberg,et al.  Disclosure limitation using perturbation and related methods for categorical data , 1998 .

[18]  J. Albert Computational methods using a Bayesian hierarchical generalized linear model , 1988 .

[19]  P. Graham Hierarchical Bayesian Modelling of Social Variation in the Age Dependence of Disability Prevalence , 2005 .

[20]  D. Hyslop Does Benefit Receipt Affect Future Income? An Econometric Explanation , 2000 .

[21]  D. Lindley,et al.  Bayes Estimates for the Linear Model , 1972 .

[22]  C. Morris,et al.  Inference for multivariate normal hierarchical models , 2000 .

[23]  C. Morris,et al.  Hierarchical Poisson Regression Modeling , 1997 .

[24]  Anna Oganian,et al.  A Framework for Evaluating the Utility of Data Altered to Protect Confidentiality , 2006 .

[25]  C. Morris Parametric Empirical Bayes Inference: Theory and Applications , 1983 .

[26]  S. Maani Secondary and Tertiary Education Attainment and Income Levels for Maori and Non-Maori Over Time , 2000 .

[27]  Simon D. Woodcock,et al.  Disclosure Limitation in Longitudinal Linked Data , 2002 .

[28]  D. Blackwell,et al.  Ferguson Distributions Via Polya Urn Schemes , 1973 .

[29]  Education and Maori Relative Income Levels over Time: The Mediating Effect of Occupation, Industry, Hours of Work and Locality , 2002 .

[30]  James J. Heckman,et al.  Empirical Evidence on the Functional Form of the Earnings-Schooling Relationship , 1974 .

[31]  A. Kennickell Multiple Imputation and Disclosure Protection: The Case of the 1995 Survey of Consumer Finances , 2000 .

[32]  M. Daniels A prior for the variance in hierarchical models , 1999 .

[33]  Jerome P. Reiter,et al.  Releasing multiply imputed, synthetic public use microdata: an illustration and empirical study , 2005 .

[34]  Albert Y. Lo,et al.  A Bayesian bootstrap for a finite population , 1988 .

[35]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[36]  Lancelot F. James,et al.  Gibbs Sampling Methods for Stick-Breaking Priors , 2001 .

[37]  Julia Lane,et al.  Optimizing the Use of Micro-Data: An Overview of the Issues , 2005 .

[38]  Jerome P. Reiter,et al.  Multiple Imputation for Statistical Disclosure Limitation , 2003 .

[39]  Richard Penny,et al.  Multiply Imputed Synthetic Data Files , 2007 .

[40]  Andrew Gelman,et al.  Let's Practice What We Preach , 2002 .