Nonparametric Bayes modeling with sample survey weights.

In population studies, it is standard to sample data via designs in which the population is divided into strata, with the different strata assigned different probabilities of inclusion. Although there have been some proposals for including sample survey weights into Bayesian analyses, existing methods require complex models or ignore the stratified design underlying the survey weights. We propose a simple approach based on modeling the distribution of the selected sample as a mixture, with the mixture weights appropriately adjusted, while accounting for uncertainty in the adjustment. We focus for simplicity on Dirichlet process mixtures but the proposed approach can be applied more broadly. We sketch a simple Markov chain Monte Carlo algorithm for computation, and assess the approach via simulations and an application.

[1]  R. Little,et al.  Inference for the Population Total from Probability-Proportional-to-Size Samples Based on Predictions from a Penalized Spline Nonparametric Model , 2003 .

[2]  Danny Pfeffermann Struggles with Survey Weighting and Regression Modeling. Comment. , 2007 .

[3]  Sharon L. Lohr,et al.  Asymptotic properties of kernel density estimation with complex survey data , 2005 .

[4]  Peter D. Hoff,et al.  A First Course in Bayesian Statistical Methods , 2009 .

[5]  James E. Stafford,et al.  DENSITY ESTIMATION FROM COMPLEX SURVEYS , 1999 .

[6]  Trent D. Buskirk 1998: NONPARAMETRIC DENSITY ESTIMATION USING COMPLEX SURVEY DATA , 2002 .

[7]  Yajuan Si,et al.  Bayesian Nonparametric Weighted Sampling Inference , 2013, 1309.1799.

[8]  K. Harris,et al.  The National Longitudinal Study of Adolescent Health (Add Health) Twin Data , 2006, Twin Research and Human Genetics.

[9]  Andrew Gelman,et al.  Struggles with survey weighting and regression modeling , 2007, 0710.5005.

[10]  R. Little To Model or Not To Model? Competing Modes of Inference for Finite Population Sampling , 2004 .

[11]  R. Little,et al.  Bayesian Inference for the Finite Population Total from a Heteroscedastic Probability Proportional to Size Sample , 2015 .

[12]  D. Horvitz,et al.  A Generalization of Sampling Without Replacement from a Finite Universe , 1952 .

[13]  Lancelot F. James,et al.  Gibbs Sampling Methods for Stick-Breaking Priors , 2001 .

[14]  D. Dunson,et al.  Nonparametric Bayes Modeling of Multivariate Categorical Data , 2009, Journal of the American Statistical Association.

[15]  Jerome P. Reiter,et al.  Incorporating Marginal Prior Information in Latent Class Models , 2016 .

[16]  Jerome P. Reiter,et al.  Nonparametric Bayesian Multiple Imputation for Incomplete Categorical Variables in Large-Scale Assessment Surveys , 2013 .

[17]  T. Louis,et al.  Bayes and Empirical Bayes Methods for Data Analysis. , 1997 .

[18]  R. Little,et al.  Penalized Spline Model-Based Estimation of the Finite Populations Total from Probability-Proportional-to-Size Samples , 2003 .

[19]  Antonio Canale,et al.  Bayesian Kernel Mixtures for Counts , 2011, Journal of the American Statistical Association.

[20]  R. Little,et al.  Bayesian penalized spline model-based inference for finite population proportion in unequal probability sampling. , 2010, Survey methodology.