Bayesian Nonparametric Weighted Sampling Inference

Survey weighting adjusts for known or expected dierences between sample and population. Weights are constructed on design or benchmarking variables that are predictors of inclusion probability. In this paper, we assume that the only information we have about the weighting procedure is the values of the weights in the sample. We propose a hierarchical Bayesian approach in which we model the weights of the nonsampled units in the population and simultaneously include them as predictors in a nonparametric Gaussian process regression to yield valid inference for the underlying nite population and capture the uncertainty induced by sampling and the unobserved outcomes. We use simulation studies to evaluate the performance of our procedure and compare it to the classical design-based estimator. We apply our method to the Fragile Family Child Wellbeing Study. Our studies nd the Bayesian nonparametric nite population estimator to be more robust than the classical design-based estimator without loss in eciency.

[1]  P. Gustafson,et al.  Conservative prior distributions for variance parameters in hierarchical models , 2006 .

[2]  John K Kruschke,et al.  Bayesian data analysis. , 2010, Wiley interdisciplinary reviews. Cognitive science.

[3]  D. Horvitz,et al.  A Generalization of Sampling Without Replacement from a Finite Universe , 1952 .

[4]  Donald B. Rubin,et al.  Validation of Software for Bayesian Models Using Posterior Quantiles , 2006 .

[5]  Jeffrey R. Lax,et al.  How Should We Estimate Public Opinion in the States , 2009 .

[6]  Maternelle et Pédagogie Préscolaire Fragile Families and Child Wellbeing Study , 2012 .

[7]  J. Carlin,et al.  Poststratification and Weighting Adjustments , 2000 .

[8]  Acknowledgments , 2006, Molecular and Cellular Endocrinology.

[9]  Thomas Lumley,et al.  Complex Surveys: A Guide to Analysis Using R , 2010 .

[10]  Andrew Gelman,et al.  The No-U-turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo , 2011, J. Mach. Learn. Res..

[11]  Van Der Vaart,et al.  Rates of contraction of posterior distributions based on Gaussian process priors , 2008 .

[12]  R. Little,et al.  Bayesian penalized spline model-based inference for finite population proportion in unequal probability sampling. , 2010, Survey methodology.

[13]  Julien O. Teitler,et al.  Fragile Families: Sample and Design , 2001 .

[14]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[15]  R. Little,et al.  Bayesian Inference for the Finite Population Total from a Heteroscedastic Probability Proportional to Size Sample , 2015 .

[16]  R. Little Post-Stratification: A Modeler's Perspective , 1993 .

[17]  R. Little,et al.  Model-Based Alternatives to Trimming Survey Weights , 2000 .

[18]  Morris H. Hansen,et al.  An Evaluation of Model-Dependent and Probability-Sampling Inferences in Sample Surveys: Rejoinder , 1983 .

[19]  Jeffrey R. Lax,et al.  Gay Rights in the States: Public Opinion and Policy Responsiveness , 2009, American Political Science Review.

[20]  James G. Scott,et al.  On the half-cauchy prior for a global scale parameter , 2011, 1104.4937.

[21]  J. K. Ghosh,et al.  On the Logical Foundations of Survey Sampling , 1988 .

[22]  M. G. Pittau,et al.  A weakly informative default prior distribution for logistic and other regression models , 2008, 0901.4011.

[23]  A. Gelman,et al.  Deep Interactions with MRP: Election Turnout and Voting Patterns Among Small Electoral Subgroups , 2013 .

[24]  Xiao-Li Meng,et al.  POSTERIOR PREDICTIVE ASSESSMENT OF MODEL FITNESS VIA REALIZED DISCREPANCIES , 1996 .

[25]  Andrew Gelman,et al.  Struggles with survey weighting and regression modeling , 2007, 0710.5005.

[26]  R. Little,et al.  Penalized Spline Model-Based Estimation of the Finite Populations Total from Probability-Proportional-to-Size Samples , 2003 .

[27]  M. H. Hansen An Evaluation of Model-Dependent and Probability-Sampling Inferences in Sample Surveys , 1983 .