Hierarchical Bayesian Nonresponse Models for Binary Data From Small Areas With Uncertainty About Ignorability

In the National Crime Survey (NCS), data on victimization can be poststratified into domains determined by urbanization, type of place, and poverty level. There is much difficulty in the analysis of binary data with substantial nonresponse. We consider three Bayesian hierarchical models for binary nonresponse data, like those from the NCS, which are clustered within a number of domains or areas. As in small area estimation, one key feature is that each model “borrows strength” across the areas through the selection approach to nonresponse. This is necessary to estimate the parameters with the least association to the observed data (i.e., weakly identified parameters). The first model assumes that the nonresponse mechanism is ignorable, and the second model assumes that it is nonignorable. We argue that a discrete model expansion (a probabilistic mixture) may be inappropriate for modeling uncertainty about ignorability. Therefore, we propose a third model through a continuous model expansion on an odds ratio for each area. When the odds ratio is 1, we have the ignorable model; otherwise, the model is nonignorable. One important feature is that uncertainty about ignorability is incorporated by “centering” on the ignorable model. We analyze the poststratified data from the NCS to reveal latent features associated with nonresponse. The complexity of the posterior distributions of the parameters forces us to implement the methodology using Markov chain Monte Carlo methods. When the proportion of households with a characteristic (i.e., victimization in the NCS) and the response probability of a household in the population are estimated, we find that the nonignorable model and the expansion model are similar but that they differ from the ignorable model. Although considerable prior information about the nonresponse mechanism is needed, the expansion model indicates that nonresponse for most of the areas is nonignorable. An analysis shows that inference is not very sensitive to an important distribution assumption, and a simulation exercise shows that the expansion model works very well.

[1]  Diane Griffin Saphire,et al.  Estimation of Victimization Prevalence Using Data from the National Crime Survey , 1984 .

[2]  Alan E. Gelfand,et al.  Bayesian statistics without tears: A sampling-resampling perspective , 1992 .

[3]  Joseph B. Kadane,et al.  Subjective Bayesian analysis for surveys with missing data , 1993 .

[4]  S G Baker,et al.  Marginal regression for repeated binary data with outcome subject to non-ignorable non-response. , 1995, Biometrics.

[5]  Roderick J. A. Little,et al.  Modeling the Drop-Out Mechanism in Repeated-Measures Studies , 1995 .

[6]  D. Rubin Formalizing Subjective Notions about the Effect of Nonrespondents in Sample Surveys , 1977 .

[7]  M R Conaway Causal nonresponse models for repeated categorical measurements. , 1994, Biometrics.

[8]  Michael J. Phillips Contingency tables with missing data , 1993 .

[9]  L. Wasserman,et al.  Computing Bayes Factors by Combining Simulation and Asymptotic Approximations , 1997 .

[10]  David Draper,et al.  Assessment and Propagation of Model Uncertainty , 2011 .

[11]  Balgobin Nandram A bayesian analysis of the three-stage hierarchical multinomial model , 1998 .

[12]  S. Lipsitz,et al.  Weighted least squares analysis of repeated categorical measurements with outcomes subject to nonresponse. , 1994, Biometrics.

[13]  Roderick J. A. Little,et al.  Adjusting for Nonignorable Drop-Out Using Semiparametric Nonresponse Models: Comment , 1999 .

[14]  Adrian E. Raftery,et al.  Bayesian model averaging: a tutorial (with comments by M. Clyde, David Draper and E. I. George, and a rejoinder by the authors , 1999 .

[15]  Elizabeth A. Stasny,et al.  Hierarchical Models for the Probabilities of a Survey Classification and Nonresponse: An Example from the National Crime Survey , 1991 .

[16]  Bradley P. Carlin,et al.  Markov Chain Monte Carlo conver-gence diagnostics: a comparative review , 1996 .

[17]  Nan M. Laird,et al.  Regression Analysis for Categorical Variables with Outcome Subject to Nonignorable Nonresponse , 1988 .

[18]  R. Fay Causal Models for Patterns of Nonresponse , 1986 .

[19]  R. Little Pattern-Mixture Models for Multivariate Incomplete Data , 1993 .

[20]  Erik V. Nordheim,et al.  Inference from Nonrandomly Missing Categorical Data: An Example from a Genetic Study on Turner's Syndrome , 1984 .

[21]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[22]  J. Robins,et al.  Adjusting for Nonignorable Drop-Out Using Semiparametric Nonresponse Models , 1999 .

[23]  Alan H. Welsh,et al.  Log-linear models for survey data with non-ignorable non-response , 1993 .

[24]  Jonathan J. Forster,et al.  Model‐based inference for categorical survey data subject to non‐ignorable non‐response , 1998 .

[25]  R. Little Models for Nonresponse in Sample Surveys , 1982 .

[26]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[27]  Mark R. Conaway,et al.  The Analysis of Repeated Categorical Measurements Subject to Nonignorable Nonresponse , 1992 .

[28]  D. Rubin Multiple imputation for nonresponse in surveys , 1989 .