EMPIRICAL BAYES ESTIMATORS OF SMALL AREA PROPORTIONS IN MULTISTAGE DESIGNS

The importance of small area estimation as a facet of survey sampling cannot be over-emphasized. Of late, there has been an increasing demand for small area statistics in both the public and private sectors. It is widely recognized that direct survey estimates for small areas are likely to be unstable due to the small sample sizes in the areas. This makes it necessary to "borrow strength" from related areas to obtain more accurate estimates. In this study, an empirical Bayes methodology for the estimation of small area proportions is proposed, implemented, and evaluated. The basic idea consists of incorporating into a logistic regression model, random effects that are nested in such a way as to reflect the complex structure of a multistage sample design. This yields both point estimates and associated naive measures of accuracy. The latter do not incorporate the uncertainty that arises from estimating the prior distribution of the random effects. We adjust these naively-estimated measures of uncertainty using the bootstrap techniques developed by Laird and Louis (1987). The proposed estimation approach is applied to data from a United States Census to predict local labour force participation rates. Results are compared with those obtained using unbiased and synthetic estimation, as well as a domain- adjusted synthetic estimation approach which incorporates predictor variables at both the individual and local area levels.

[1]  Thomas J. Tomberlin,et al.  PREDICTING ACCIDENT FREQUENCIES FOR DRIVERS CLASSIFIED BY TWO FACTORS , 1988 .

[2]  Malay Ghosh,et al.  Small Area Estimation: An Appraisal , 1994 .

[3]  T. Louis,et al.  Empirical Bayes Confidence Intervals Based on Bootstrap Samples , 1987 .

[4]  Donald Malec,et al.  Bayesian Predictive Inference for Small Areas for Binary Variables in the National Health Interview Survey , 1993 .

[5]  C. Morris Parametric Empirical Bayes Inference: Theory and Applications , 1983 .

[6]  R. Royall On finite population sampling theory under certain linear regression models , 1970 .

[7]  G. Y. Wong,et al.  The Hierarchical Logistic Regression Model for Multilevel Analysis , 1985 .

[8]  A. Scott,et al.  Estimation in Multi-Stage Surveys , 1969 .

[9]  G. Roberts,et al.  Logistic regression analysis of sample survey data , 1987 .

[10]  T. Stroud Hierarchical baxes predictive means and variances with application to sample survey inference , 1991 .

[11]  Kevin John Leonard Credit scoring via linear logistic models with random parameters , 1988 .

[12]  N. Laird Empirical Bayes methods for two-way contingency tables , 1978 .

[13]  Bradley P. Carlin,et al.  Approaches for Empirical Bayes Confidence Intervals , 1990 .

[14]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[15]  David Lindley,et al.  Bayes Empirical Bayes , 1981 .

[16]  Malay Ghosh,et al.  Bayesian Prediction in Linear Models: Applications to Small Area Estimation , 1991 .

[17]  M. González,et al.  Small-Area Estimation with Application to Unemployment and Housing Estimates , 1978 .

[18]  B. MacGibbon,et al.  Protection against outliers in empirical bayes estimation , 1994 .

[19]  William G. Cochran,et al.  Sampling Techniques, 3rd Edition , 1963 .

[20]  R. Fay,et al.  Estimates of Income for Small Places: An Application of James-Stein Procedures to Census Data , 1979 .