Local Post-Stratification in Dual System Accuracy and Coverage Evaluation for the U.S. Census

We consider a local post-stratification approach to analyze the capture–recapture dual system Accuracy and Coverage Evaluation (A.C.E.) data associated with the 2000 U.S. Census. The local post-stratification is carried out via a nonparametric regression estimation of the census enumeration and the correct enumeration functions. We propose a nonparametric population size estimator that is designed to accommodate some key aspects of the A.C.E.: missing values, erroneous enumerations, and extra covariates affecting the missingness and correct enumeration. The resulting estimates are compared with estimates from a conventional post-stratification and a logistic regression approach in an analysis on the 2000 Census A.C.E. data.

[1]  J. Alho,et al.  Estimating heterogeneity in the probabilities of enumeration for dual-system estimation. , 1993, Journal of the American Statistical Association.

[2]  Jeffrey S. Simonoff,et al.  Smoothing categorical data , 1995 .

[3]  S E Fienberg,et al.  A three-sample multiple-recapture approach to census population estimation with heterogeneous catchability. , 1993, Journal of the American Statistical Association.

[4]  Kenneth H. Pollock,et al.  Modeling capture, recapture, and removal statistics for estimation of demographic parameters for fish and wildlife populations : Past, present, and future , 1991 .

[5]  Zhongwei Han,et al.  A Non-Parametric Approach , 2008 .

[6]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[7]  L. Brown,et al.  Alternative Formulas for Synthetic Dual System Estimation in the 2000 Census , 2008, 0805.2835.

[8]  Q. Lib,et al.  Nonparametric estimation of regression functions with both categorical and continuous data , 2004 .

[9]  K. Wolter Some coverage error models for census data. , 1986, Journal of the American Statistical Association.

[10]  W. Deming,et al.  On a Method of Estimating Birth and Death Rates and the Extent of Registration (Excerpt) , 1949 .

[11]  W. Bell Using information from demographic analysis in post-enumeration survey estimation. , 1993, Journal of the American Statistical Association.

[12]  R. Huggins On the statistical analysis of capture experiments , 1989 .

[13]  K. H. Pollock,et al.  Building Models of Capture-Recapture Experiments , 1976 .

[14]  Roderick J. A. Little,et al.  A Bayesian Approach to Combining Information from a Census, a Coverage Measurement Survey, and Demographic Analysis , 2000 .

[15]  H. Müller,et al.  Local Polynomial Modeling and Its Applications , 1998 .

[16]  J. Aitchison,et al.  Multivariate binary discrimination by the kernel method , 1976 .

[17]  P. Hall On nonparametric multivariate binary discrimination , 1981 .

[18]  Arno Siebes,et al.  Smoothing Categorical Data , 2012, ECML/PKDD.

[19]  J. Shao,et al.  The jackknife and bootstrap , 1996 .

[20]  A. Zaslavsky,et al.  Triple-System Modeling of Census, Post-Enumeration Survey, and Administrative-List Data , 1993 .

[21]  D. Rubin,et al.  Hierarchical logistic regression models for imputation of unresolved enumeration status in undercount estimation. , 1993, Journal of the American Statistical Association.

[22]  Jeffrey S. Racine,et al.  Cross-Validation and the Estimation of Conditional Probability Densities , 2004 .

[23]  M. C. Jones,et al.  Kernel Estimators for Univariate Binary Regression , 2004 .

[24]  Chris Lloyd,et al.  A nonparametric approach to the analysis of two-stage mark-recapture experiments , 2000 .

[25]  R. Little,et al.  A Bayesian Approach to 2000 Census Evaluation Using ACE Survey Data and Demographic Analysis , 2005 .

[26]  J. Shao,et al.  A General Theory for Jackknife Variance Estimation , 1989 .

[27]  Donald Malec,et al.  Small Area Inference for Binary Variables in the National Health Interview Survey , 1997 .

[28]  Using Continuous Variables As Modeling Covariates for Net Coverage Estimation , 2008 .

[29]  K. Wolter CAPTURE-RECAPTURE ESTIMATION IN THE PRESENCE OF A KNOWN SEX RATIO , 1990 .

[30]  A. Chao,et al.  A Sample Coverage Approach to Multiple-System Estimation with Application to Census Undercount , 1998 .

[31]  H. Hogan The 1990 Post-Enumeration Survey: An Overview , 1992 .

[32]  Deborah Nolan,et al.  Probability and Statistics: Essays in Honor of David A. Freedman , 2008, 0806.4441.

[33]  J. Alho Logistic regression in capture-recapture models. , 1990, Biometrics.

[34]  Patricia L. Smith Splines as a Useful and Convenient Statistical Tool , 1979 .

[35]  S. Chen,et al.  Nonparametric regression with discrete covariate and missing values , 2011 .

[36]  W. Deming,et al.  On a Method of Estimating Birth and Death Rates and the Extent of Registration (Excerpt) , 1949 .

[37]  A. Dueck,et al.  Handling missing data. , 2005, Current problems in cancer.