Combining Information From Two Surveys to Estimate County-Level Prevalence Rates of Cancer Risk Factors and Screening

Cancer surveillance research requires estimates of the prevalence of cancer risk factors and screening for small areas such as counties. Two popular data sources are the Behavioral Risk Factor Surveillance System (BRFSS), a telephone survey conducted by state agencies, and the National Health Interview Survey (NHIS), an area probability sample survey conducted through face-to-face interviews. Both data sources have advantages and disadvantages. The BRFSS is a larger survey and almost every county is included in the survey, but it has lower response rates as is typical with telephone surveys and it does not include subjects who live in households with no telephones. On the other hand, the NHIS is a smaller survey, with the majority of counties not included; but it includes both telephone and nontelephone households, and has higher response rates. A preliminary analysis shows that the distributions of cancer screening and risk factors are different for telephone and nontelephone households. Thus, information from the two surveys may be combined to address both nonresponse and noncoverage errors. A hierarchical Bayesian approach that combines information from both surveys is used to construct county-level estimates. The proposed model incorporates potential noncoverage and nonresponse biases in the BRFSS as well as complex sample design features of both surveys. A Markov chain Monte Carlo method is used to simulate draws from the joint posterior distribution of unknown quantities in the model that uses design-based direct estimates and county-level covariates. Yearly prevalence estimates at the county level for 49 states, as well as for the entire state of Alaska and the District of Columbia, are developed for six outcomes using BRFSS and NHIS data from the years 1997–2000. The outcomes include smoking and use of common cancer screening procedures. The NHIS/BRFSS combined county-level estimates are substantially different from those based on the BRFSS alone.

[1]  D. Nelson,et al.  A comparison of national estimates from the National Health Interview Survey and the Behavioral Risk Factor Surveillance System. , 2003, American journal of public health.

[2]  R. Fay,et al.  Estimates of Income for Small Places: An Application of James-Stein Procedures to Census Data , 1979 .

[3]  D K Wagener,et al.  Progress in cancer screening over a decade: results of cancer screening from the 1987, 1992, and 1998 National Health Interview Surveys. , 2001, Journal of the National Cancer Institute.

[4]  F. J. Anscombe,et al.  THE TRANSFORMATION OF POISSON, BINOMIAL AND NEGATIVE-BINOMIAL DATA , 1948 .

[5]  J. Schmee An Introduction to Multivariate Statistical Analysis , 1986 .

[6]  B. Efron,et al.  Data Analysis Using Stein's Estimator and its Generalizations , 1975 .

[7]  D. Binder On the variances of asymptotically normal estimators from complex surveys , 1983 .

[8]  D. Lindley,et al.  Bayes Estimates for the Linear Model , 1972 .

[9]  Massey Jt,et al.  Design and estimation for the National Health Interview Survey 1985-94. , 1989 .

[10]  R. Iachan,et al.  Methods for Combining State BRFSS Survey Data for National Estimation , 2002 .

[11]  Jack R. Anderson Design and estimation for the National Health Interview Survey, 1995-2004. , 2000, Vital and health statistics. Series 2, Data evaluation and methods research.

[12]  Michael R. Elliott,et al.  Obtaining cancer risk factor prevalence estimates in small areas: combining data from two surveys , 2005 .

[13]  K. Warriner,et al.  Evaluating Socio-economic Status (SES) Bias in Survey Nonresponse , 2002 .

[14]  Adrian F. M. Smith,et al.  Sampling-Based Approaches to Calculating Marginal Densities , 1990 .

[15]  D. Rubin,et al.  Inference from Iterative Simulation Using Multiple Sequences , 1992 .

[16]  Anja Vogler,et al.  An Introduction to Multivariate Statistical Analysis , 2004 .

[17]  M. Ghosh,et al.  Estimation of Median Income of Four-Person Families: A Bayesian Time Series Approach , 1996 .

[18]  J. N. K. Rao,et al.  Some recent advances in model-based small area estimation , 1999 .

[19]  Dawei Xie Combining information from multiple surveys for small area estimation: Bayesian approaches. , 2004 .

[20]  J. Legler,et al.  Predicting patterns of mammography use: a geographic perspective on national needs for intervention research. , 2002, Health services research.

[21]  J. Tukey,et al.  Transformations Related to the Angular and the Square Root , 1950 .

[22]  Bradley P. Carlin,et al.  Generalized Linear Models for Small-Area Estimation , 1998 .

[23]  Joseph Waksberg,et al.  Sampling Methods for Random Digit Dialing , 1978 .

[24]  J. Rao Small Area Estimation , 2003 .

[25]  L. Tierney Exploring Posterior Distributions Using Markov Chains , 1992 .

[26]  Frederick Mosteller,et al.  Tables of the Freeman-Tukey Transformations for the Binomial and Poisson Distributions , 1961 .

[27]  Nancy Breen,et al.  Progress in cancer screening practices in the United States , 2003, Cancer.

[28]  D. Nelson,et al.  Telephone coverage and measurement of health risk indicators: data from the National Health Interview Survey. , 1998, American journal of public health.