Validation of multilevel regression and poststratification methodology for small area estimation of health indicators from the behavioral risk factor surveillance system.

Small area estimation is a statistical technique used to produce reliable estimates for smaller geographic areas than those for which the original surveys were designed. Such small area estimates (SAEs) often lack rigorous external validation. In this study, we validated our multilevel regression and poststratification SAEs from 2011 Behavioral Risk Factor Surveillance System data using direct estimates from 2011 Missouri County-Level Study and American Community Survey data at both the state and county levels. Coefficients for correlation between model-based SAEs and Missouri County-Level Study direct estimates for 115 counties in Missouri were all significantly positive (0.28 for obesity and no health-care coverage, 0.40 for current smoking, 0.51 for diabetes, and 0.69 for chronic obstructive pulmonary disease). Coefficients for correlation between model-based SAEs and American Community Survey direct estimates of no health-care coverage were 0.85 at the county level (811 counties) and 0.95 at the state level. Unweighted and weighted model-based SAEs were compared with direct estimates; unweighted models performed better. External validation results suggest that multilevel regression and poststratification model-based SAEs using single-year Behavioral Risk Factor Surveillance System data are valid and could be used to characterize geographic variations in health indictors at local levels (such as counties) when high-quality local survey data are not available.

[1]  C. Murray,et al.  Cigarette smoking prevalence in US counties: 1996-2012 , 2014, Population Health Metrics.

[2]  M. Goodman,et al.  Multilevel Reweighted Regression Models to Estimate County-Level Racial Health Disparities Using PROC GLIMMIX , 2013 .

[3]  Donald Malec,et al.  Small Area Inference for Binary Variables in the National Health Interview Survey , 1997 .

[4]  M. Tremblay,et al.  The Bias in Self‐reported Obesity From 1976 to 2005: A Canada–US Comparison , 2010, Obesity.

[5]  Using Zip Code-Level Mortality Data as a Local Health Status Indicator in Mobile, Alabama , 2008, The American journal of the medical sciences.

[6]  Nathaniel Schenker,et al.  Combining Information From Two Surveys to Estimate County-Level Prevalence Rates of Cancer Risk Factors and Screening , 2007 .

[7]  P. Muennig,et al.  Comparison of small-area analysis techniques for estimating county-level outcomes. , 2004, American journal of preventive medicine.

[8]  M. Goodman Comparison of Small-Area Analysis Techniques for Estimating Prevalence by Race , 2010, Preventing chronic disease.

[9]  S. Lemon,et al.  Small-area estimation and prioritizing communities for obesity control in Massachusetts. , 2009, American journal of public health.

[10]  M. Schootman,et al.  Efficient mapping and geographic disparities in breast cancer mortality at the county-level by race and age in the U.S. , 2013, Spatial and spatio-temporal epidemiology.

[11]  M. Clark,et al.  Using Small-Area Estimation to Describe County-Level Disparities in Mammography , 2009, Preventing chronic disease.

[12]  W. Davis,et al.  Model-based small area estimates of overweight prevalence using sample selection adjustment. , 1999, Statistics in medicine.

[13]  Yuna Zhong,et al.  Geographic and Racial Patterns of Preventable Hospitalizations for Hypertension: Medicare Beneficiaries, 2004–2009 , 2014, Public health reports.

[14]  S. Vernon,et al.  County-level estimates of human papillomavirus vaccine coverage among young adult women in Texas, 2008. , 2013, Texas public health journal.

[15]  Kurt J Greenlund,et al.  Multilevel regression and poststratification for small-area estimation of population health outcomes: a case study of chronic obstructive pulmonary disease prevalence using the behavioral risk factor surveillance system. , 2014, American journal of epidemiology.

[16]  A Multilevel Approach to Estimating Small Area Childhood Obesity Prevalence at the Census Block-Group Level , 2013, Preventing chronic disease.

[17]  Jun Liu,et al.  Correcting the Bias in the Range of a Statistic Across Small Areas , 2000 .

[18]  Helen Margellos-Anast,et al.  Prevalence of Obesity among Children in Six Chicago Communities: Findings from a Health Survey , 2008, Public health reports.

[19]  Trivellore E Raghunathan,et al.  Estimation of the proportion of overweight individuals in small areas—a robust extension of the Fay–Herriot model , 2007, Statistics in medicine.

[20]  S. Vernon,et al.  Human papillomavirus vaccine coverage among females aged 11 to 17 in Texas counties: an application of multilevel, small area estimation. , 2013, Women's health issues : official publication of the Jacobs Institute of Women's Health.

[21]  J. Rao Small Area Estimation , 2003 .

[22]  C. G. Hudson Validation of a model for estimating state and local prevalence of serious mental illness , 2009, International journal of methods in psychiatric research.

[23]  L. Barker,et al.  Bayesian Small Area Estimates of Diabetes Prevalence by U.S. County, 2005 , 2010 .

[24]  J. N. K. Rao,et al.  A pseudo‐empirical best linear unbiased prediction approach to small area estimation using survey weights , 2002 .

[25]  S. Teutsch,et al.  Small Area Estimates Reveal High Cigarette Smoking Prevalence in Low-Income Cities of Los Angeles County , 2012, Journal of Urban Health.

[26]  J. Kelsey,et al.  Small-area estimation and prioritizing communities for tobacco control efforts in Massachusetts. , 2009, American journal of public health.

[27]  Steven Allender,et al.  Validation of model-based estimates (synthetic estimates) of the prevalence of risk factors for coronary heart disease for wards in England. , 2009, Health & place.

[28]  L. Geiss,et al.  Bayesian Small Area Estimates of Diabetes Incidence by United States County, 2009. , 2013, Journal of data science : JDS.

[29]  Liz Twigg,et al.  Predicting small area health-related behaviour: a comparison of multilevel synthetic estimation and local survey data. , 2002, Social science & medicine.

[30]  Danny Pfeffermann,et al.  New important developments in small area estimation , 2013, 1302.4907.

[31]  Stephen S. Lim,et al.  Prevalence, Awareness, Treatment, and Control of Hypertension in United States Counties, 2001–2009 , 2013, PloS one.

[32]  Michael W. Link,et al.  Monitoring county-level vaccination coverage during the 2004-2005 influenza season. , 2006, American journal of preventive medicine.

[33]  Ali H Mokdad,et al.  A novel framework for validating and applying standardized small area measurement strategies , 2010, Population health metrics.

[34]  Peter Congdon,et al.  International Journal of Health Geographics Open Access a Multilevel Model for Cardiovascular Disease Prevalence in the Us and Its Application to Micro Area Prevalence Estimates , 2022 .

[35]  A. Drewnowski,et al.  The geographic distribution of obesity by census tract among 59 767 insured adults in King County, WA , 2013, International Journal of Obesity.

[36]  Peter Congdon,et al.  Estimating Small Area Diabetes Prevalence in the US Using the Behavioral Risk Factor Surveillance System , 2010, Journal of Data Science.

[37]  A. Penman,et al.  Using Small-Area Estimation Method to Calculate County-Level Prevalence of Obesity in Mississippi, 2007-2009 , 2011, Preventing chronic disease.

[38]  S. Dube,et al.  Adult tobacco survey - 19 States, 2003-2007. , 2010, Morbidity and mortality weekly report. Surveillance summaries.

[39]  C. Murray,et al.  Trends in National and State-Level Obesity in the USA after Correction for Self-Report Bias: Analysis of Health Surveys , 2006 .