Multilevel regression and poststratification for small-area estimation of population health outcomes: a case study of chronic obstructive pulmonary disease prevalence using the Behavioral Risk Factor Surveillance System.

A variety of small-area statistical models have been developed for health surveys, but none is flexible enough to generate small-area estimates (SAEs) that meet data needs at different geographic levels. We developed a multilevel logistic model with both state-level and nested county-level random effects for chronic obstructive pulmonary disease (COPD) using 2011 data from the Behavioral Risk Factor Surveillance System. We applied poststratification using census-block population counts from the 2010 decennial US Census to generate census-block-level SAEs of COPD prevalence, which could be conveniently aggregated to all other census geographic units, such as census tracts, counties, and congressional districts. The model-based SAEs and direct survey estimates of COPD prevalence were consistent at both the county and state levels: the Pearson correlation coefficient was 0.99 at the state level and ranged from 0.88 to 0.95 at the county level. Our extended multilevel regression modeling and poststratification approach could be adapted for other geocoded national health surveys to generate reliable SAEs for population health outcomes at all administrative and legislative geographic levels of interest in a scalable framework.
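
The poststratification step described above can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that a fitted multilevel logistic model has already produced a predicted COPD probability for each poststratification cell within a census block. The function names, the cell layout, and all numbers are hypothetical, and the abstract does not state what defines a cell. An area's estimate is the population-weighted average of its cell probabilities, so the same cells can be rolled up to any geography that contains them.

```python
import math
from collections import defaultdict


def inv_logit(x):
    """Map a linear predictor on the logit scale to a probability."""
    return 1.0 / (1.0 + math.exp(-x))


def cell_probability(fixed_part, state_effect, county_effect):
    """Predicted probability for one cell (hypothetical helper).

    fixed_part is the fixed-effects contribution; state_effect and
    county_effect stand in for the state-level and nested county-level
    random effects named in the abstract.
    """
    return inv_logit(fixed_part + state_effect + county_effect)


def poststratify(cells, key):
    """Population-weighted prevalence for each area identified by `key`."""
    weighted_cases = defaultdict(float)
    population = defaultdict(float)
    for cell in cells:
        area = cell[key]
        weighted_cases[area] += cell["population"] * cell["probability"]
        population[area] += cell["population"]
    return {
        area: weighted_cases[area] / population[area]
        for area in population
        if population[area] > 0
    }


# Hypothetical cells: two in block B1 and one in block B2, all in county C1.
cells = [
    {"block": "B1", "county": "C1", "population": 100, "probability": 0.10},
    {"block": "B1", "county": "C1", "population": 300, "probability": 0.20},
    {"block": "B2", "county": "C1", "population": 600, "probability": 0.05},
]

block_estimates = poststratify(cells, "block")
county_estimates = poststratify(cells, "county")
```

Calling `poststratify(cells, "block")` gives block-level estimates, and changing the key to `"county"` aggregates the same cells to the county level. This mirrors the abstract's point that block-level SAEs can be aggregated to larger census units.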
