Using 3 Health Surveys to Compare Multilevel Models for Small Area Estimation for Chronic Diseases and Health Behaviors

Background We used a multilevel regression and poststratification approach to generate estimates of health-related outcomes using Behavioral Risk Factor Surveillance System 2013 (BRFSS) data for the 500 US cities. We conducted an empirical study to investigate whether the approach is robust using different health surveys. Methods We constructed a multilevel logistic model with individual-level age, sex, and race/ethnicity as predictors (Model I), and sequentially added educational attainment (Model II) and area-level poverty (Model III) for 5 health-related outcomes using the nationwide BRFSS, the Massachusetts BRFSS 2013 (a state subset of nationwide BRFSS), and the Boston BRFSS 2010/2013 (an independent survey), respectively. We applied each model to the Boston population (2010 Census) to predict each outcome in Boston and compared each with corresponding Boston BRFSS direct estimates. Results Using Model I for the nationwide BRFSS, estimates of diabetes, high blood pressure, physical inactivity, and binge drinking fell within the 95% confidence interval of corresponding Boston BRFSS direct estimates. Adding educational attainment and county-level poverty (Models II and III) further improved their accuracy, particularly for current smoking (the model-based estimate was 15.2% by Model I and 18.1% by Model II). The estimates based on state BRFSS and Boston BRFSS models were similar to those based on the nationwide BRFSS, but area-level poverty did not improve the estimates significantly. Conclusion The estimates of health-related outcomes were similar using different health surveys. Model specification could vary by surveys with different geographic coverage.

[1]  Snehal N Shah,et al.  Comparison of Methods for Estimating Prevalence of Chronic Diseases and Health Behaviors for Small Geographic Areas: Boston Validation Study, 2013 , 2017, Preventing chronic disease.

[2]  K. Greenlund,et al.  Validation of multilevel regression and poststratification methodology for small area estimation of health indicators from the behavioral risk factor surveillance system. , 2015, American journal of epidemiology.

[3]  P. Lahiri,et al.  Variable Selection for Linear Mixed Models with Applications in Small Area Estimation , 2015, Sankhya B.

[4]  E. Ford,et al.  Multilevel regression and poststratification for small-area estimation of population health outcomes: a case study of chronic obstructive pulmonary disease prevalence using the behavioral risk factor surveillance system. , 2014, American journal of epidemiology.

[5]  B. Highton,et al.  How Does Multilevel Regression and Poststratification Perform with Conventional National Surveys? , 2013, Political Analysis.

[6]  Danny Pfeffermann,et al.  New important developments in small area estimation , 2013, 1302.4907.

[7]  Jonathan Rodden,et al.  How Should We Measure District-Level Public Opinion on Individual Issues? , 2012, The Journal of Politics.

[8]  Jiming Jiang,et al.  Mixed model prediction and small area estimation , 2006 .

[9]  Andrew Gelman,et al.  Bayesian Multilevel Estimation with Poststratification: State-Level Estimates from National Polls , 2004, Political Analysis.

[10]  S. Cummins,et al.  Place effects on health: how can we conceptualise, operationalise and measure them? , 2002, Social science & medicine.

[11]  D. Pfeffermann Small Area Estimation‐New Developments and Directions , 2002 .

[12]  A. Roux Investigating Neighborhood and Area Effects on Health , 2001 .

[13]  Andrew B Lawson,et al.  Bayesian spatially dependent variable selection for small area health modeling , 2018, Statistical methods in medical research.

[14]  Jeffrey R. Lax,et al.  How Should We Estimate Public Opinion in the States , 2009 .