Developing small-area predictions for smoking and obesity prevalence in the United States for use in Environmental Public Health Tracking.

BACKGROUND Globally and in the United States, smoking and obesity are leading causes of death and disability. Reliable estimates of prevalence for these risk factors are often missing variables in public health surveillance programs. This may limit the capacity of public health surveillance to target interventions or to assess associations between other environmental risk factors (e.g., air pollution) and health because smoking and obesity are often important confounders. OBJECTIVES To generate prevalence estimates of smoking and obesity rates over small areas for the United States (i.e., at the ZIP code and census tract levels). METHODS We predicted smoking and obesity prevalence using a combined approach first using a lasso-based variable selection procedure followed by a two-level random effects regression with a Poisson link clustered on state and county. We used data from the Behavioral Risk Factor Surveillance System (BRFSS) from 1991 to 2010 to estimate the model. We used 10-fold cross-validated mean squared errors and the variance of the residuals to test our model. To downscale the estimates we combined the prediction equations with 1990 and 2000 U.S. Census data for each of the four five-year time periods in this time range at the ZIP code and census tract levels. Several sensitivity analyses were conducted using models that included only basic terms, that accounted for spatial autocorrelation, and used Generalized Linear Models that did not include random effects. RESULTS The two-level random effects model produced improved estimates compared to the fixed effects-only models. Estimates were particularly improved for the two-thirds of the conterminous U.S. where BRFSS data were available to estimate the county level random effects. We downscaled the smoking and obesity rate predictions to derive ZIP code and census tract estimates. CONCLUSIONS To our knowledge these smoking and obesity predictions are the first to be developed for the entire conterminous U.S. for census tracts and ZIP codes. Our estimates could have significant utility for public health surveillance.

[1]  Renjun Ma An orthodox blup approach to generalized linear mixed models , 1999 .

[2]  M. Kivimäki,et al.  Average household income, crime, and smoking behaviour in a local area: the Finnish 10-Town study. , 2007, Social science & medicine.

[3]  R N Pierson,et al.  How useful is body mass index for comparison of body fatness across age, sex, and ethnic groups? , 1996, American journal of epidemiology.

[4]  Shava Cureton Environmental victims: environmental injustice issues that threaten the health of children living in poverty , 2011, Reviews on environmental health.

[5]  W. Brown,et al.  Neighborhood disadvantage and physical activity: baseline results from the HABITAT multilevel longitudinal study. , 2010, Annals of epidemiology.

[6]  E. Brunner,et al.  Deprivation and the development of obesity a multilevel, longitudinal study in England. , 2010, American journal of preventive medicine.

[7]  B. Poland,et al.  The social context of smoking: the next frontier in tobacco control? , 2006, Tobacco Control.

[8]  R. Burnett,et al.  Modelling risk factor information for linked census data: The case of smoking. , 2013, Health reports.

[9]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[10]  L. Geiss,et al.  Estimated county-level prevalence of diabetes and obesity - United States, 2007. , 2009, MMWR. Morbidity and mortality weekly report.

[11]  J. Jürimäe,et al.  Relationships between plasma leptin levels and body composition parameters measured by different methods in postmenopausal women , 2003, American journal of human biology : the official journal of the Human Biology Council.

[12]  Michael Jerrett,et al.  Spatial Modeling in Environmental and Public Health Research , 2010, International journal of environmental research and public health.

[13]  M. Boyle,et al.  Smoking in context: a multilevel analysis of 49,088 communities in Canada. , 2012, American journal of preventive medicine.

[14]  Nathaniel Schenker,et al.  Combining Information From Two Surveys to Estimate County-Level Prevalence Rates of Cancer Risk Factors and Screening , 2007 .

[15]  L. Barker,et al.  Bayesian Small Area Estimates of Diabetes Prevalence by U.S. County, 2005 , 2010 .

[16]  B. Jørgensen,et al.  Nested generalized linear mixed models: an orthodox best linear unbiased predictor approach , 2007 .

[17]  S. Galea,et al.  Neighborhood income and income distribution and the use of cigarettes, alcohol, and marijuana. , 2007, American journal of preventive medicine.

[18]  Michael Jerrett,et al.  Geographies of Risk in Studies Linking Chronic Air Pollution Exposure to Health Outcomes , 2005, Journal of toxicology and environmental health. Part A.

[19]  G. Moon,et al.  Sociospatial inequalities in health-related behaviours , 2012 .

[20]  Laura Kettel Khan,et al.  Recommended community strategies and measurements to prevent obesity in the United States. , 2009, MMWR. Recommendations and reports : Morbidity and mortality weekly report. Recommendations and reports.

[21]  Alan D. Lopez,et al.  A comparative risk assessment of burden of disease and injury attributable to 67 risk factors and risk factor clusters in 21 regions, 1990–2010: a systematic analysis for the Global Burden of Disease Study 2010 , 2012, The Lancet.

[22]  V. Preedy,et al.  National Health and Nutrition Examination Survey , 2010 .

[23]  R C Brownson,et al.  A comparison of national estimates of obesity prevalence from the behavioral risk factor surveillance system and the national health and nutrition examination survey , 2006, International Journal of Obesity.

[24]  J. Schwartz,et al.  Health, wealth, and air pollution: advancing theory and methods. , 2003, Environmental health perspectives.

[25]  Trevor Hastie,et al.  Regularization Paths for Cox's Proportional Hazards Model via Coordinate Descent. , 2011, Journal of statistical software.

[26]  Daniel Krewski,et al.  Random effects Cox models: A Poisson modelling approach , 2003 .

[27]  James Macinko,et al.  Neighborhoods and obesity. , 2008, Nutrition reviews.

[28]  B Chaix,et al.  The influence of geographic life environments on cardiometabolic risk factors: a systematic review, a methodological assessment and a research agenda , 2011, Obesity reviews : an official journal of the International Association for the Study of Obesity.

[29]  Mika Kivimäki,et al.  Quantifying Neighbourhood Socioeconomic Effects in Clustering of Behaviour-Related Risk Factors: A Multilevel Analysis , 2012, PloS one.

[30]  K. Greenlund,et al.  Factors Explaining Excess Stroke Prevalence in the US Stroke Belt , 2009, Stroke.

[31]  R. Jain,et al.  Regression models to predict corrected weight, height and obesity prevalence from self-reported data: data from BRFSS 1999–2007 , 2010, International Journal of Obesity.