Objective Show the benefits of using a generalized linear mixed model (GLMM) to examine long-term trends in asthma syndrome data. Introduction Over the last decade, the application of syndromic surveillance systems has expanded beyond early event detection to include long-term disease trend monitoring. However, statistical methods employed for analyzing syndromic data tend to focus on early event detection. Generalized linear mixed models (GLMMs) may be a useful statistical framework for examining long-term disease trends because, unlike other models, GLMMs account for clustering common in syndromic data, and GLMMs can assess disease rates at multiple spatial and temporal levels (1). We show the benefits of the GLMM by using a GLMM to estimate asthma syndrome rates in New York City from 2007 to 2012, and to compare high and low asthma rates in Harlem and the Upper East Side (UES) of Manhattan. Methods Asthma related emergency department (ED) visits, and patient age and ZIP code were obtained from data reported daily to the NYC Department of Health and Mental Hygiene. Demographic data were obtained from 2010 US Census. ZIP codes that represented high and low asthma rates in Harlem and the UES of Manhattan were chosen for closer inspection. The ratio of weekly asthma syndrome visits to total ED visits was modeled with a Poisson GLMM with week and ZIP code random intercepts (2). Age and ethnicity were adjusted for because of their association with asthma rates (3). Results The GLMM showed citywide asthma rates remained stable from 2007 to 2012, but seasonal differences and significant inter-ZIP code variation were present. The Harlem ZIP code asthma rate that was estimated with the GLMM was significantly higher (5.83%, 95% CI: 3.65%, 9.49%) than the asthma rate in UES ZIP code (0.78%, 95% CI: 0.50%, 1.21%). A linear time component to the GLMM showed no appreciable change over time despite the seasonal fluctuations in asthma rate. GLMM based asthma rates are shown over time (Figure 1). Conclusions GLMMs have several strengths as statistical frameworks for monitoring trends including: Disease rates can be estimated at multiple spatial and temporal levels, Standard error adjustment for clustering in syndromic data allows for accurate, statistical assessment of changes over time and differences between subgroups, “Strength borrowed” (4) from the aggregated data informs small subgroups and smooths trends, Integration of covariate data reduces bias in estimated rates. GLMMs have previously been suggested for early event detection with syndromic surveillance data (5), but the versatility of GLMM makes them useful for monitoring long-term disease trends as well. In comparison to GLMMs, standard errors from single level GLMs do not account for clustering and can lead to inaccurate statistical hypothesis testing. Bayesian hierarchical models (6), share many of the strengths of GLMMS, but are more complicated to fit. In the future, GLMMs could provide a framework for grouping similar ZIP codes based on their model estimates (e.g. seasonal trends and influence on overall trend), and analyzing long-term disease trends with syndromic data.
[1]
Joseph Hilbe,et al.
Data Analysis Using Regression and Multilevel/Hierarchical Models
,
2009
.
[2]
L. Claudio,et al.
Socioeconomic factors and asthma hospitalization rates in New York City.
,
1999,
The Journal of asthma : official journal of the Association for the Care of Asthma.
[3]
R. Platt,et al.
A generalized linear mixed models approach for detecting incident clusters of disease in small areas, with an application to biological terrorism.
,
2004,
American journal of epidemiology.
[4]
P. Diggle.
Analysis of Longitudinal Data
,
1995
.
[5]
Po-Huang Chiang,et al.
Probabilistic Daily ILI Syndromic Surveillance with a Spatio-Temporal Bayesian Hierarchical Model
,
2010,
PloS one.
[6]
C. Pattie,et al.
People, Places and Regions: Exploring the Use of Multi-Level Modelling in the Analysis of Electoral Data
,
1992,
British Journal of Political Science.