Cohort study designs are often used to assess the association between community-based ambient air pollution concentrations and health outcomes, such as mortality, development and prevalence of disease, and pulmonary function. Typically, a large number of subjects are enrolled in the study in each of a small number of communities. Fixed-site monitors are used to determine long-term exposure to ambient pollution. The association between community average pollution levels and health is determined after controlling for risk factors of the health outcome measured at the individual level (i.e., smoking). We present a new spatial regression model linking spatial variation in ambient air pollution to health. Health outcomes can be measured as continuous variables (pulmonary function), binary variables (prevalence of disease), or time-to-event data (survival or development of disease). The model incorporates risk factors measured at the individual level, such as smoking, and at the community level, such as air pollution. We demonstrate that the spatial autocorrelation in community health outcomes, an indication of not fully characterizing potentially confounding risk factors to the air pollution--health association, can be accounted for through the inclusion of location in the deterministic component of the model assessing the effects of air pollution on health or through a distance-decay spatial autocorrelation function in the stochastic component of the model, or both. We present a statistical approach that can be implemented for very large cohort studies. Our methods are illustrated with an analysis of the American Cancer Society cohort to determine whether the prevalence of heart disease is associated with concentrations of sulfate particles. From a statistical point of view, it appears that a location surface in the deterministic component of the model was preferred to a distance-decay autocorrelation structure in the model's stochastic component.
[1]
D. Krewski,et al.
Adjusting for temporal variation in the analysis of parallel time series of health and environmental variables.
,
1998,
Journal of exposure analysis and environmental epidemiology.
[2]
Stephen P. Kaluzny,et al.
S+SpatialStats: User’s Manual for Windows® and UNIX®
,
1998
.
[3]
D. Dockery,et al.
Particulate air pollution as a predictor of mortality in a prospective study of U.S. adults.
,
1995,
American journal of respiratory and critical care medicine.
[4]
A G Babiker,et al.
Floating absolute risk: an alternative to relative risk in survival and case-control analysis avoiding an arbitrary reference group.
,
1991,
Statistics in medicine.
[5]
J. Ware,et al.
Statistical issues in epidemiologic studies of the health effects of ambient acid aerosols
,
1988
.
[6]
W. Cleveland.
Robust Locally Weighted Regression and Smoothing Scatterplots
,
1979
.
[7]
G. Matheron.
Principles of geostatistics
,
1963
.
[8]
Daniel Krewski,et al.
Reanalysis of the Harvard Six Cities Study and the American Cancer Society Study of Particulate Air
,
2000
.
[9]
Daniel Krewski,et al.
Non-linear mixed regression models
,
1995
.
[10]
John R. Miron,et al.
Spatial Autocorrelation in Regression Analysis: A Beginner’s Guide
,
1984
.