Using Supervised Principal Components Analysis to Assess Multiple Pollutant Effects

Background: Many investigations of the adverse health effects of multiple air pollutants analyze the relevant time series by entering all of the pollutants simultaneously into a Poisson log-linear model. This approach can yield unstable parameter estimates when the pollutants are highly intercorrelated; therefore, traditional remedies for multicollinearity, such as principal component analysis (PCA), have been promoted in this context.

Objectives: PCA constructs its components without considering the relationship between the covariates and the adverse health outcomes. A refined version of PCA, supervised principal components analysis (SPCA), is proposed that specifically addresses this issue.

Methods: Models controlling for long-term trends and weather effects were used in conjunction with SPCA and with PCA to estimate the association between multiple air pollutants and mortality for U.S. cities. The methods were compared further via a simulation study.

Results: Simulation studies demonstrated that SPCA, unlike PCA, was successful in identifying the correct subset of multiple pollutants associated with mortality. Because of this property, SPCA and PCA returned different estimates for the relationship between air pollution and mortality.

Conclusions: Although a number of methods for assessing the effects of multiple pollutants have been proposed, such methods can falter in the presence of high correlation among pollutants. Both PCA and SPCA address this issue. By allowing pollutants that are not associated with the adverse health outcomes to be excluded from the selected mixture of pollutants, SPCA offers a critical improvement over PCA.
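The contrast between PCA and SPCA drawn in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the screening statistic (a univariate Poisson score test), the threshold value, the simulated data, and every function name are assumptions made for illustration, and the confounder adjustment for long-term trends and weather is omitted. The sketch screens pollutants by their association with daily death counts, then takes the first principal component of the retained pollutants only, and compares its loadings with those of ordinary PCA on all pollutants.

```python
import numpy as np

def standardize(X):
    """Center each column and scale it to unit variance."""
    return (X - X.mean(axis=0)) / X.std(axis=0)

def poisson_score_z(Z, y):
    """Univariate Poisson score statistic for each column of standardized Z.

    Tests beta_j = 0 in log E[y] = alpha + beta_j * z_j (assumed screening
    rule; no confounders are included in this sketch).
    """
    mu = y.mean()
    score = Z.T @ (y - mu)              # score for each pollutant
    info = mu * (Z ** 2).sum(axis=0)    # Fisher information at beta_j = 0
    return score / np.sqrt(info)

def first_pc(Z):
    """Loadings and scores of the first principal component of Z."""
    _, _, vt = np.linalg.svd(Z, full_matrices=False)
    return vt[0], Z @ vt[0]

def supervised_pc(X, y, threshold):
    """Screen pollutants by outcome association, then run PCA on the rest."""
    Z = standardize(X)
    keep = np.abs(poisson_score_z(Z, y)) > threshold
    loadings = np.zeros(X.shape[1])     # excluded pollutants get zero weight
    scores = np.zeros(X.shape[0])
    if keep.any():
        kept_loadings, scores = first_pc(Z[:, keep])
        loadings[keep] = kept_loadings
    return keep, loadings, scores

# Hypothetical simulated data: pollutants 0-1 share one source and affect
# mortality; pollutants 2-4 share another source and have no effect.
rng = np.random.default_rng(0)
n_days = 2000
source_a = rng.normal(size=n_days)
source_b = rng.normal(size=n_days)
noise = 0.5 * rng.normal(size=(n_days, 5))
X = np.column_stack([source_a, source_a, source_b, source_b, source_b]) + noise
log_mean = np.log(50.0) + 0.02 * (X[:, 0] + X[:, 1])
deaths = rng.poisson(np.exp(log_mean))

keep, spca_loadings, spca_scores = supervised_pc(X, deaths, threshold=4.0)
pca_loadings, pca_scores = first_pc(standardize(X))

print("retained pollutants:", np.flatnonzero(keep))
print("SPCA loadings:", np.round(spca_loadings, 2))
print("PCA loadings: ", np.round(pca_loadings, 2))
```

In this constructed example the three unassociated pollutants dominate the overall variance, so the first ordinary principal component is driven by them, whereas the screened component places weight only on the pollutants retained by the outcome-based filter. In a real analysis the component scores would enter a Poisson log-linear model alongside the trend and weather terms.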
