Multivariate data analysis of pollutant profiles: PCB levels across Europe.

It is not always recognised that standard multivariate analyses applied to pollution profile data (i.e. where data are relative amounts of pollutants expressed as proportions of their total) give rise to problems in the analysis and interpretation of results: a simple solution is to carry out analyses on log-ratios of proportions. However, while solving many problems, this approach is very sensitive to the issue of values below detection limits. These approaches have been applied to a dataset of the levels of 29 PCB congeners in ambient air samples across Europe during the summer of 2002. Multivariate descriptive methods (principal component analysis and cluster analysis) and inferential techniques (multivariate ANOVA, multiple linear and logistic regression) and graphical tools (2D and 3D plots, principal components plots, biplots and triangular diagrams) were used to analyse the proportions of five PCB homologues (tri-hepta). These established that there was considerable difference in the pollution profiles of the 71 samples: the greatest variation was between samples with differing ratios of tri-hexa and tri-hepta PCB homologues, and the samples showed little sign of consistent clusters. There was a significant difference between typical profiles from rural and urban areas such that urban samples (and those with high total PCBs) had higher proportions of tetra- and tri-PCBs compared to hexa- and hepta-PCBs.

[1]  James E. Dunn,et al.  Effects of combustion parameters on polychlorinated dibenzodioxin and dibenzofuran homologue profiles from municipal waste and coal co-combustion , 1998 .

[2]  H. Rollinson Using Geochemical Data: Evaluation, Presentation, Interpretation , 1993 .

[3]  Enrico Savazzi,et al.  Aspects of Multivariate Statistical Analysis in Geology , 1999 .

[4]  J. Cerhan,et al.  Epidemiologic Evaluation of Measurement Data in the Presence of Detection Limits , 2004, Environmental Health Perspectives.

[5]  Brian Everitt,et al.  Principles of Multivariate Analysis , 2001 .

[6]  V. Pawlowsky-Glahn,et al.  Dealing with Zeros and Missing Values in Compositional Data Sets Using Nonparametric Imputation , 2003 .

[7]  John Aitchison,et al.  The Statistical Analysis of Compositional Data , 1986 .

[8]  Jorge Cadima Departamento de Matematica Loading and correlations in the interpretation of principle compenents , 1995 .

[9]  J. Aitchison,et al.  Biplots of Compositional Data , 2002 .

[10]  R. Renner,et al.  Endmember analysis of heavy-metal pollution in surficial sediments from the Gulf of Gdansk and the southern Baltic Sea off Poland , 1998 .

[11]  J. Aitchison Reducing the dimensionality of compositional data sets , 1984 .

[12]  D. Howel,et al.  Does long term residency near industry have an impact on the body burden of polychlorinated dibenzo-p-dioxins, furans, and polychlorinated biphenyls in older women? , 2005, Occupational and Environmental Medicine.

[13]  J. Calambokidis,et al.  Harbor seals (Phoca vitulina) in British Columbia, Canada, and Washington State, USA, reveal a combination of local and global polychlorinated biphenyl, dioxin, and furan signals , 2004, Environmental toxicology and chemistry.

[14]  John Aitchison,et al.  Relative variation diagrams for describing patterns of compositional variability , 1990 .

[15]  Tx Station Stata Statistical Software: Release 7. , 2001 .

[16]  Nicholas J. Cox,et al.  Speaking Stata: Graphing Categorical and Compositional Data , 2004 .

[17]  C. M. Jackson,et al.  Compositional Data Analysis of Some Alkaline Glasses , 2005 .

[18]  John Aitchison,et al.  Measures of location of compositional data sets , 1989 .

[19]  R. Olea,et al.  Geostatistical Analysis of Compositional Data , 2004 .

[20]  B. Bonn Polychlorinated dibenzo-p-dioxin and dibenzofuran concentration profiles in sediment and fish tissue of the Willamette Basin, Oregon , 1998 .

[21]  Desire L. Massart,et al.  Regularised discriminant analysis (RDA) - modelling for the binary discrimination between pollution types , 1997 .

[22]  Dennis R. Helsel,et al.  Less than obvious - statistical treatment of data below the detection limit , 1990 .

[23]  J. Aitchison On criteria for measures of compositional difference , 1992 .

[24]  K. Jones,et al.  Passive air sampling of PCBs, PBDEs, and organochlorine pesticides across Europe. , 2004, Environmental science & technology.

[25]  K. Pearson Mathematical contributions to the theory of evolution.—On a form of spurious correlation which may arise when indices are used in the measurement of organs , 1897, Proceedings of the Royal Society of London.