Comparison of the results obtained by four receptor modelling methods in aerosol source apportionment studies

In this work the performance and theoretical background behind two of the most commonly used receptor modelling methods in aerosol science, principal components analysis (PCA) and positive matrix factorization (PMF), as well as multivariate curve resolution by alternating least squares (MCR-ALS) and weighted alternating least squares (MCR-WALS), are examined. The performance of the four methods was initially evaluated under standard operational conditions, and modifications regarding data pre-treatment were then included. The methods were applied using raw and scaled data, with and without uncertainty estimations. Strong similarities were found among the sources identified by PMF and MCR-WALS (weighted models), whereas discrepancies were obtained with MCR-ALS (unweighted model). Weighting of input data by means of uncertainty estimates was found to be essential to obtain robust and accurate factor identification. The use of scaled (as opposed to raw) data highlighted the contribution of trace elements to the compositional profiles, which was key to the correct interpretation of the nature of the sources. Our results validate the performance of MCR-WALS for aerosol pollution studies.

[1]  John D. Spengler,et al.  A QUANTITATIVE ASSESSMENT OF SOURCE CONTRIBUTIONS TO INHALABLE PARTICULATE MATTER POLLUTION IN METROPOLITAN BOSTON , 1985 .

[2]  Yuqiu Wang,et al.  Source apportionment of PM10 in six cities of northern China , 2007 .

[3]  Allen L. Robinson,et al.  Sources of organic aerosol: Positive matrix factorization of molecular marker data and comparison of results from different source apportionment models , 2007 .

[4]  Xavier Querol,et al.  Identification and characterisation of sources of PM10 in Madrid (Spain) by statistical methods , 2004 .

[5]  Philip K. Hopke,et al.  Source identification of airborne PM2.5 at the St. Louis‐Midwest Supersite , 2006 .

[6]  Sabine Van Huffel,et al.  Total least squares problem - computational aspects and analysis , 1991, Frontiers in applied mathematics.

[7]  B. Kowalski,et al.  Selectivity, local rank, three‐way data analysis and ambiguity in multivariate curve resolution , 1995 .

[8]  Xavier Querol,et al.  PM10 and PM2.5 source apportionment in the Barcelona Metropolitan area, Catalonia, Spain , 2001 .

[9]  Philip K. Hopke,et al.  Source apportionment of Baltimore aerosol from combined size distribution and chemical composition data , 2006 .

[10]  Jürgen W. Einax,et al.  Chemometrics in Environmental Analysis , 1997 .

[11]  P. Hopke,et al.  Source apportionment of particulate matter in Europe: A review of methods and results , 2008 .

[12]  Darren T. Andrews,et al.  Maximum likelihood principal component analysis , 1997 .

[13]  Romà Tauler,et al.  Investigation of geographical and temporal distribution of tropospheric ozone in Catalonia (North-East Spain) during the period 2000–2004 using multivariate data analysis methods , 2006 .

[14]  Romà Tauler,et al.  Resolution of a structural competition involving dimeric G-quadruplex and its C-rich complementary strand , 2006, Nucleic acids research.

[15]  R. Tauler Multivariate curve resolution applied to second order data , 1995 .

[16]  I. Jolliffe Principal Component Analysis , 2002 .

[17]  M. Viana,et al.  Identification of PM sources by principal component analysis (PCA) coupled with wind direction data. , 2006, Chemosphere.

[18]  Peter A. Scheff,et al.  Fine particulate source apportionment using data from the USEPA speciation trends network in Chicago, Illinois: Comparison of two source apportionment models , 2007 .

[19]  P. Faure,et al.  Evolution of the source apportionment of the lipidic fraction from sediments along the Fensch River, France: a multimolecular approach. , 2008, The Science of the total environment.

[20]  J. Schauer,et al.  Source contributions to carbonaceous aerosols in the Tennessee Valley Region , 2007 .

[21]  Yu Song,et al.  Source apportionment of PM2.5 in Beijing using principal component analysis/absolute principal component scores and UNMIX. , 2006, The Science of the total environment.

[22]  S. D. Jong,et al.  Handbook of Chemometrics and Qualimetrics , 1998 .

[23]  R. Henry Multivariate receptor modeling by N-dimensional edge detection , 2003 .

[24]  Romà Tauler,et al.  Chemometrics applied to unravel multicomponent processes and mixtures: Revisiting latest trends in multivariate resolution , 2003 .

[25]  Romà Tauler,et al.  Alternative calibration approaches for LC-MS quantitative determination of coeluted compounds in complex environmental mixtures using multivariate curve resolution. , 2007, Analytica chimica acta.

[26]  Gene H. Golub,et al.  Matrix computations , 1983 .

[27]  Margaret Werner-Washburne,et al.  BMC Bioinformatics BioMed Central Methodology article Multivariate curve resolution of time course microarray data , 2006 .

[28]  R. Henry,et al.  Extension of self-modeling curve resolution to mixtures of more than three components: Part 1. Finding the basic feasible region , 1990 .

[29]  S. Van Huffel,et al.  On the equivalence between total least squares and maximum likelihood PCA , 2005 .

[30]  P. Paatero,et al.  Positive matrix factorization: A non-negative factor model with optimal utilization of error estimates of data values† , 1994 .

[31]  Ian T. Jolliffe,et al.  Principal Component Analysis , 2002, International Encyclopedia of Statistical Science.

[32]  Philip K. Hopke,et al.  Comparison between sample-species specific uncertainties and estimated uncertainties for the source apportionment of the speciation trends network data , 2007 .

[33]  Romà Tauler,et al.  Chemometric modeling of main contamination sources in surface waters of Portugal , 2004, Environmental toxicology and chemistry.

[34]  A. Srivastava,et al.  Size distribution and source identification of total suspended particulate matter and associated heavy metals in the urban atmosphere of Delhi. , 2007, Chemosphere.

[35]  Ashutosh Kumar Singh,et al.  Treatment of nondetects in multivariate analysis of groundwater geochemistry data , 2002 .

[36]  E. I. Hamilton Receptor modelling in environmental chemistry , 1986 .