A look at aerosol formation using data mining techniques

Abstract. Atmospheric aerosol particle formation is frequently observed throughout the atmosphere, but despite various attempts of explanation, the processes behind it remain unclear. In this study data mining techniques were used to find the key parameters needed for atmospheric aerosol particle formation to occur. A dataset of 8 years of 80 variables collected at the boreal forest station (SMEAR II) in Southern Finland was used, incorporating variables such as radiation, humidity, SO2, ozone and present aerosol surface area. This data was analyzed using clustering and classification methods. The aim of this approach was to gain new parameters independent of any subjective interpretation. This resulted in two key parameters, relative humidity and preexisting aerosol particle surface (condensation sink), capable in explaining 88% of the nucleation events. The inclusion of any further parameters did not improve the results notably. Using these two variables it was possible to derive a nucleation probability function. Interestingly, the two most important variables are related to mechanisms that prevent the nucleation from starting and particles from growing, while parameters related to initiation of particle formation seemed to be less important. Nucleation occurs only with low relative humidity and condensation sink values. One possible explanation for the effect of high water content is that it prevents biogenic hydrocarbon ozonolysis reactions from producing sufficient amounts of low volatility compounds, which might be able to nucleate. Unfortunately the most important biogenic hydrocarbon compound emissions were not available for this study. Another effect of water vapour may be due to its linkage to cloudiness which may prevent the formation of nucleating and/or condensing vapours. A high number of preexisting particles will act as a sink for condensable vapours that otherwise would have been able to form sufficient supersaturation and initiate the nucleation process.

[1]  P. Hari,et al.  Atmospheric trace gas and aerosol particle concentration measurements in Eastern Lapland, Finland 1992-2001 , 2003 .

[2]  Miikka Dal Maso,et al.  Long-term measurements of surface fluxes above a Scots pine forest in Hyytiälä, southern Finland, 1996-2001 , 2003 .

[3]  P. Hari,et al.  Long-term field measurements of atmosphere-surface interactions in boreal forest combining forest ecology, micrometeorology, aerosol physics and atmospheric chemistry , 1998 .

[4]  Pasi Aalto,et al.  Aerosol formation: Atmospheric particles from organic vapours , 2002, Nature.

[5]  L. Pirjola,et al.  Modelling the formation of H2SO4–H2O particles in rural, urban and marine conditions , 1998 .

[6]  G. Shields,et al.  Thermodynamics of forming water clusters at various temperatures and pressures by Gaussian-2, Gaussian-3, complete basis set-QB3, and complete basis set-APNO model chemistries; implications for atmospheric chemistry. , 2004, Journal of the American Chemical Society.

[7]  H. Hotelling Analysis of a complex of statistical variables into principal components. , 1933 .

[8]  R. Hillamo,et al.  Number size distributions and concentrations of marine aerosols: Observations during a cruise between the English Channel and the coast of Antarctica , 2002 .

[9]  Ari Laaksonen,et al.  Organic aerosol formation via sulphate cluster activation , 2004 .

[10]  G. Moortgat,et al.  Sesquiterpene ozonolysis: Origin of atmospheric new particle formation from biogenic hydrocarbons , 2003 .

[11]  K. Lehtinen,et al.  Measurements in a highly polluted Asian mega city: observations of aerosol number size distribution, modal parameters and nucleation events , 2004 .

[12]  F. Arnold,et al.  Cosmic ray‐induced aerosol‐formation: First observational evidence from aircraft‐based ion mass spectrometer measurements in the upper troposphere , 2002 .

[13]  Measurements of the concentration of carbon di oxide at mauna loa observatory hawaii usa , 1982 .

[14]  Ü. Rannik,et al.  Estimates of the annual net carbon and water exchange of forests: the EUROFLUX methodology , 2000 .

[15]  K. Lehtinen,et al.  Ion production rate in a boreal forest based on ion, particle and radiation measurements , 2004 .

[16]  Ü. Rannik,et al.  Gap filling strategies for defensible annual sums of net ecosystem exchange , 2001 .

[17]  N. Fuchs,et al.  HIGH-DISPERSED AEROSOLS , 1971 .

[18]  Nello Cristianini,et al.  Kernel Methods for Pattern Analysis , 2004 .

[19]  G. Moortgat,et al.  New particle formation during a - and b -pinene oxidation by O 3 , OH and NO 3 , and the influence of water vapour: particle size distribution studies , 2002 .

[20]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[21]  On the surface layer similarity at a complex forest site , 1998 .

[22]  Pasi Aalto,et al.  Atmospheric Chemistry and Physics on the Growth of Nucleation Mode Particles: Source Rates of Condensable Vapor in Polluted and Clean Environments , 2022 .

[23]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Heikki Mannila,et al.  Principles of Data Mining , 2001, Undergraduate Topics in Computer Science.

[25]  J. Seinfeld,et al.  Marine aerosol formation from biogenic iodine emissions , 2002, Nature.

[26]  Hanna Vehkamäki,et al.  Formation and growth rates of ultrafine atmospheric particles: a review of observations , 2004 .

[27]  Miikka Dal Maso,et al.  Formation and growth of fresh atmospheric aerosols: eight years of aerosol size distribution data from SMEAR II, Hyytiälä, Finland , 2005 .

[28]  A. Wiedensohler,et al.  New particle formation in the continental boundary layer: Meteorological and gas phase parameter influence , 2000 .

[29]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[30]  M. Kulmala,et al.  Nucleation events in the continental boundary layer: Influence of physical and meteorological parameters , 2001 .

[31]  M. Boy,et al.  Effects of air masses and synoptic weather on aerosol formation in the continental boundary layer , 2001 .

[32]  C. O'Dowd,et al.  Physical characterization of aerosol particles during nucleation events , 2001 .

[33]  A. G. Sutugin,et al.  Highly Dispersed Aerosols (Vysokodispersne Aerozoli) , 1971 .

[34]  Boris Bonn,et al.  Influence of water vapor on the process of new particle formation during monoterpene ozonolysis , 2002 .

[35]  H. Lihavainen,et al.  Observations of ultrafine aerosol particle formation and growth in boreal forest , 1997 .

[36]  D. Allen,et al.  Special issue of Atmospheric Environment on findings from EPA's Particulate Matter Supersites Program☆ , 2004 .

[37]  J. Seinfeld,et al.  Ternary nucleation of H2SO4, NH3, and H2O in the atmosphere , 1999 .

[38]  L. Pirjola,et al.  Stable sulphate clusters as a source of new atmospheric particles , 2000, Nature.

[39]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[40]  S. Twomey Pollution and the Planetary Albedo , 1974 .

[41]  P. Mcmurry,et al.  Measurement of Expected Nucleation Precursor Species and 3–500-nm Diameter Particles at Mauna Loa Observatory, Hawaii , 1995 .

[42]  Cleve B. Moler,et al.  Numerical computing with MATLAB , 2004 .