Identification of new particle formation events with deep learning

Abstract. New particle formation (NPF) in the atmosphere is globally an important source of climate relevant aerosol particles. Occurrence of NPF events is typically analyzed by researchers manually from particle size distribution data day by day, which is time consuming and the classification of event types may be inconsistent. To get more reliable and consistent results, the NPF event analysis should be automatized. We have developed an automatic analysis method based on deep learning, a subarea of machine learning, for NPF event identification. To our knowledge, this is the first time that a deep learning method, i.e., transfer learning of a convolutional neural network (CNN), has successfully been used to automatically classify NPF events into different classes directly from particle size distribution images, similarly to how the researchers carry out the manual classification. The developed method is based on image analysis of particle size distributions using a pretrained deep CNN, named AlexNet, which was transfer learned to recognize NPF event classes (six different types). In transfer learning, a partial set of particle size distribution images was used in the training stage of the CNN and the rest of the images for testing the success of the training. The method was utilized for a 15-year-long dataset measured at San Pietro Capofiume (SPC) in Italy. We studied the performance of the training with different training and testing of image number ratios as well as with different regions of interest in the images. The results show that clear event (i.e., classes 1 and 2) and nonevent days can be identified with an accuracy of ca. 80 %, when the CNN classification is compared with that of an expert, which is a good first result for automatic NPF event analysis. In the event classification, the choice between different event classes is not an easy task even for trained researchers, and thus overlapping or confusion between different classes occurs. Hence, we cross-validated the learning results of CNN with the expert-made classification. The results show that the overlapping occurs, typically between the adjacent or similar type of classes, e.g., a manually classified Class 1 is categorized mainly into classes 1 and 2 by CNN, indicating that the manual and CNN classifications are very consistent for most of the days. The classification would be more consistent, by both human and CNN, if only two different classes are used for event days instead of three classes. Thus, we recommend that in the future analysis, event days should be categorized into classes of “quantifiable” (i.e., clear events, classes 1 and 2) and “nonquantifiable” (i.e., weak events, Class  3). This would better describe the difference of those classes: both formation and growth rates can be determined for quantifiable days but not both for nonquantifiable days. Furthermore, we investigated more deeply the days that are classified as clear events by experts and recognized as nonevents by the CNN and vice versa. Clear misclassifications seem to occur more commonly in manual analysis than in the CNN categorization, which is mostly due to the inconsistency in the human-made classification or errors in the booking of the event class. In general, the automatic CNN classifier has a better reliability and repeatability in NPF event classification than human-made classification and, thus, the transfer-learned pretrained CNNs are powerful tools to analyze long-term datasets. The developed NPF event classifier can be easily utilized to analyze any long-term datasets more accurately and consistently, which helps us to understand in detail aerosol–climate interactions and the long-term effects of climate change on NPF in the atmosphere. We encourage researchers to use the model in other sites. However, we suggest that the CNN should be transfer learned again for new site data with a minimum of ca. 150 figures per class to obtain good enough classification results, especially if the size distribution evolution differs from training data. In the future, we will utilize the method for data from other sites, develop it to analyze more parameters and evaluate how successfully CNN could be trained with synthetic NPF event data.

[1]  C. O'Dowd,et al.  Global analysis of continental boundary layer new particle formation based on long-term measurements , 2018, Atmospheric Chemistry and Physics.

[2]  Leo Kärkkäinen,et al.  Deep convolutional neural networks for estimating porous material parameters with ultrasound tomography. , 2017, The Journal of the Acoustical Society of America.

[3]  Demis Hassabis,et al.  Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm , 2017, ArXiv.

[4]  Demis Hassabis,et al.  Mastering the game of Go without human knowledge , 2017, Nature.

[5]  Liangpei Zhang,et al.  Pre-Trained AlexNet Architecture with Pyramid Pooling and Supervision for High Spatial Resolution Remote Sensing Image Scene Classification , 2017, Remote. Sens..

[6]  Antonio Di Noia,et al.  Combined neural network/Phillips-Tikhonov approach to aerosol retrievals over land from the NASA Research Scanning Polarimeter , 2017 .

[7]  Nikhil Buduma,et al.  Fundamentals of Deep Learning: Designing Next-Generation Machine Intelligence Algorithms , 2017 .

[8]  B. Wehner,et al.  Contributions of volatile and nonvolatile compounds (at 300°C) to condensational growth of atmospheric nanoparticles: An assessment based on 8.5 years of observations at the Central Europe background site Melpitz , 2017 .

[9]  L. Morawska,et al.  New particle formation in China: Current knowledge and further directions. , 2017, The Science of the total environment.

[10]  Xiang Li,et al.  Deep learning architecture for air quality predictions , 2016, Environmental Science and Pollution Research.

[11]  Warren Stanley,et al.  Evaluation of Machine Learning Algorithms for Classification of Primary Biological Aerosol using a new UV-LIF spectrometer , 2016 .

[12]  K. Lehtinen,et al.  Retrieval of aerosol optical depth from surface solar radiation measurementsusing machine learning algorithms, non-linear regression and a radiativetransfer-based look-up table , 2016 .

[13]  A. Ding,et al.  On secondary new particle formation in China , 2016, Frontiers of Environmental Science & Engineering.

[14]  Bo Du,et al.  Deep Learning for Remote Sensing Data: A Technical Tutorial on the State of the Art , 2016, IEEE Geoscience and Remote Sensing Magazine.

[15]  Taghi M. Khoshgoftaar,et al.  A survey of transfer learning , 2016, Journal of Big Data.

[16]  Ronald M. Summers,et al.  Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning , 2016, IEEE Transactions on Medical Imaging.

[17]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Soumith Chintala,et al.  Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[19]  Amir Hossein Alavi,et al.  Machine learning in geosciences and remote sensing , 2016 .

[20]  A. Ding,et al.  Aerosol size distribution and new particle formation in the western Yangtze River Delta of China: 2 years of measurements at the SORPES station , 2015 .

[21]  T. Petäjä,et al.  Technical note: New particle formation event forecasts during PEGASOS–Zeppelin Northern mission 2013 in Hyytiälä, Finland , 2015 .

[22]  Gui-Song Xia,et al.  Transferring Deep Convolutional Neural Networks for the Scene Classification of High-Resolution Remote Sensing Imagery , 2015, Remote. Sens..

[23]  M. Steinbacher,et al.  Analysis of long‐term aerosol size distribution data from Jungfraujoch with emphasis on free tropospheric conditions, cloud influence, and air mass transport , 2015 .

[24]  Koji Zettsu,et al.  Dynamically pre-trained deep recurrent neural networks using environmental monitoring data for predicting PM2.5 , 2015, Neural Computing and Applications.

[25]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[26]  Wei Gong,et al.  Transfer learning used to analyze the dynamic evolution of the dust aerosol , 2015 .

[27]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Jürgen Schmidhuber,et al.  Deep learning in neural networks: An overview , 2014, Neural Networks.

[29]  Yoshua Bengio,et al.  How transferable are features in deep neural networks? , 2014, NIPS.

[30]  T. Petäjä,et al.  Trends in atmospheric new-particle formation: 16 years of observations in a boreal-forest environment , 2014 .

[31]  Christoph U. Keller,et al.  Use of neural networks in ground-based aerosol retrievals from multi-angle spectropolarimetric observations , 2014 .

[32]  Sang-Woo Kim,et al.  Observation of new particle formation and growth under cloudy conditions at Gosan Climate Observatory, Korea , 2014, Meteorology and Atmospheric Physics.

[33]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[34]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[35]  The effect of cloudiness on new particle formation: investigation of radiation levels , 2014 .

[36]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[37]  P. Hari,et al.  Trends in new particle formation in eastern Lapland, Finland: effect of decreasing sulfur emissions from Kola Peninsula , 2013 .

[38]  George Christakos,et al.  Estimation of Citywide Air Pollution in Beijing , 2013, PloS one.

[39]  Razvan Pascanu,et al.  On the difficulty of training recurrent neural networks , 2012, ICML.

[40]  Marc'Aurelio Ranzato,et al.  Building high-level features using large scale unsupervised learning , 2011, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[41]  I. Riipinen,et al.  Cloud condensation nuclei production associated with atmospheric nucleation: a synthesis based on existing literature and new results , 2012 .

[42]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[43]  Sang Woo Kim,et al.  Observation of new particle formation and growth events in Asian continental outflow , 2012 .

[44]  Miikka Dal Maso,et al.  Measurement of the nucleation of atmospheric aerosol particles , 2012, Nature Protocols.

[45]  Nitish Srivastava,et al.  Improving neural networks by preventing co-adaptation of feature detectors , 2012, ArXiv.

[46]  Jürgen Schmidhuber,et al.  Transfer learning for Latin and Chinese characters with Deep Neural Networks , 2012, The 2012 International Joint Conference on Neural Networks (IJCNN).

[47]  Yoshua Bengio,et al.  Unsupervised and Transfer Learning Challenge: a Deep Learning Approach , 2011, ICML Unsupervised and Transfer Learning.

[48]  E. Asmi,et al.  Secondary new particle formation in Northern Finland Pallas site between the years 2000 and 2010 , 2011 .

[49]  Luca Maria Gambardella,et al.  Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence Flexible, High Performance Convolutional Neural Networks for Image Classification , 2022 .

[50]  A. Wiedensohler,et al.  Analysis of number size distributions of tropical free tropospheric aerosol particles observed at Pico Espejo (4765 m a.s.l.), Venezuela , 2011 .

[51]  William N. Venables,et al.  Modern Applied Statistics with S , 2010 .

[52]  C. O'Dowd,et al.  On the occurrence of open ocean particle production and growth events , 2010 .

[53]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[54]  Maria Cristina Facchini,et al.  Meteorological and trace gas factors affecting the number concentration of atmospheric Aitken ( D p = 50 nm) particles in the continental boundary layer: parameterization using a multivariate mixed effects model , 2010 .

[55]  G. Mann,et al.  Impact of nucleation on global CCN , 2009 .

[56]  F. Yu,et al.  Simulation of particle size distribution with a global aerosol model: contribution of nucleation to aerosol and CCN number concentrations , 2009 .

[57]  M. Komppula,et al.  Long term particle size distribution measurements at Mount Waliguan, a high-altitude site in inland China , 2009 .

[58]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[59]  Pasi Aalto,et al.  Characteristic features of air ions at Mace Head on the west coast of Ireland , 2008 .

[60]  Meinrat O. Andreae,et al.  Aerosol cloud precipitation interactions. Part 1. The nature and sources of cloud-active aerosols , 2008 .

[61]  G. Mann,et al.  Contribution of particle formation to global cloud condensation nuclei concentrations , 2008 .

[62]  An Algorithm for Automatic Classification of Two-dimensional Aerosol Data , 2007 .

[63]  M. Facchini,et al.  Using discriminant analysis as a nucleation event classification method , 2006 .

[64]  Yoshua Bengio,et al.  Greedy Layer-Wise Training of Deep Networks , 2006, NIPS.

[65]  Patrice Y. Simard,et al.  High Performance Convolutional Neural Networks for Document Processing , 2006 .

[66]  K. Lehtinen,et al.  The composition of nucleation and Aitken modes particles during coastal nucleation events: evidence for marine secondary organic contribution , 2006 .

[67]  Maria Cristina Facchini,et al.  Nucleation and growth of new particles in Po Valley, Italy , 2006 .

[68]  I. Riipinen,et al.  Identification and classification of the formation of intermediate ions measured in boreal forest , 2006 .

[69]  Heikki Mannila,et al.  A look at aerosol formation using data mining techniques , 2005 .

[70]  André Nel,et al.  ATMOSPHERE: Enhanced: Air Pollution-Related Illness: Effects of Particles , 2005 .

[71]  M. Facchini,et al.  Cloud condensation nucleus production from nucleation events at a highly polluted region , 2005 .

[72]  P. J. Huber Robust Statistics: Huber/Robust Statistics , 2005 .

[73]  Richard S. Sutton,et al.  Learning to predict by the methods of temporal differences , 1988, Machine Learning.

[74]  Miikka Dal Maso,et al.  Formation and growth of fresh atmospheric aerosols: eight years of aerosol size distribution data from SMEAR II, Hyytiälä, Finland , 2005 .

[75]  Cheol-Heon Jeong,et al.  Characteristics of nucleation and growth events of ultrafine particles measured in Rochester, NY. , 2004, Environmental science & technology.

[76]  Hanna Vehkamäki,et al.  Formation and growth rates of ultrafine atmospheric particles: a review of observations , 2004 .

[77]  R. Harrison,et al.  Observations of new particle formation in urban air , 2003 .

[78]  B. Brunekreef,et al.  Air pollution and health , 2002, The Lancet.

[79]  Brian D. Ripley,et al.  Modern Applied Statistics with S Fourth edition , 2002 .

[80]  J. Sarnat,et al.  Fine particulate air pollution and mortality in 20 U.S. cities. , 2001, The New England journal of medicine.

[81]  Arto Voutilainen,et al.  Statistical inversion of aerosol size measurement data , 2001 .

[82]  F. Dominici,et al.  Fine particulate air pollution and mortality in 20 U.S. cities, 1987-1994. , 2000, The New England journal of medicine.

[83]  David G. Stork,et al.  Pattern classification, 2nd Edition , 2000 .

[84]  Liisa Pirjola,et al.  Characteristics of the atmospheric particle formation events observed at a borel forest site in southern Finland , 2000 .

[85]  Shun-ichi Amari,et al.  Natural Gradient Works Efficiently in Learning , 1998, Neural Computation.

[86]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[87]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[88]  A. Berner,et al.  A new electromobility spectrometer for the measurement of aerosol size distributions in the size range from 1 to 1000 nm , 1991 .

[89]  B. Albrecht Aerosols, Cloud Microphysics, and Fractional Cloudiness , 1989, Science.

[90]  D. O. Hebb,et al.  The organization of behavior , 1988 .

[91]  Bernard Widrow,et al.  Adaptive switching circuits , 1988 .

[92]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[93]  S. Twomey Pollution and the Planetary Albedo , 1974 .

[94]  W. A. Clark,et al.  Simulation of self-organizing systems by digital computer , 1954, Trans. IRE Prof. Group Inf. Theory.

[95]  F. Attneave,et al.  The Organization of Behavior: A Neuropsychological Theory , 1949 .