A Machine Learning Approach for Air Quality Prediction: Model Regularization and Optimization

In this paper, we tackle air quality forecasting by using machine learning to predict the hourly concentrations of air pollutants (e.g., ozone, particulate matter (PM2.5), and sulfur dioxide). Machine learning can efficiently train a model on big data by using large-scale optimization algorithms. Although some prior work applies machine learning to air quality prediction, most of these studies are restricted to a few years of data and simply train standard regression models (linear or nonlinear) to predict the hourly air pollution concentration. In this work, we propose refined models that predict the hourly air pollution concentration from the meteorological data of previous days by formulating the prediction over 24 h as a multi-task learning (MTL) problem. This formulation enables us to select a good model by comparing different regularization techniques. We propose a regularization that enforces the prediction models of consecutive hours to be close to each other, and we compare it with several typical regularizations for MTL, including standard Frobenius norm regularization, nuclear norm regularization, and ℓ2,1-norm regularization. Our experiments show that the proposed parameter-reducing formulations and consecutive-hour-related regularizations achieve better performance than existing standard regression models and existing regularizations.
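The regularizers named above can be illustrated with a minimal NumPy sketch. It rests on assumptions that are not stated in the abstract: linear hourly models stacked as the columns of a matrix `W` (features × hours), a squared-difference form for the consecutive-hour penalty, the common row-wise definition of the ℓ2,1 norm, and a plain gradient-descent solver. All function names are hypothetical, and the paper's actual formulation may differ.

```python
import numpy as np

def frobenius_penalty(W):
    """Squared Frobenius norm: sum of squared entries of W."""
    return float(np.sum(W ** 2))

def nuclear_penalty(W):
    """Nuclear norm: sum of the singular values of W."""
    return float(np.linalg.svd(W, compute_uv=False).sum())

def l21_penalty(W):
    """l2,1 norm (row-wise convention): sum over features of the l2 norm across hours."""
    return float(np.linalg.norm(W, axis=1).sum())

def consecutive_hour_penalty(W):
    """Assumed form: sum of squared differences between models of adjacent hours."""
    return float(np.sum(np.diff(W, axis=1) ** 2))

def consecutive_hour_grad(W):
    """Gradient of consecutive_hour_penalty with respect to W."""
    diff = np.diff(W, axis=1)          # shape (d, T-1): w_{t+1} - w_t
    G = np.zeros_like(W)
    G[:, 1:] += diff                   # each column is pulled toward its left neighbour
    G[:, :-1] -= diff                  # and toward its right neighbour
    return 2.0 * G

def fit_mtl(X, Y, lam=1.0, steps=2000):
    """Gradient descent on (1/2n)||XW - Y||_F^2 + (lam/2) * consecutive_hour_penalty(W).

    X: (n, d) features shared by all hours; Y: (n, T) one target column per hour.
    """
    n, d = X.shape
    T = Y.shape[1]
    W = np.zeros((d, T))
    # Step size from an upper bound on the gradient's Lipschitz constant
    # (the difference operator contributes at most 4 per unit of lam).
    L = np.linalg.norm(X, 2) ** 2 / n + 4.0 * lam
    for _ in range(steps):
        grad = X.T @ (X @ W - Y) / n + 0.5 * lam * consecutive_hour_grad(W)
        W -= grad / L
    return W

def make_synthetic(n=200, d=3, T=24, noise=0.5, seed=0):
    """Synthetic data (not real air quality data): identical true model for every hour."""
    rng = np.random.default_rng(seed)
    X = rng.standard_normal((n, d))
    w_true = rng.standard_normal((d, 1))
    Y = X @ np.repeat(w_true, T, axis=1) + noise * rng.standard_normal((n, T))
    return X, Y
```

With `X, Y = make_synthetic()`, calling `fit_mtl(X, Y, lam=0.0)` reduces to ordinary least squares fitted independently for each hour, while a positive `lam` couples the 24 hourly models and pulls adjacent columns of `W` toward each other. The other three penalties are shown only as evaluators; the nuclear and ℓ2,1 norms are non-smooth and would typically need a proximal or similar solver rather than the plain gradient step used here.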
