Spatiotemporal prediction of fine particulate matter during the 2008 northern California wildfires using machine learning.

Estimating population exposure to particulate matter during wildfires can be difficult because of insufficient monitoring data to capture the spatiotemporal variability of smoke plumes. Chemical transport models (CTMs) and satellite retrievals provide spatiotemporal data that may be useful in predicting PM2.5 during wildfires. We estimated PM2.5 concentrations during the 2008 northern California wildfires using 10-fold cross-validation (CV) to select an optimal prediction model from a set of 11 statistical algorithms and 29 predictor variables. The variables included CTM output, three measures of satellite aerosol optical depth, distance to the nearest fires, meteorological data, and land use, traffic, spatial location, and temporal characteristics. The generalized boosting model (GBM) with 29 predictor variables had the lowest CV root mean squared error and a CV-R2 of 0.803. The most important predictor variable was the Geostationary Operational Environmental Satellite Aerosol/Smoke Product (GASP) Aerosol Optical Depth (AOD), followed by the CTM output and distance to the nearest fire cluster. Parsimonious models with various combinations of fewer variables also predicted PM2.5 well. Using machine learning algorithms to combine spatiotemporal data from satellites and CTMs can reliably predict PM2.5 concentrations during a major wildfire event.

[1]  P. van Spijk,et al.  On human health , 2014, Medicine, Health Care and Philosophy.

[2]  Yang Liu,et al.  Calibrating MODIS aerosol optical depth for predicting daily PM2.5 concentrations via statistical downscaling , 2014, Journal of Exposure Science and Environmental Epidemiology.

[3]  Arwa S. Sayegh,et al.  Comparing the Performance of Statistical Models for Predicting PM10 Concentrations , 2014 .

[4]  Zhang Ying,et al.  Using support vector regression to predict PM10 and PM2.5 , 2014 .

[5]  Sarah B Henderson,et al.  An empirical model to estimate daily forest fire smoke exposure over a large geographic area using air quality, meteorological, and remote sensing data , 2013, Journal of Exposure Science and Environmental Epidemiology.

[6]  T. Loboda,et al.  Modeling acute respiratory illness during the 2007 San Diego wildland fires using a coupled emissions-transport system and generalized additive modeling , 2013, Environmental Health.

[7]  High‐resolution MODIS aerosol retrieval during wildfire events in California for use in exposure assessment , 2013 .

[8]  Zev Ross,et al.  Application of the deletion/substitution/addition algorithm to selecting land use regression models for interpolating air pollution measurements in California , 2013 .

[9]  M. Jerrett,et al.  Spatiotemporal modeling of wildfire smoke exposure in Northern California using satellite data and chemical transport models , 2013 .

[10]  M. Brauer,et al.  Evaluation of a Wildfire Smoke Forecasting System as a Tool for Public Health Protection , 2013, Environmental health perspectives.

[11]  Bin Zhang,et al.  Predicting submicron air pollution indicators: a machine learning approach. , 2013, Environmental science. Processes & impacts.

[12]  M. G. Estes,et al.  Estimating ground-level PM(2.5) concentrations in the southeastern U.S. using geographically weighted regression. , 2013, Environmental research.

[13]  Sarah B Henderson,et al.  Time series analysis of fine particulate matter and asthma reliever dispensations in populations affected by forest fires , 2013, Environmental Health.

[14]  Dan Chen,et al.  Improving the accuracy of daily satellite-derived ground-level fine aerosol concentration estimates for North America. , 2012, Environmental science & technology.

[15]  J. Schwartz,et al.  Incorporating local land use regression and satellite aerosol optical depth in a hybrid model of spatiotemporal PM2.5 exposures in the Mid-Atlantic states. , 2012, Environmental science & technology.

[16]  Grant J. Williamson,et al.  The Relationship between Particulate Pollution Levels in Australian Cities, Meteorology, and Landscape Fire Activity Detected from MODIS Hotspots , 2012, PloS one.

[17]  Ana G Rappold,et al.  Cardio-respiratory outcomes associated with exposure to wildfire smoke are modified by measures of community health , 2012, Environmental Health.

[18]  Petros Koutrakis,et al.  Prediction of daily fine particulate matter concentrations using aerosol optical depth retrievals from the Geostationary Operational Environmental Satellite (GOES) , 2012, Journal of the Air & Waste Management Association.

[19]  S. Henderson,et al.  Measures of forest fire smoke exposure and their associations with respiratory health outcomes , 2012, Current opinion in allergy and clinical immunology.

[20]  Michael Brauer,et al.  Estimated Global Mortality Attributable to Smoke from Landscape Fires , 2012, Environmental health perspectives.

[21]  J. Schwartz,et al.  Assessing temporally and spatially resolved PM2.5 exposures for epidemiological studies using satellite aerosol optical depth measurements , 2011 .

[22]  Michal Krzyzanowski,et al.  Satellite-based estimates of ground-level fine particulate matter during extreme events: A case study of the Moscow fires in 2010 , 2011 .

[23]  M. Carraway,et al.  Peat Bog Wildfire Smoke Exposure in Rural North Carolina Is Associated with Cardiopulmonary Emergency Department Visits Assessed through Syndromic Surveillance , 2011, Environmental health perspectives.

[24]  K. Katsouyanni,et al.  Forest fires are associated with elevated mortality in a dense urban setting , 2011, Occupational and Environmental Medicine.

[25]  B. Hubbell,et al.  Improving the Linkages between Air Pollution Epidemiology and Quantitative Risk Assessment , 2011, Environmental health perspectives.

[26]  D. Edwards,et al.  CO source contribution analysis for California during ARCTAS-CARB , 2011 .

[27]  S. Henderson,et al.  Extreme air pollution events from bushfires and dust storms and their association with mortality in Sydney, Australia 1994-2007. , 2011, Environmental research.

[28]  M. Brauer,et al.  Three Measures of Forest Fire Smoke Exposure and Their Associations with Respiratory and Cardiovascular Health Outcomes in a Population-Based Cohort , 2011, Environmental health perspectives.

[29]  S. Henderson,et al.  Creating an Integrated Historical Record of Extreme Particulate Air Pollution Events in Australian Cities from 1994 to 2007 , 2011, Journal of the Air & Waste Management Association.

[30]  Duncan C. Thomas,et al.  Health consequences of forest fires in Indonesia , 2004, Demography.

[31]  Narayan Sastry,et al.  Forest fires, air pollution, and mortality in Southeast Asia , 2002, Demography.

[32]  Christopher A. Barnes,et al.  Completion of the 2006 National Land Cover Database for the conterminous United States. , 2011 .

[33]  S. K. Akagi,et al.  The Fire INventory from NCAR (FINN): a high resolution global model to estimate the emissions from open burning , 2010 .

[34]  A. Peters,et al.  Particulate Matter Air Pollution and Cardiovascular Disease: An Update to the Scientific Statement From the American Heart Association , 2010, Circulation.

[35]  R. Hoff,et al.  An Improved Method for Estimating Surface Fine Particle Concentrations Using Seasonally Adjusted Satellite Aerosol Optical Depth , 2010, Journal of the Air & Waste Management Association.

[36]  M. Brauer,et al.  Global Estimates of Ambient Fine Particulate Matter Concentrations from Satellite-Based Aerosol Optical Depth: Development and Application , 2010, Environmental health perspectives.

[37]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[38]  T. Lumley,et al.  Effects of Bushfire Smoke on Daily Mortality and Hospital Admissions in Sydney, Australia , 2010, Epidemiology.

[39]  Klaus Nordhausen,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Second Edition by Trevor Hastie, Robert Tibshirani, Jerome Friedman , 2009 .

[40]  R. Hoff,et al.  The Relation between Moderate Resolution Imaging Spectroradiometer (MODIS) Aerosol Optical Depth and PM2.5 over the United States: A Geographical Comparison by U.S. Environmental Protection Agency Regions , 2009, Journal of the Air & Waste Management Association.

[41]  Tze-san Lee,et al.  Risk factors associated with clinic visits during the 1999 forest fires near the Hoopa Valley Indian Reservation, California, USA , 2009, International journal of environmental health research.

[42]  Shobha Kondragunta,et al.  Comparison of GOES and MODIS Aerosol Optical Depth (AOD) to Aerosol Robotic Network (AERONET) AOD and IMPROVE PM2.5 Mass at Bondville, Illinois , 2009, Journal of the Air & Waste Management Association.

[43]  P. Gupta,et al.  Particulate matter air quality assessment using integrated surface, satellite, and meteorological products: Multiple regression approach , 2009 .

[44]  A. Kolbe,et al.  An extreme bushfire smoke pollution event: health impacts and public health challenges. , 2009, New South Wales public health bulletin.

[45]  Yang Liu,et al.  Estimating Regional Spatial and Temporal Variability of PM2.5 Concentrations Using Satellite Data, Meteorology, and Land Use Information , 2009, Environmental health perspectives.

[46]  M. Abramson,et al.  The impact of smoke on respiratory hospital outcomes during the 2002–2003 bushfire season, Victoria, Australia , 2009, Respirology.

[47]  B. Ostro,et al.  The relationship of respiratory and cardiovascular hospital admissions to the southern California wildfires of 2003 , 2008, Occupational and Environmental Medicine.

[48]  Max Kuhn,et al.  Building Predictive Models in R Using the caret Package , 2008 .

[49]  G. Pfister,et al.  Impacts of the fall 2007 California wildfires on surface ozone: Integrating local observations with global model simulations , 2008 .

[50]  Yang Liu,et al.  Spatiotemporal associations between GOES aerosol optical depth retrievals and ground-level PM2.5. , 2008, Environmental science & technology.

[51]  Michael E. Chang,et al.  Simulation of air quality impacts from prescribed fires on an urban area. , 2008, Environmental science & technology.

[52]  H. Fernando,et al.  Simulation of smoke plumes from agricultural burns: application to the San Luis/Rio Colorado airshed along the U.S./Mexico border. , 2007, The Science of the total environment.

[53]  S. Christopher,et al.  Multi year satellite remote sensing of particulate matter air quality over Sydney, Australia , 2007 .

[54]  Shobha Kondragunta,et al.  GOES Aerosol/Smoke Product (GASP) over North America: Comparisons to AERONET and MODIS observations , 2007 .

[55]  Kazuhiko Ito,et al.  A land use regression for predicting fine particulate matter concentrations in the New York City region , 2007 .

[56]  Michael Brauer,et al.  Application of land use regression to estimate long-term concentrations of traffic-related nitrogen oxides and fine particulate matter. , 2007, Environmental science & technology.

[57]  N Künzli,et al.  A land use regression model for predicting ambient fine particulate matter across Los Angeles, CA. , 2007, Journal of environmental monitoring : JEM.

[58]  M. Brauer,et al.  Woodsmoke Health Effects: A Review , 2007, Inhalation toxicology.

[59]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[60]  Kiros Berhane,et al.  Health effects of the 2003 Southern California wildfires on children. , 2006, American journal of respiratory and critical care medicine.

[61]  Jassim A. Al-Saadi,et al.  Integrating lidar and satellite optical depth with ambient monitoring for 3-dimensional particulate characterization , 2006 .

[62]  R. Koelemeijer,et al.  Comparison of spatial and temporal variations of aerosol optical thickness and particulate matter over Europe , 2006 .

[63]  Jun Wang,et al.  Satellite remote sensing of particulate matter and air quality assessment over global cities , 2006 .

[64]  R. Delfino,et al.  Exposure Assessment of Particulate Matter Air Pollution Before, During, and After the 2003 Southern California Wildfires , 2006 .

[65]  Shilu Tong,et al.  Air particulate pollution due to bushfires and respiratory hospital admissions in Brisbane, Australia , 2006, International journal of environmental health research.

[66]  Greg Ridgeway,et al.  Generalized Boosted Models: A guide to the gbm package , 2006 .

[67]  Youngihn Kho,et al.  GeoDa: An Introduction to Spatial Data Analysis , 2006 .

[68]  Wei-Zhen Lu,et al.  Potential assessment of the "support vector machine" method in forecasting ambient air pollutant trends. , 2005, Chemosphere.

[69]  Altaf Arain,et al.  A review and evaluation of intraurban air pollution exposure models , 2005, Journal of Exposure Analysis and Environmental Epidemiology.

[70]  Kurt Hornik,et al.  kernlab - An S4 Package for Kernel Methods in R , 2004 .

[71]  P. Elliott,et al.  A regression-based method for mapping traffic-related air pollution: application and testing in four contrasting urban environments. , 2000, The Science of the total environment.

[72]  S L Zeger,et al.  Exposure measurement error in time-series studies of air pollution: concepts and consequences. , 2000, Environmental health perspectives.

[73]  Stefan Sperlich,et al.  Generalized Additive Models , 2014 .

[74]  Ping Zhang Model Selection Via Multifold Cross Validation , 1993 .

[75]  V. Rich Personal communication , 1989, Nature.