Harmful Algae Bloom Prediction Model for Western Lake Erie Using Stepwise Multiple Regression and Genetic Programming

The Great Lakes are among the most important freshwater bodies, providing water resources and supporting related businesses in the northeastern part of North America. However, harmful algal blooms (HABs) have become more frequent and more severe in these lakes, threatening lake environments and economies. Researchers have studied the factors that influence HAB characteristics using a variety of scientific methods. In this study, candidate predictor and predictand variables were collected from various data sources, and eight final predictors and one predictand were then selected based on the correlation between predictors and predictand. The study tests two machine learning techniques, Stepwise Multiple Regression (SMR) and Genetic Programming (GP), for forecasting monthly HAB indicators in Western Lake Erie from July to October. SMR and GP models were built with the selected input variables for two training periods, 2002 to 2011 and 2002 to 2014. The Spearman rank correlation coefficient was used to choose the input variable set for each HAB month, considering 224 different combinations of lag time and averaging period. The correlation coefficient of the SMR models increased from 0.71 to 0.78 when the training period was extended. The GP models followed a similar trend
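The lag-time and averaging-period screening described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the toy series, and the small 3 x 2 search grid are all assumptions made for illustration, and the abstract does not say how the 224 combinations are split between lags and averaging periods. For each (lag, window) pair, the predictor is averaged over `window` time steps ending `lag` steps before the target month, and the pair with the largest absolute Spearman coefficient is kept.

```python
from itertools import product
from statistics import mean


def average_ranks(values):
    """Rank values from 1..n; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # mean of 1-based positions i..j
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman rho computed as the Pearson correlation of the rank vectors."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant series carries no rank information
    return cov / (var_x * var_y) ** 0.5


def lagged_average(series, target_idx, lag, window):
    """Mean of `window` values ending `lag` steps before target_idx."""
    end = target_idx - lag
    start = end - window + 1
    if start < 0:
        return None  # not enough history for this combination
    return mean(series[start:end + 1])


def best_lag_window(predictor, target, target_months, lags, windows):
    """Return (|rho|, rho, lag, window) for the strongest combination."""
    best = None
    for lag, window in product(lags, windows):
        pairs = [(lagged_average(predictor, t, lag, window), target[t])
                 for t in target_months]
        pairs = [(p, y) for p, y in pairs if p is not None]
        if len(pairs) < 3:
            continue  # too few points for a meaningful rank correlation
        rho = spearman([p for p, _ in pairs], [y for _, y in pairs])
        if best is None or abs(rho) > best[0]:
            best = (abs(rho), rho, lag, window)
    return best


# Toy data (hypothetical): one target observation every four time steps.
predictor = [20, 1, 30, 7, 50, 2, 10, 3, 10, 3, 50, 9, 40, 4, 20, 1, 30, 5, 40, 5]
target_months = [3, 7, 11, 15, 19]
# The target is a monotone function of the predictor two steps earlier.
target = {t: predictor[t - 2] ** 2 for t in target_months}

best = best_lag_window(predictor, target, target_months,
                       lags=[0, 1, 2], windows=[1, 2])
print(best)
```

On this toy series the search selects a lag of 2 and an averaging window of 1, because the target was constructed from the predictor two steps earlier. Because Spearman's coefficient works on ranks, the squared relationship is scored as a perfect monotone association even though it is not linear.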
