Modeling algal atypical proliferation using the hybrid DE–MARS–based approach and M5 model tree in La Barca reservoir: A case study in northern Spain

Abstract Algal atypical proliferation is a consequence of water fertilization (also called eutrophication) and one of the main causes of the degradation of reservoir and lake ecosystems. Its intensification during the last decades has led the stakeholders to seek water management and restoration solutions, including those based on modelling approaches. In this way, this paper presents one reservoir eutrophication modelling based on a new hybrid algorithm that combines multivariate adaptive regression splines (MARS) and differential evolution (DE) to estimate the algal abnormal proliferation from physical-chemical and biological variables. This technique involves the optimization of the MARS hyperparameters during the training process. Additionally, an M5 model tree was fitted to the experimental data for comparison purposes. Apart from successfully forecasting algal atypical growth (coefficients of determination equal to 0.83 and 0.91), the model showed here can establish the significance of each biological and physical-chemical parameter of the algal enhanced growth. Finally, the main conclusions of this research work are exposed.

[1]  Dan Simon,et al.  Evolutionary Optimization Algorithms , 2013 .

[2]  R. Peters Predictive Model of the Effects on Lake Metabolism of Decreased Airborne Litterfall through Riparian Deforestation , 1995 .

[3]  P. J. García Nieto,et al.  Hybrid PSO–MARS–based model for forecasting a successful growth cycle of the Spirulina platensis from experimental data in open raceway ponds , 2015 .

[4]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[5]  L. C. Brown,et al.  The influence of climatic inputs on stream-flow pattern forecasting: case study of Upper Senegal River , 2018, Environmental Earth Sciences.

[6]  Thomas Stützle,et al.  Ant Colony Optimization , 2009, EMO.

[7]  Mahesh Pal,et al.  Estimation of Discharge and End Depth in Trapezoidal Channel by Support Vector Machines , 2007 .

[8]  Elizabeth S. Allman,et al.  Mathematical models in biology : an introduction , 2003 .

[9]  Ángel Borja,et al.  Assessing the environmental quality status in estuarine and coastal systems: Comparing methodologies and indices , 2008 .

[10]  Helmut Hillebrand,et al.  BIOVOLUME CALCULATION FOR PELAGIC AND BENTHIC MICROALGAE , 1999 .

[11]  Lionel Ho,et al.  Elucidating the factors influencing the biodegradation of cylindrospermopsin in drinking water sources , 2008, Environmental toxicology.

[12]  James J. Roberts,et al.  Seasonal and interannual effects of hypoxia on fish habitat quality in central Lake Erie , 2011 .

[13]  R. Steedman,et al.  Changes in phytoplankton communities following logging in the drainage basins of three boreal forest lakes in northwestern Ontario (Canada), 1991-2000 , 2003 .

[14]  Z. Yaseen,et al.  Reference evapotranspiration prediction using hybridized fuzzy model with firefly algorithm: Regional case study in Burkina Faso , 2018, Agricultural Water Management.

[15]  David J. Barnes,et al.  Introduction to Modeling for Biosciences , 2010 .

[16]  M. Clerc,et al.  Particle Swarm Optimization , 2006 .

[17]  Zaher Mundher Yaseen,et al.  Learning from Multiple Models Using Artificial Intelligence to Improve Model Prediction Accuracies: Application to River Flows , 2018, Water Resources Management.

[18]  J. C. Vega,et al.  Phytoplankton structure and dynamics in Lake Sanabria and Valparaíso reservoir (NW Spain) , 2000, Hydrobiologia.

[19]  Wengang Zhang,et al.  Assessment of soil liquefaction based on capacity energy concept and multivariate adaptive regression splines , 2015 .

[20]  Jürg Bloesch,et al.  Eutrophication of Lake Tasaul, Romania—proposals for rehabilitation , 2009, Environmental science and pollution research international.

[21]  Iztok Fister,et al.  Adaptation and Hybridization in Nature-Inspired Algorithms , 2015 .

[22]  J. R. Quinlan Learning With Continuous Classes , 1992 .

[23]  Yuehjen E. Shao,et al.  Mining the breast cancer pattern using artificial neural networks and multivariate adaptive regression splines , 2004, Expert Syst. Appl..

[24]  Lucien Hoffmann,et al.  Distribution of Hepatotoxic Cyanobacterial Blooms in Belgium and Luxembourg , 2005, Hydrobiologia.

[25]  Sheng-rui Wang,et al.  Effects of dissolved oxygen supply level on phosphorus release from lake sediments , 2008 .

[26]  Zaher Mundher Yaseen,et al.  Quantifying hourly suspended sediment load using data mining models: Case study of a glacierized Andean catchment in Chile , 2018, Journal of Hydrology.

[27]  C. Pérez-Martínez,et al.  Temporal occurrence of Ceratium hirundinella in Spanish reservoirs , 2001, Hydrobiologia.

[28]  Antonio Quesada,et al.  Toxicity of Aphanizomenon ovalisporum (Cyanobacteria) in a Spanish water reservoir , 2006 .

[29]  R. Rosenberg,et al.  Introduction to Environmental and Economic Consequences of Hypoxia , 2011 .

[30]  M. Teixeira,et al.  Comparing dissolved air flotation and conventional sedimentation to remove cyanobacterial cells of Microcystis aeruginosa: Part I: The key operating conditions , 2006 .

[31]  Charles B. Roosen,et al.  An introduction to multivariate adaptive regression splines , 1995, Statistical methods in medical research.

[32]  Bruce R. Kowalski,et al.  MARS: A tutorial , 1992 .

[33]  Dimitra Kitsiou,et al.  Coastal marine eutrophication assessment: a review on data analysis. , 2011, Environment international.

[34]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[35]  H. M. Vinkers,et al.  Multivariate adaptive regression splines—studies of HIV reverse transcriptase inhibitors , 2004 .

[36]  Jamie Bartram,et al.  Toxic Cyanobacteria in Water: a Guide to Their Public Health Consequences, Monitoring and Management Chapter 2. Cyanobacteria in the Environment 2.1 Nature and Diversity 2.1.1 Systematics , 2022 .

[37]  Francesco Vidoli,et al.  Evaluating the water sector in Italy through a two stage method using the conditional robust nonparametric frontier and multivariate adaptive regression splines , 2011, Eur. J. Oper. Res..

[38]  Özgür Kisi,et al.  Survey of different data-intelligent modeling strategies for forecasting air temperature using geographic information as model predictors , 2018, Comput. Electron. Agric..

[39]  D. Sano,et al.  Surface-retained organic matter of Microcystis aeruginosa inhibiting coagulation with polyaluminum chloride in drinking water treatment. , 2010, Water research.

[40]  S. Fields Global Nitrogen: Cycling out of Control , 2004, Environmental health perspectives.

[41]  P. Rocca,et al.  Differential Evolution as Applied to Electromagnetics , 2011, IEEE Antennas and Propagation Magazine.

[42]  David Ardia,et al.  DEoptim: An R Package for Global Optimization by Differential Evolution , 2009 .

[43]  Michael Karydis,et al.  Eutrophication assessment of coastal waters based on indicators: a literature review. , 2009 .

[44]  Mohammad Ali Ghorbani,et al.  What Is the Potential of Integrating Phase Space Reconstruction with SVM-FFA Data-Intelligence Model? Application of Rainfall Forecasting over Regional Scale , 2018, Water Resources Management.

[45]  Vitaliy Feoktistov Differential Evolution: In Search of Solutions , 2006 .

[46]  Ahmad Sharafati,et al.  The Integration of Nature-Inspired Algorithms with Least Square Support Vector Regression Models: Application to Modeling River Dissolved Oxygen Concentration , 2018, Water.

[47]  Min-Yuan Cheng,et al.  Accurately predicting building energy performance using evolutionary multivariate adaptive regression splines , 2014, Appl. Soft Comput..

[48]  M.H. Hassoun,et al.  Fundamentals of Artificial Neural Networks , 1996, Proceedings of the IEEE.

[49]  H. Paerl,et al.  Evidence for the Importance of Atmospheric Nitrogen Deposition to Eutrophic Lake Dianchi, China. , 2017, Environmental science & technology.

[50]  Yong Liu,et al.  Ecological–economic modeling as a tool for watershed management: A case study of Lake Qionghai watershed, China , 2008 .

[51]  J. Friedman Multivariate adaptive regression splines , 1990 .

[52]  A. Landis,et al.  Eutrophication potential of food consumption patterns. , 2010, Environmental science & technology.

[53]  O. Kisi Pan evaporation modeling using least square support vector machine, multivariate adaptive regression splines and M5 model tree , 2015 .

[54]  M. Puig,et al.  Intracellular and dissolved microcystin in reservoirs of the river Segura basin, Murcia, SE Spain. , 2005, Toxicon : official journal of the International Society on Toxinology.

[55]  Mahesh Pal,et al.  M5 model tree for land cover classification , 2006 .

[56]  Zaher Mundher Yaseen,et al.  Precipitation pattern modeling using cross-station perception: regional investigation , 2018, Environmental Earth Sciences.

[57]  Yuling Huang,et al.  Variation of Dissolved Oxygen in the Experiments of Occurrence & Disappearance for Microcystis Bloom☆ , 2013 .

[58]  Dimitri P. Solomatine,et al.  M5 Model Trees and Neural Networks: Application to Flood Forecasting in the Upper Reach of the Huai River in China , 2004 .

[59]  V. Smith,et al.  Low Nitrogen to Phosphorus Ratios Favor Dominance by Blue-Green Algae in Lake Phytoplankton , 1983, Science.

[60]  D. Findlay,et al.  Responses of Diatom and Chrysophyte Assemblages in Lake 227 Sediments to Experimental Eutrophication , 1994 .

[61]  George Tsirtsis,et al.  Ecological quality scales based on phytoplankton for the implementation of Water Framework Directive in the Eastern Mediterranean , 2010 .

[62]  R. Storn,et al.  Differential Evolution: A Practical Approach to Global Optimization (Natural Computing Series) , 2005 .

[63]  Andrea E. Olsson Particle Swarm Optimization: Theory, Techniques and Applications , 2010 .

[64]  Rainer Storn,et al.  Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces , 1997, J. Glob. Optim..

[65]  R. Dennis Cook,et al.  Cross-Validation of Regression Models , 1984 .

[66]  Larry Wasserman,et al.  All of Statistics: A Concise Course in Statistical Inference , 2004 .

[67]  G. E. Fogg,et al.  The Blue-Green Algae. , 1975 .

[68]  Ali Rahimikhoob,et al.  A Comparison Between Conventional and M5 Model Tree Methods for Converting Pan Evaporation to Reference Evapotranspiration for Semi-Arid Region , 2013, Water Resources Management.

[69]  C. Reynolds The Ecology of Phytoplankton , 2006 .

[70]  R. D. Grundy,et al.  Strategies for control of man-made eutrophication , 1971 .

[71]  Zhihua Cui,et al.  Swarm Intelligence and Bio-Inspired Computation: Theory and Applications , 2013 .

[72]  Mahesh Pal,et al.  M5 model tree based modelling of reference evapotranspiration , 2009 .

[73]  Vinay Kumar Gadi,et al.  Spatial and temporal variation of hydraulic conductivity and vegetation growth in green infrastructures using infiltrometer and visual technique , 2017 .

[74]  Ozgur Kisi,et al.  Non-tuned data intelligent model for soil temperature estimation: A new approach , 2018, Geoderma.

[75]  Lars-Anders Hansson,et al.  The biology of lakes and ponds , 1998 .

[76]  Anthony T. C. Goh,et al.  Multivariate adaptive regression splines for analysis of geotechnical engineering systems , 2013 .