A new approach to predict the missing values of algae during water quality monitoring programs based on a hybrid moth search algorithm and the random vector functional link network

Abstract Here, we propose a new alternative machine learning method that combines the advantage of the Random Vector Functional Link Network (RVFL) with Moth Search Algorithm (MSA) to predict the missing values of total algal count during water quality monitoring of surface waters that providing drinking water treatment plants in Fayoum, Egypt. Total of 34 water quality parameters was measured in 270 water samples during the period 2015–2017. The MSA algorithm was used for optimal selection of input features to improve the performance of the RVFL. The predicted missing values of the total algal count, by the proposed MSA-RVFL method, were strongly correlated with the real observed ones. The results of the MSA-RVFL were better than the Support Vector Machine (SVM) and the Adaptive Neural Fuzzy Inference System (ANFIS) models at different sizes of training tests. Compared with GA-RVFL and PSO-RVFL methods, using the MSA-RVFL was relevant to minimize the input variables and to reduce the processing time. The MSA-RVFL model could reduce the number of input variables from thirty-four to eighteen and eventually to four variables. The most significant variables selected by the MSA-RVFL to predict the total algal count were pH, NO3, P and Ca. Based on these four variables, the predicted values of algae were significantly matching with the real observations (R2 = 0.9594). Accordingly, this makes the MSA-RVFL model to be a useful cost-efficient tool during water quality monitoring programs. Finally, the MSA-RVFL showed higher performance whenever the number of inputs is large or small that gives our suggested method more advantages than the traditional ANN models.

[1]  Shengwu Xiong,et al.  Analysis of Bioactive Amino Acids from Fish Hydrolysates with a New Bioinformatic Intelligent System Approach , 2017, Scientific Reports.

[2]  Beidou Xi,et al.  Using Artificial Neural Network Models for Eutrophication Prediction , 2013 .

[3]  K. S. Patel,et al.  Combustion Characteristics of Sewage Sludge and Algae , 2016 .

[4]  D. Helsel,et al.  Statistical methods in water resources , 2020, Techniques and Methods.

[5]  T. D. Brock Lower pH Limit for the Existence of Blue-Green Algae: Evolutionary and Ecological Implications , 1973, Science.

[6]  M. Kornaros,et al.  Effect of pH on growth and lipid accumulation kinetics of the microalga Chlorella vulgaris grown heterotrophically under sulfur limitation. , 2016, Bioresource technology.

[7]  Aboul Ella Hassanien,et al.  Quantitative Structure-Activity Relationship Model for HCVNS5B inhibitors based on an Antlion Optimizer-Adaptive Neuro-Fuzzy Inference System , 2018, Scientific Reports.

[8]  Qinghua Cai,et al.  Forecasting Daily Chlorophyll a Concentration during the Spring Phytoplankton Bloom Period in Xiangxi Bay of the Three-Gorges Reservoir by Means of a Recurrent Artificial Neural Network , 2009 .

[9]  C. L. Philip Chen,et al.  A rapid learning and dynamic stepwise updating algorithm for flat neural networks and the application to time-series prediction , 1999, IEEE Trans. Syst. Man Cybern. Part B.

[10]  Songfeng Lu,et al.  Galaxy images classification using hybrid brain storm optimization with moth flame optimization , 2018, Journal of Astronomical Telescopes, Instruments, and Systems.

[11]  Joseph Shapiro,et al.  Blue-Green Algae: Why They Become Dominant , 1973, Science.

[12]  Dianhui Wang,et al.  Distributed learning for Random Vector Functional-Link networks , 2015, Inf. Sci..

[13]  Nhat-Duc Hoang,et al.  Predicting algal appearance on mortar surface with ensembles of adaptive neuro fuzzy models: a comparative study of ensemble strategies , 2018, Int. J. Mach. Learn. Cybern..

[14]  Han Yan,et al.  Adaptive neuro fuzzy inference system for classification of water quality status. , 2010, Journal of environmental sciences.

[15]  A. Malik,et al.  Artificial neural network modeling of the river water quality—A case study , 2009 .

[16]  Francisco Herrera,et al.  A study on the use of imputation methods for experimentation with Radial Basis Function Network classifiers handling missing attribute values: The good synergy between RBFNs and EventCovering method , 2010, Neural Networks.

[17]  Seok Soon Park,et al.  Factors affecting algal blooms in a man-made lake and prediction using an artificial neural network , 2014 .

[18]  Tong Wang,et al.  A four-node membrane element model with bending modification for one-step algorithm for bus rollover impact , 2015 .

[19]  Ke Chen,et al.  Hybrid particle swarm optimization with spiral-shaped mechanism for feature selection , 2019, Expert Syst. Appl..

[20]  Yoh-Han Pao,et al.  Unconstrained word-based approach for off-line script recognition using density-based random-vector functional-link net , 2000, Neurocomputing.

[21]  K. Chau,et al.  Neural network and genetic programming for modelling coastal algal blooms , 2006 .

[22]  Dirk Husmeier,et al.  Neural Networks for Predicting Conditional Probability Densities: Improved Training Scheme Combining EM and RVFL , 1998, Neural Networks.

[23]  Mohamad Javad Alizadeh,et al.  Development of wavelet-ANN models to predict water quality parameters in Hilo Bay, Pacific Ocean. , 2015, Marine pollution bulletin.

[24]  C. Chen,et al.  Effects of pH on the growth and carbon uptake of marine phytoplankton , 1994 .

[25]  Joon Ha Kim,et al.  Development of early-warning protocol for predicting chlorophyll-a concentration using machine learning models in freshwater and estuarine reservoirs, Korea. , 2015, The Science of the total environment.

[26]  S. Soyupak,et al.  Case studies on the use of neural networks in eutrophication modeling , 2000 .

[27]  Vassilis Z. Antonopoulos,et al.  The use of a Neural Network technique for the prediction of water quality parameters , 2005, Oper. Res..

[28]  Chiping Nieh,et al.  Using Mass Balance, Factor Analysis, and Multiple Imputation to Assess Health Effects of Water Quality , 2012 .

[29]  Ponnuthurai N. Suganthan,et al.  Random vector functional link network for short-term electricity load demand forecasting , 2016, Inf. Sci..

[30]  Nitin Muttil,et al.  Genetic programming for analysis and real-time prediction of coastal algal blooms , 2005 .

[31]  Carol A. Gotway,et al.  Statistical Methods in Water Resources , 1994 .

[32]  Danilo Comminiello,et al.  A semi-supervised random vector functional-link network based on the transductive framework , 2016, Inf. Sci..

[33]  Craig K Enders,et al.  A primer on the use of modern missing-data methods in psychosomatic medicine research. , 2006, Psychosomatic medicine.

[35]  T. Maekawa,et al.  Use of artificial neural network in the prediction of algal blooms. , 2001, Water research.

[36]  Francis Eng Hock Tay,et al.  Support vector machine with adaptive parameters in financial time series forecasting , 2003, IEEE Trans. Neural Networks.

[37]  Gaige Wang,et al.  Moth search algorithm: a bio-inspired metaheuristic algorithm for global optimization problems , 2016, Memetic Computing.

[38]  Junjie Li,et al.  Prediction of long-term temperature effect in structural health monitoring of concrete dams using support vector machines with Jaya optimizer and salp swarm algorithms , 2019, Adv. Eng. Softw..

[39]  Keun-Chang Kwak,et al.  Algal Bloom Prediction Using Extreme Learning Machine Models at Artificial Weirs in the Nakdong River, Korea , 2018, International journal of environmental research and public health.

[40]  Sangmok Lee,et al.  Improved Prediction of Harmful Algal Blooms in Four Major South Korea’s Rivers Using Deep Learning Models , 2018, International journal of environmental research and public health.

[41]  Zakariya Yahya Algamal,et al.  A new hybrid firefly algorithm and particle swarm optimization for tuning parameter estimation in penalized support vector machine with application in chemometrics , 2019, Chemometrics and Intelligent Laboratory Systems.

[42]  B. Dong,et al.  Applying support vector machines to predict building energy consumption in tropical region , 2005 .

[43]  N. Heidarzadeh,et al.  Eutrophication modelling of Amirkabir Reservoir (Iran) using an artificial neural network approach , 2019, Lakes & Reservoirs: Science, Policy and Management for Sustainable Use.

[44]  R. L. Holder Multiple regression in hydrology , 1986 .

[45]  Mac McKee,et al.  Effect of missing data on performance of learning algorithms for hydrologic predictions: Implications to an imputation technique , 2007 .

[46]  Jan-Tai Kuo,et al.  USING ARTIFICIAL NEURAL NETWORK FOR RESERVOIR EUTROPHICATION PREDICTION , 2007 .

[47]  Vikrant Bhateja,et al.  Deluge based Genetic Algorithm for feature selection , 2019, Evolutionary Intelligence.

[48]  C. L. Philip Chen,et al.  A rapid supervised learning neural network for function interpolation and approximation , 1996, IEEE Trans. Neural Networks.

[49]  Jlřina DvOřáková-Hladká The effect of calcium on the growth ofChlorella andScenedesmus , 1976, Biologia plantarum.

[50]  J. Diogène,et al.  Artificial neural network approach to population dynamics of harmful algal blooms in Alfacs Bay (NW Mediterranean): Case studies of Karlodinium and Pseudo-nitzschia , 2016 .

[51]  Friedrich Recknagel,et al.  Prediction and elucidation of phytoplankton dynamics in the Nakdong River (Korea) by means of a recurrent artificial neural network , 2001 .

[52]  A. E. Greenberg,et al.  Standard methods for the examination of water and wastewater : supplement to the sixteenth edition , 1988 .

[53]  Jian Shen,et al.  A data-driven modeling approach for simulating algal blooms in the tidal freshwater of James River in response to riverine nutrient loading , 2019, Ecological Modelling.

[54]  Aboul Ella Hassanien,et al.  Modified cuckoo search algorithm with rough sets for feature selection , 2018, Neural Computing and Applications.

[55]  Xizhao Wang,et al.  A review on neural networks with random weights , 2018, Neurocomputing.

[56]  Emilio Soria-Olivas,et al.  Hardware implementation methods in Random Vector Functional-Link Networks , 2013, Applied Intelligence.

[57]  Friedrich Recknagel,et al.  Towards a generic artificial neural network model for dynamic predictions of algal abundance in freshwater lakes , 2001 .

[58]  B. Moss,et al.  The Influence of Environmental Factors on the Distribution of Freshwater Algae: An Experimental Study: I. Introduction and the Influence of Calcium Concentration , 1972 .

[59]  Le Zhang,et al.  A survey of randomized algorithms for training neural networks , 2016, Inf. Sci..

[60]  Lutgarde M. C. Buydens,et al.  Using support vector machines for time series prediction , 2003 .

[61]  Yan Huang,et al.  Neural network modelling of coastal algal blooms , 2003 .

[62]  Dejan J. Sobajic,et al.  Learning and generalization characteristics of the random vector Functional-link net , 1994, Neurocomputing.

[63]  Raheb Bagherpour,et al.  A new hybrid ANFIS–PSO model for prediction of peak particle velocity due to bench blasting , 2016, Engineering with Computers.

[64]  Kwang-Seuk Jeong,et al.  Determination of sensitive variables regardless of hydrological alteration in artificial neural network model of chlorophyll a: Case study of Nakdong River , 2019, Ecological Modelling.

[65]  Dan Yang,et al.  An Improved Ensemble of Random Vector Functional Link Networks Based on Particle Swarm Optimization with Double Optimization Strategy , 2016, PloS one.

[66]  Antonino Staiano,et al.  A multilayer perceptron neural network-based approach for the identification of responsiveness to interferon therapy in multiple sclerosis patients , 2010, Inf. Sci..

[67]  Innocent Rangeti,et al.  Validity and Errors in Water Quality Data — A Review , 2015 .

[68]  Hyun Kang The prevention and handling of the missing data , 2013, Korean journal of anesthesiology.

[69]  P. N. Suganthan,et al.  A comprehensive evaluation of random vector functional link networks , 2016, Inf. Sci..

[70]  Dianhui Wang,et al.  Fast decorrelated neural network ensembles with random weights , 2014, Inf. Sci..

[71]  J. C. Goldman Letter: Carbon dioxide and pH: effect on species succession of algae. , 1973, Science.

[72]  Chun Kiat Chang,et al.  Prediction of water quality index in constructed wetlands using support vector machine , 2015, Environmental Science and Pollution Research.