Design and implementation of a hybrid model based on two-layer decomposition method coupled with extreme learning machines to support real-time environmental monitoring of water quality parameters.

Accurate prediction of water quality parameters plays a crucial and decisive role in environmental monitoring, ecological systems sustainability, human health, aquaculture and improved agricultural practices. In this study a new hybrid two-layer decomposition model based on the complete ensemble empirical mode decomposition algorithm with adaptive noise (CEEMDAN) and the variational mode decomposition (VMD) algorithm coupled with extreme learning machines (ELM) and also least square support vector machine (LSSVM) was designed to support real-time environmental monitoring of water quality parameters, i.e. chlorophyll-a (Chl-a) and dissolved oxygen (DO) in a Lake reservoir. Daily measurements of Chl-a and DO for June 2012-May 2013 were employed where the partial autocorrelation function was applied to screen the relevant inputs for the model construction. The variables were then split into training, validation and testing subsets where the first stage of the model testing captured the superiority of the ELM over the LSSVM algorithm. To improve these standalone predictive models, a second stage implemented a two-layer decomposition with the model inputs decomposed in the form of high and low frequency oscillations, represented by the intrinsic mode function (IMF) through the CEEMDAN algorithm. The highest frequency component, IMF1 was further decomposed with the VMD algorithm to segregate key model input features, leading to a two-layer hybrid VMD-CEEMDAN model. The VMD-CEEMDAN-ELM model was able to reduce the root mean square and the mean absolute error by about 14.04% and 7.12% for the Chl-a estimation and about 5.33% and 4.30% for the DO estimation, respectively, compared with the standalone counterparts. Overall, the developed methodology demonstrates the robustness of the two-phase VMD-CEEMDAN-ELM model in identifying and analyzing critical water quality parameters with a limited set of model construction data over daily horizons, and thus, to actively support environmental monitoring tasks, especially in case of high-frequency, and relatively complex, real-time datasets.

[1]  Yanxue Wang,et al.  Research on variational mode decomposition and its application in detecting rub-impact fault of the rotor system , 2015 .

[2]  J. Adamowski,et al.  Assessing the suitability of extreme learning machines (ELM) for groundwater level prediction , 2017 .

[3]  Nelson F. F. Ebecken,et al.  Fuzzy modelling of chlorophyll production in a Brazilian upwelling system , 2009 .

[4]  Beidou Xi,et al.  Using Artificial Neural Network Models for Eutrophication Prediction , 2013 .

[5]  Daoliang Li,et al.  Prediction of Dissolved Oxygen Content in Aquaculture of Hyriopsis Cumingii Using Elman Neural Network , 2011, CCTA.

[6]  Walter Wildi,et al.  Antibiotic resistant bacteria/genes dissemination in lacustrine sediments highly increased following cultural eutrophication of Lake Geneva (Switzerland). , 2012, Chemosphere.

[7]  Chee Kheong Siew,et al.  Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[8]  Rob J Hyndman,et al.  Minimum Sample Size requirements for Seasonal Forecasting Models , 2007 .

[9]  Ozgur Kisi,et al.  Modelling of chemical oxygen demand by using ANNs, ANFIS and k-means clustering techniques , 2014 .

[10]  C. Willmott ON THE VALIDATION OF MODELS , 1981 .

[11]  Guang-Bin Huang,et al.  Trends in extreme learning machines: A review , 2015, Neural Networks.

[12]  Huihui Yu,et al.  Dissolved oxygen content prediction in crab culture using a hybrid intelligent method , 2016, Scientific Reports.

[13]  K. Coughlin,et al.  Eleven-year solar cycle signal throughout the lower atmosphere , 2004 .

[14]  Han Wang,et al.  Ensemble Based Extreme Learning Machine , 2010, IEEE Signal Processing Letters.

[15]  Juan Huan,et al.  Prediction of dissolved oxygen in aquaculture based on EEMD and LSSVM optimized by the Bayesian evidence framework , 2018, Comput. Electron. Agric..

[16]  Fábio Roland,et al.  Determinants of chlorophyll-a concentration in tropical reservoirs , 2014, Hydrobiologia.

[17]  Sue Ellen Haupt,et al.  Artificial Intelligence Methods in the Environmental Sciences , 2008 .

[18]  R. Deo,et al.  An extreme learning machine model for the simulation of monthly mean streamflow water level in eastern Queensland , 2016, Environmental Monitoring and Assessment.

[19]  Mohamed Elshemy,et al.  Data-driven modeling for water quality prediction case study: The drains system associated with Manzala Lake, Egypt , 2017 .

[20]  Zaher Mundher Yaseen,et al.  Predicting compressive strength of lightweight foamed concrete using extreme learning machine model , 2018, Adv. Eng. Softw..

[21]  Zhiqiang Deng,et al.  How Reliable Are ANN, ANFIS, and SVM Techniques for Predicting Longitudinal Dispersion Coefficient in Natural Rivers? , 2016 .

[22]  Xu Fan,et al.  A combined model based on CEEMDAN and modified flower pollination algorithm for wind speed forecasting , 2017 .

[23]  Vladimir M. Krasnopolsky,et al.  Some neural network applications in environmental sciences. Part II: advancing computational efficiency of environmental numerical models , 2003, Neural Networks.

[24]  Dominique Zosso,et al.  Variational Mode Decomposition , 2014, IEEE Transactions on Signal Processing.

[25]  Gabriel Rilling,et al.  On empirical mode decomposition and its algorithms , 2003 .

[26]  K. Lai,et al.  A new approach for crude oil price analysis based on Empirical Mode Decomposition , 2008 .

[27]  R. Deo,et al.  Forecasting long-term global solar radiation with an ANN algorithm coupled with satellite-derived (MODIS) land surface temperature (LST) for regional locations in Queensland , 2017 .

[28]  Zhenbo Li,et al.  A Hybrid Model for Dissolved Oxygen Prediction in Aquaculture based on Multi-scale Features , 2017 .

[29]  Yan Li,et al.  Soil moisture forecasting by a hybrid machine learning technique: ELM integrated with ensemble empirical mode decomposition , 2018, Geoderma.

[30]  R. Deo,et al.  Stream-flow forecasting using extreme learning machines: a case study in a semi-arid region in Iraq , 2016 .

[31]  T. Hu,et al.  Rainfall–runoff modeling using principal component analysis and neural network , 2007 .

[32]  Ozgur Kisi Modeling discharge-suspended sediment relationship using least square support vector machine , 2012 .

[33]  Guangren Qian,et al.  Method to predict key factors affecting lake eutrophication:a new approach based on Support Vector Regression model , 2015 .

[34]  Snejana Moncheva,et al.  Application of a new multi-metric phytoplankton index to the assessment of ecological status in marine and transitional waters , 2012 .

[35]  Jery R. Stedinger,et al.  Water Resources Systems Planning And Management , 2006 .

[36]  Jiping Xu,et al.  Research on Water Bloom Prediction Based on Least Squares Support Vector Machine , 2009, 2009 WRI World Congress on Computer Science and Information Engineering.

[37]  R. Deo,et al.  Input selection and performance optimization of ANN-based streamflow forecasts in the drought-prone Murray Darling Basin region using IIS and MODWT algorithm , 2017 .

[38]  Shaolong Sun,et al.  A novel hybrid decomposition-ensemble model based on VMD and HGWO for container throughput forecasting , 2018 .

[39]  Joon Ha Kim,et al.  Development of early-warning protocol for predicting chlorophyll-a concentration using machine learning models in freshwater and estuarine reservoirs, Korea. , 2015, The Science of the total environment.

[40]  Chuntian Cheng,et al.  A comparison of performance of several artificial intelligence , 2009 .

[41]  Dawei Han,et al.  Assessment of input variables determination on the SVM model performance using PCA, Gamma test, and forward selection techniques for monthly stream flow prediction , 2011 .

[42]  Salim Heddam,et al.  Use of Optimally Pruned Extreme Learning Machine (OP-ELM) in Forecasting Dissolved Oxygen Concentration (DO) Several Hours in Advance: a Case Study from the Klamath River, Oregon, USA , 2016, Environmental Processes.

[43]  Norden E. Huang,et al.  Ensemble Empirical Mode Decomposition: a Noise-Assisted Data Analysis Method , 2009, Adv. Data Sci. Adapt. Anal..

[44]  Junfeng Gao,et al.  An ensemble simulation approach for artificial neural network: An example from chlorophyll a simulation in Lake Poyang, China , 2017, Ecol. Informatics.

[45]  Ravinesh C. Deo,et al.  Multi-stage hybridized online sequential extreme learning machine integrated with Markov Chain Monte Carlo copula-Bat algorithm for rainfall forecasting , 2018, Atmospheric Research.

[46]  D. Legates,et al.  Evaluating the use of “goodness‐of‐fit” Measures in hydrologic and hydroclimatic model validation , 1999 .

[47]  Dianhui Wang,et al.  Extreme learning machines: a survey , 2011, Int. J. Mach. Learn. Cybern..

[48]  Yaguo Lei,et al.  Application of the EEMD method to rotor fault diagnosis of rotating machinery , 2009 .

[49]  Chu Zhang,et al.  Multi-step ahead wind speed forecasting using a hybrid model based on two-stage decomposition technique and AdaBoost-extreme learning machine , 2017 .

[50]  Özgür Kişi,et al.  Estimation of dissolved oxygen by using neural networks and neuro fuzzy computing techniques , 2017 .

[51]  Gastón Schlotthauer,et al.  Analysis of hydroclimatic variability and trends using a novel empirical mode decomposition: Application to the Paraná River Basin , 2014 .

[52]  Norden E. Huang,et al.  A review on Hilbert‐Huang transform: Method and its applications to geophysical studies , 2008 .

[53]  Patrick Flandrin,et al.  Noise-Assisted EMD Methods in Action , 2012, Adv. Data Sci. Adapt. Anal..

[54]  R. Deo,et al.  Forecasting effective drought index using a wavelet extreme learning machine (W-ELM) model , 2017, Stochastic Environmental Research and Risk Assessment.

[55]  A. A. Masrur Ahmed,et al.  Prediction of dissolved oxygen in Surma River by biochemical oxygen demand and chemical oxygen demand using the artificial neural networks (ANNs) , 2017 .

[56]  Shaolong Sun,et al.  Application of decomposition-ensemble learning paradigm with phase space reconstruction for day-ahead PM2.5 concentration forecasting. , 2017, Journal of environmental management.

[57]  R. Deo,et al.  Very short‐term reactive forecasting of the solar ultraviolet index using an extreme learning machine integrated with the solar zenith angle , 2017, Environmental research.

[58]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[59]  Qiuwen Chen,et al.  Integration of data mining techniques and heuristic knowledge in fuzzy logic modelling of eutrophication in Taihu Lake , 2003 .

[60]  Fanrang Kong,et al.  Adaptive variational mode decomposition based on artificial fish swarm algorithm for fault diagnosis of rolling bearings , 2017 .

[61]  Jan Adamowski,et al.  Comparison of machine learning models for predicting fluoride contamination in groundwater , 2017, Stochastic Environmental Research and Risk Assessment.

[62]  K. Chau,et al.  Prediction of rainfall time series using modular artificial neural networks coupled with data-preprocessing techniques , 2010 .

[63]  E. Tziritis,et al.  Environmental monitoring of Micro Prespa Lake basin (Western Macedonia, Greece): hydrogeochemical characteristics of water resources and quality trends , 2014, Environmental Monitoring and Assessment.

[64]  Norden E. Huang,et al.  Complementary Ensemble Empirical Mode Decomposition: a Novel Noise Enhanced Data Analysis Method , 2010, Adv. Data Sci. Adapt. Anal..

[65]  G. Hollis,et al.  The physical basis of the Lake Mikri Prespa systems: geology, climate, hydrology and water quality , 1997, Hydrobiologia.

[66]  J. Adamowski,et al.  Application of wavelet-artificial intelligence hybrid models for water quality prediction: a case study in Aji-Chay River, Iran , 2016, Stochastic Environmental Research and Risk Assessment.

[67]  Jan Adamowski,et al.  Multi-step water quality forecasting using a boosting ensemble multi-wavelet extreme learning machine model , 2018, Stochastic Environmental Research and Risk Assessment.

[68]  Johan A. K. Suykens,et al.  Weighted least squares support vector machines: robustness and sparse approximation , 2002, Neurocomputing.

[69]  Rahim Barzegar,et al.  Mapping groundwater contamination risk of multiple aquifers using multi-model ensemble of machine learning algorithms. , 2018, The Science of the total environment.

[70]  Jian-Da Wu,et al.  Speaker identification system using empirical mode decomposition and an artificial neural network , 2011, Expert Syst. Appl..

[71]  Roohollah Noori,et al.  A reduced-order adaptive neuro-fuzzy inference system model as a software sensor for rapid estimation of five-day biochemical oxygen demand , 2013 .

[72]  Yufang Wang,et al.  A novel hybrid decomposition-and-ensemble model based on CEEMD and GWO for short-term PM2.5 concentration forecasting , 2016 .

[73]  X. Wen,et al.  A wavelet-coupled support vector machine model for forecasting global incident solar radiation using limited meteorological dataset , 2016 .

[74]  A. M. El-Otify,et al.  Evaluation of the physicochemical and chlorophyll-a conditions of a subtropical aquaculture in Lake Nasser area, Egypt , 2015 .

[75]  Wei Liu,et al.  Applications of variational mode decomposition in seismic time-frequency analysis , 2016 .

[76]  C. L. Wu,et al.  Rainfall–runoff modeling using artificial neural network coupled with singular spectrum analysis , 2011 .

[77]  Maryam Abbasi,et al.  Uncertainty analysis of support vector machine for online prediction of five-day biochemical oxygen demand , 2015 .

[78]  N. Huang,et al.  A new view of nonlinear water waves: the Hilbert spectrum , 1999 .

[79]  David Mouillot,et al.  Cost effective prediction of the eutrophication status of lakes and reservoirs , 2010 .

[80]  O. Kisi,et al.  Short-term and long-term streamflow forecasting using a wavelet and neuro-fuzzy conjunction model , 2010 .

[81]  Yan-ping Wang,et al.  A forecasting and forewarning model for methane hazard in working face of coal mine based on LS-SVM , 2008 .

[82]  Mohanad S. Al-Musaylh,et al.  Two-phase particle swarm optimized-support vector regression hybrid model integrated with improved empirical mode decomposition with adaptive noise for multiple-horizon electricity demand forecasting , 2018 .

[83]  Ozgur Kisi,et al.  Modelling daily dissolved oxygen concentration using least square support vector machine, multivariate adaptive regression splines and M5 model tree , 2018 .

[84]  Tinghui Li,et al.  EMD-Based Study of the Volatility Mechanism in Economic Growth , 2017 .

[85]  Rahim Barzegar,et al.  Forecasting of groundwater level fluctuations using ensemble hybrid multi-wavelet neural network-based models. , 2017, The Science of the total environment.

[86]  Miki Hondzo,et al.  Prediction of lake water temperature, dissolved oxygen, and fish habitat under changing climate , 2017, Climatic Change.

[87]  Ozgur Kisi,et al.  Extreme learning machines: a new approach for modeling dissolved oxygen (DO) concentration with and without water quality variables as predictors , 2017, Environmental Science and Pollution Research.

[88]  N. Huang,et al.  The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis , 1998, Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[89]  Rahim Barzegar,et al.  Combining the advantages of neural networks using the concept of committee machine in the groundwater salinity prediction , 2016, Modeling Earth Systems and Environment.

[90]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[91]  Christian Albrecht,et al.  Concurrent evolution of ancient sister lakes and sister species: the freshwater gastropod genus Radix in lakes Ohrid and Prespa , 2008, Hydrobiologia.

[92]  Andrés Bueno-Crespo,et al.  Neural architecture design based on extreme learning machine , 2013, Neural Networks.

[93]  Patrick Flandrin,et al.  A complete ensemble empirical mode decomposition with adaptive noise , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[94]  Ping-Feng Pai,et al.  Predicting engine reliability by support vector machines , 2006 .

[95]  Asghar Asghari Moghaddam,et al.  A supervised committee machine artificial intelligent for improving DRASTIC method to assess groundwater contamination risk: a case study from Tabriz plain aquifer, Iran , 2016, Stochastic Environmental Research and Risk Assessment.

[96]  Guang-Bin Huang,et al.  Extreme learning machine: a new learning scheme of feedforward neural networks , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[97]  Olivier Grunder,et al.  Multi-step ahead electricity price forecasting using a hybrid model based on two-layer decomposition technique and BP neural network optimized by firefly algorithm , 2017 .

[98]  Pijush Samui,et al.  Forecasting Evaporative Loss by Least-Square Support-Vector Regression and Evaluation with Genetic Programming, Gaussian Process, and Minimax Probability Machine Regression: Case Study of Brisbane City , 2017 .

[99]  Chengwei Li,et al.  Friction Signal Denoising Using Complete Ensemble EMD with Adaptive Noise and Mutual Information , 2015, Entropy.

[100]  Mohammad Ali Abdoli,et al.  Prediction of municipal solid waste generation with combination of support vector machine and principal component analysis: A case study of Mashhad , 2009 .