Groundwater contamination sources identification based on the Long-Short Term Memory network

Abstract When simulation–optimization (S/O) is used for groundwater contamination source identification (GCSI), to reduce the calculation load and time generated by calling the simulation model, a surrogate model is often used instead of the simulation model. However, when the conversion relationship between the simulation model inputs and outputs is complex and the model is highly nonlinear, the commonly used approach of building a surrogate model may lose its advantage. This study use and check a deep learning method with the long-short term memory (LSTM) network which has great potential for characterizing the input–output conversion relationship of complex nonlinear numerical simulations, to a surrogate model of the simulation model. The accuracy of the surrogate model developed by the LSTM method was compared with that of commonly used surrogate models developed by the Kriging method, the radial basis function network method, and the kernel extreme learning machine method. The surrogate model with the highest accuracy was linked to the optimization model, and the optimization model was solved to identify the contamination source information. Results show that compared with the other three methods, the surrogate model constructed by the LSTM method had the best accuracy and generalization performance. LSTM is therefore an effective method of building a surrogate model. Linking the LSTM surrogate model to an optimization model and then solving the optimization model can save approximately 99% of the computing load and time otherwise required.

[1]  Yu Wang,et al.  Application of Mixed-Integer Nonlinear Optimization Programming Based on Ensemble Surrogate Model for Dense Nonaqueous Phase Liquid Source Identification in Groundwater , 2019, Environmental Engineering Science.

[2]  Jack P. C. Kleijnen,et al.  Robustness of Kriging when interpolating in random simulation with heterogeneous variances: Some experiments , 2005, Eur. J. Oper. Res..

[3]  S. Gorelick,et al.  Identifying sources of groundwater pollution: An optimization approach , 1983 .

[4]  R. Haftka,et al.  Multiple surrogates: how cross-validation errors can help us to obtain the best predictor , 2009 .

[5]  Jin Lin,et al.  A comparative research of different ensemble surrogate models based on set pair analysis for the DNAPL-contaminated aquifer remediation strategy optimization. , 2017, Journal of contaminant hydrology.

[6]  A. Bagtzoglou,et al.  Application of particle methods to reliable identification of groundwater pollution sources , 1992 .

[7]  Wenxi Lu,et al.  Modeling and uncertainty analysis of seawater intrusion based on surrogate models , 2019, Environmental Science and Pollution Research.

[8]  Wenxi Lu,et al.  A Kriging surrogate model coupled in simulation-optimization approach for identifying release history of groundwater sources. , 2016, Journal of contaminant hydrology.

[9]  Yuzhuo Fu,et al.  A hybrid ACO algorithm for capacitated vehicle routing problems , 2017, 2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC).

[10]  T. Skaggs,et al.  Recovering the release history of a groundwater contaminant , 1994 .

[11]  Giancarlo Sangalli,et al.  Capturing Small Scales in Elliptic Problems Using a Residual-Free Bubbles Finite Element Method , 2003, Multiscale Model. Simul..

[12]  Mazda Kompani-Zare,et al.  Groundwater depth and elevation interpolation by kriging methods in Mohr Basin of Fars province in Iran , 2010, Environmental monitoring and assessment.

[13]  Han Wang,et al.  Groundwater contamination sources identification based on kernel extreme learning machine and its effect due to wavelet denoising technique , 2020, Environmental Science and Pollution Research.

[14]  A. Krzyżak Nonlinear function learning using optimal radial basis function newtworks , 2001 .

[15]  Dong-Hoon Choi,et al.  Kriging interpolation methods in geostatistics and DACE model , 2002 .

[16]  S. Ashby,et al.  A parallel multigrid preconditioned conjugate gradient algorithm for groundwater flow simulations , 1996 .

[17]  Jürgen Schmidhuber,et al.  Framewise phoneme classification with bidirectional LSTM and other neural network architectures , 2005, Neural Networks.

[18]  Anthony J. Jakeman,et al.  A review of surrogate models and their application to groundwater modeling , 2015 .

[19]  Antônio de Pádua Braga,et al.  An efficient multi-objective learning algorithm for RBF neural network , 2010, Neurocomputing.

[20]  George F. Pinder,et al.  Application of the Digital Computer for Aquifer Evaluation , 1968 .

[21]  Shuai Li,et al.  Chance-constrained multi-objective optimization of groundwater remediation design at DNAPLs-contaminated sites using a multi-algorithm genetically adaptive method. , 2017, Journal of contaminant hydrology.

[22]  Hongli Deng,et al.  Sichuan dialect speech recognition with deep LSTM network , 2019, Frontiers of Computer Science.

[23]  T. Yeh,et al.  Simulation of karst spring discharge using a combination of time–frequency analysis methods and long short-term memory neural networks , 2020 .

[24]  J. Gómez-Hernández,et al.  Simultaneous identification of a contaminant source and hydraulic conductivity via the restart normal-score ensemble Kalman filter , 2018 .

[25]  John P. Boyd,et al.  Error saturation in Gaussian radial basis functions on a finite interval , 2010, J. Comput. Appl. Math..

[26]  Guang-Bin Huang,et al.  Trends in extreme learning machines: A review , 2015, Neural Networks.

[27]  William P. Ball,et al.  Application of inverse methods to contaminant source identification from aquitard diffusion profiles at Dover AFB, Delaware , 1999 .

[28]  Yan Shi,et al.  Recognition Model Based Feature Extraction and Kernel Extreme Learning Machine for High Dimensional Data , 2014 .

[29]  D. Ouazar,et al.  Ground Water Contaminant Source and Transport Parameter Identification by Correlation Coefficient Optimization , 1998 .

[30]  Wenxi Lu,et al.  Surrogate-Based Sensitivity Analysis and Uncertainty Analysis for DNAPL-Contaminated Aquifer Remediation , 2016 .

[31]  R. Srinivasan,et al.  Groundwater Modeling Under Variable Operating Conditions Using SWAT, MODFLOW and MT3DMS: a Catchment Scale Approach to Water Resources Management , 2018, Water Resources Management.

[32]  Manping,et al.  RADIAL BASIS FUNCTION INTERPOLATION IN SOBOLEV SPACES AND ITS APPLICATIONS , 2007 .

[33]  Alexandre Megretski,et al.  Fourier Series for Accurate, Stable, Reduced-Order Models in Large-Scale Linear Applications , 2005, SIAM J. Sci. Comput..

[34]  Paulin Coulibaly,et al.  Groundwater level forecasting using artificial neural networks , 2005 .

[35]  Wenxi Lu,et al.  Identification of groundwater contamination sources using a statistical algorithm based on an improved Kalman filter and simulation optimization , 2019, Hydrogeology Journal.

[36]  M. Ye,et al.  Developing a Long Short-Term Memory (LSTM) based model for predicting water table depth in agricultural areas , 2018, Journal of Hydrology.

[37]  Bryan A. Tolson,et al.  Review of surrogate modeling in water resources , 2012 .

[38]  Yi Ji,et al.  Identifying the release history of a groundwater contaminant source based on an ensemble surrogate model , 2019, Journal of Hydrology.

[39]  Atharv Bhosekar,et al.  Advances in surrogate based modeling, feasibility analysis, and optimization: A review , 2018, Comput. Chem. Eng..

[40]  Randhir Rawatlal,et al.  Response surface strategies in constructing statistical bubble flow models for the development of a novel bubble column simulation approach , 2012, Comput. Chem. Eng..

[41]  Thomas J. Santner,et al.  Design and analysis of computer experiments , 1998 .

[42]  E. Acar Various approaches for constructing an ensemble of metamodels using local measures , 2010 .

[43]  Alexander Popov Deep Learning Architecture for Part-of-Speech Tagging with Word and Suffix Embeddings , 2016, AIMSA.

[44]  Z. J. Kabala,et al.  Recovering the release history of a groundwater contaminant using a non-linear least-squares method. , 2000 .

[45]  Raghavan Srinivasan,et al.  Approximating SWAT Model Using Artificial Neural Network and Support Vector Machine 1 , 2009 .

[46]  Geoffrey E. Hinton,et al.  Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[47]  Micha Werner,et al.  Reduction of Monte-Carlo simulation runs for uncertainty estimation in hydrological modelling , 2003 .

[48]  Pak Kin Wong,et al.  Modeling and optimization of biodiesel engine performance using kernel-based extreme learning machine and cuckoo search , 2015 .

[49]  Bithin Datta,et al.  Identification of unknown groundwater pollution sources using classical optimization with linked simulation , 2011 .

[50]  Oskar von Stryk,et al.  A mixed-integer simulation-based optimization approach with surrogate functions in water resources management , 2008 .

[51]  Jian Song,et al.  Adaptive surrogate model based multiobjective optimization for coastal aquifer management , 2018, Journal of Hydrology.

[52]  Wenxi Lu,et al.  Comparative study of surrogate models for groundwater contamination source identification at DNAPL-contaminated sites , 2018, Hydrogeology Journal.

[53]  Adel Mellit,et al.  Radial Basis Function Network-based prediction of global solar radiation data: Application for sizing of a stand-alone photovoltaic system at Al-Madinah, Saudi Arabia , 2010 .

[54]  Chen Chen,et al.  Spectral-Spatial Classification of Hyperspectral Image Based on Kernel Extreme Learning Machine , 2014, Remote. Sens..

[55]  Jichun Wu,et al.  Deep Autoregressive Neural Networks for High‐Dimensional Inverse Problems in Groundwater Contaminant Source Identification , 2018, Water Resources Research.

[56]  R. Neupauer,et al.  Adjoint‐derived location and travel time probabilities for a multidimensional groundwater system , 2001 .

[57]  Mehdi Ghommem,et al.  Mode decomposition methods for flows in high-contrast porous media. Global-local approach , 2013, J. Comput. Phys..

[58]  Wenxi Lu,et al.  Application of ensemble surrogates and adaptive sequential sampling to optimal groundwater remediation design at DNAPLs-contaminated sites. , 2017, Journal of contaminant hydrology.

[59]  Jia-Yuan Guo,et al.  The application of 0-1 mixed integer nonlinear programming optimization model based on a surrogate model to identify the groundwater pollution source. , 2019, Journal of contaminant hydrology.

[60]  William W.-G. Yeh,et al.  Groundwater Management Using Model Reduction via Empirical Orthogonal Functions , 2008 .

[61]  Yaohua Xiong,et al.  Training Reformulated Radial Basis Function Neural Networks Capable of Identifying Uncertainty in Data Classification , 2006, IEEE Transactions on Neural Networks.

[62]  Adam J. Siade,et al.  Reduced order parameter estimation using quasilinearization and quadratic programming , 2012 .

[63]  K. Lee,et al.  A comparative study of artificial neural networks and support vector machines for predicting groundwater levels in a coastal aquifer , 2011 .

[64]  Bithin Datta,et al.  Optimal Identification of Ground-Water Pollution Sources and Parameter Estimation , 2001 .

[65]  B. Datta,et al.  Identification of groundwater pollution sources using GA-based linked simulation optimization model , 2006 .

[66]  N. Cressie The origins of kriging , 1990 .

[67]  Ying Zhao,et al.  Identifying groundwater contaminant sources based on a KELM surrogate model together with four heuristic optimization algorithms , 2020 .

[68]  Ranji S. Ranjithan,et al.  A parallel evolutionary strategy based simulation–optimization approach for solving groundwater source identification problems , 2009 .

[69]  Weixing Guo,et al.  MODFLOW/MT3DMS–Based Simulation of Variable‐Density Ground Water Flow and Transport , 2006, Ground water.

[70]  Peter Giesl,et al.  Construction of a local and global Lyapunov function for discrete dynamical systems using radial basis functions , 2008, J. Approx. Theory.

[71]  Radial basis function and its application in tourism management , 2018 .

[72]  Zoubeida Bargaoui,et al.  Comparison of two kriging interpolation methods applied to spatiotemporal rainfall , 2009 .

[73]  I. Dag,et al.  Numerical solutions of KdV equation using radial basis functions , 2008 .

[74]  M Tamer Ayvaz,et al.  A linked simulation-optimization model for solving the unknown groundwater pollution source identification problems. , 2010, Journal of contaminant hydrology.

[75]  Bithin Datta,et al.  Global Optimal Design of Ground Water Monitoring Network Using Embedded Kriging , 2009, Ground water.

[76]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[77]  G. Matheron Principles of geostatistics , 1963 .

[78]  Wenxi Lu,et al.  Ensemble of surrogates-based optimization for identifying an optimal surfactant-enhanced aquifer remediation strategy at heterogeneous DNAPL-contaminated sites , 2015, Comput. Geosci..

[79]  R. Healy,et al.  Modeling Variably Saturated Subsurface Solute Transport with MODFLOW‐UZF and MT3DMS , 2012, Ground water.

[80]  Amvrossios C. Bagtzoglou,et al.  Radial basis function neural network for hydrologic inversion: an appraisal with classical and spatio-temporal geostatistical techniques in the context of site characterization , 2009 .