A Review of Surrogate Models and Their Ensembles to Develop Saltwater Intrusion Management Strategies in Coastal Aquifers

The complex physical processes in a typical coastal aquifer system with transient inputs to numerical simulation models (NSM) result in substantial computational burden in a coupled simulation–optimization (S/O) approach. In such situations, an approximate emulator of the complex physical processes provides a computationally efficient alternative to the NSM. The reliability of these surrogate models (SM) within the coupled S/O approach depends on how accurately they capture the trend of the underlying physical processes. Moreover, these SMs are often associated with prediction uncertainties, which hinder optimality of the solution of the coupled S/O methodology. In this review article, we summarize ensemble approach of combining data-driven SMs to address this prediction uncertainty. Different techniques of ensemble formation as well as their relative advantages and disadvantages are also discussed. Although a wide range of data-driven SMs have been used to approximate associated physical processes of coastal aquifers, the use of ensemble SMs is quite limited. Moreover, these ensemble-based modelling approaches are based on manipulating the training data set, i.e., using different realizations of training data set to train individual SMs within the ensemble. Although ensemble formation by combining multiple SMs based on different algorithms can be found in other application domains, the application of ensemble SMs in the prediction of saltwater intrusion processes has not been developed yet. In addition, more advanced ensemble surrogate-modelling approaches are yet to be established in the context of developing regional scale saltwater intrusion management models.

[1]  Bithin Datta,et al.  Optimal management of groundwater extraction to control saltwater intrusion in multi-layered coastal aquifers using ensembles of adaptive neuro-fuzzy inference system , 2017 .

[2]  Bithin Datta,et al.  Multi-objective management of saltwater intrusion in coastal aquifers using genetic programming and modular neural network based surrogate models. , 2010 .

[3]  Robert Piché,et al.  Mixture surrogate models based on Dempster-Shafer theory for global optimization problems , 2011, J. Glob. Optim..

[4]  B. Datta,et al.  Artificial Neural Networks Approximation of Density Dependent Saltwater Intrusion Process in Coastal Aquifers , 2007 .

[5]  Kalyanmoy Deb,et al.  Controlled Elitist Non-dominated Sorting Genetic Algorithms for Better Convergence , 2001, EMO.

[6]  Bithin Datta,et al.  Design of an Optimal Compliance Monitoring Network and Feedback Information for Adaptive Management of Saltwater Intrusion in Coastal Aquifers , 2014 .

[7]  D. Savić,et al.  A symbolic data-driven technique based on evolutionary polynomial regression , 2006 .

[8]  Jerome H. Friedman Multivariate adaptive regression splines (with discussion) , 1991 .

[9]  David R. Richards,et al.  FEMWATER: A Three-Dimensional Finite Element Computer Model for Simulating Density-Dependent Flow and Transport in Variably Saturated Media. , 1997 .

[10]  V. Cherkassky Fuzzy Inference Systems: A Critical Review , 1998 .

[11]  A. Naji,et al.  Optimal Extraction of Groundwater in Gaza Coastal Aquifer , 2009 .

[12]  Rommel G. Regis,et al.  Surrogate-based pumping optimization of coastal aquifers under limited computational budgets , 2018 .

[13]  Radiša Jovanović,et al.  Ensemble of various neural networks for prediction of heating energy consumption , 2015 .

[14]  Arthur P. Dempster,et al.  A Generalization of Bayesian Inference , 1968, Classic Works of the Dempster-Shafer Theory of Belief Functions.

[15]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[16]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[17]  Bithin Datta,et al.  Coupled simulation‐optimization model for coastal aquifer management using genetic programming‐based ensemble surrogate models and multiple‐realization optimization , 2011 .

[18]  Kalyanmoy Deb,et al.  A Fast Elitist Non-dominated Sorting Genetic Algorithm for Multi-objective Optimisation: NSGA-II , 2000, PPSN.

[19]  Michio Sugeno,et al.  A fuzzy-logic-based approach to qualitative modeling , 1993, IEEE Trans. Fuzzy Syst..

[20]  Özgür Kisi,et al.  Comparison of genetic programming with neuro-fuzzy systems for predicting short-term water table depth fluctuations , 2011, Comput. Geosci..

[21]  Hamed Ketabchi,et al.  Elitist Continuous Ant Colony Optimization Algorithm for Optimal Management of Coastal Aquifers , 2011 .

[22]  R. Schapire The Strength of Weak Learnability , 1990, Machine Learning.

[23]  B. Silverman,et al.  Nonparametric Regression and Generalized Linear Models: A roughness penalty approach , 1993 .

[24]  Glenn Shafer,et al.  A Mathematical Theory of Evidence , 2020, A Mathematical Theory of Evidence.

[25]  A. Mantoglou,et al.  Optimal design of pumping networks in coastal aquifers using sharp interface models , 2008 .

[26]  Salvador Pintos,et al.  An Optimization Methodology of Alkaline-Surfactant-Polymer Flooding Processes Using Field Scale Numerical Simulation and Multiple Surrogates , 2004 .

[27]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[28]  Bithin Datta,et al.  ANN-GA-Based Model for Multiple Objective Management of Coastal Aquifers , 2009 .

[29]  Maria P Papadopoulou,et al.  Computational benefits using artificial intelligent methodologies for the solution of an environmental design problem: saltwater intrusion. , 2010, Water science and technology : a journal of the International Association on Water Pollution Research.

[30]  菅野 道夫,et al.  Industrial applications of fuzzy control , 1985 .

[31]  Terry A. Howell,et al.  Gaussian process models for reference ET estimation from alternative meteorological data sources , 2014 .

[32]  Juliang Jin,et al.  A new approach to water resources system assessment — set pair analysis method , 2009 .

[33]  M. Razack,et al.  Modeling daily discharge responses of a large karstic aquifer using soft computing methods: Artificial neural network and neuro-fuzzy , 2010 .

[34]  Bithin Datta,et al.  Optimal Management of Coastal Aquifers Using Linked Simulation Optimization Approach , 2005 .

[35]  Bithin Datta,et al.  Stochastic and Robust Multi-Objective Optimal Management of Pumping from Coastal Aquifers Under Parameter Uncertainty , 2014, Water Resources Management.

[36]  B. Datta,et al.  Saltwater Intrusion Management of Coastal Aquifers. I: Linked Simulation-Optimization , 2009 .

[37]  E. Mizutani,et al.  Neuro-Fuzzy and Soft Computing-A Computational Approach to Learning and Machine Intelligence [Book Review] , 1997, IEEE Transactions on Automatic Control.

[38]  Orazio Giustolisi,et al.  Comparison of three data-driven techniques in modelling the evapotranspiration process. , 2010 .

[39]  Hamed Ketabchi,et al.  Evolutionary algorithms for the optimal management of coastal groundwater: A comparative study toward future challenges , 2015 .

[40]  William W. Hsieh,et al.  Applying Neural Network Models to Prediction and Data Analysis in Meteorology and Oceanography. , 1998 .

[41]  Duc Truong Pham,et al.  Benchmarking and comparison of nature-inspired population-based continuous optimisation algorithms , 2014, Soft Comput..

[42]  Vasileios Christelis,et al.  Pumping Optimization of Coastal Aquifers Assisted by Adaptive Metamodelling Methods and Radial Basis Functions , 2016, Water Resources Management.

[43]  Bithin Datta,et al.  Comparative Evaluation of Genetic Programming and Neural Network as Potential Surrogate Models for Coastal Aquifer Management , 2011 .

[44]  R. Farmani,et al.  A surrogate model for simulation–optimization of aquifer systems subjected to seawater intrusion , 2015 .

[45]  B. Datta,et al.  Review: Simulation-optimization models for the management and monitoring of coastal aquifers , 2015, Hydrogeology Journal.

[46]  Okyay Kaynak,et al.  Computational Intelligence: Soft Computing and Fuzzy-Neuro Integration with Applications , 1998, NATO ASI Series.

[47]  Bryan A. Tolson,et al.  Review of surrogate modeling in water resources , 2012 .

[48]  Jyh-Shing Roger Jang,et al.  ANFIS: adaptive-network-based fuzzy inference system , 1993, IEEE Trans. Syst. Man Cybern..

[49]  Amin Elshorbagy,et al.  Toward improving the reliability of hydrologic prediction: Model structure uncertainty and its quantification using ensemble‐based genetic programming framework , 2008 .

[50]  Bithin Datta,et al.  A surrogate based multi-objective management model to control saltwater intrusion in multi-layered coastal aquifer systems , 2017 .

[51]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[52]  Alyssa M. Dausman,et al.  SEAWAT Version 4: A Computer Program for Simulation of Multi-Species Solute and Heat Transport , 2008 .

[53]  Naif Alajlan,et al.  Improved Estimation of Water Chlorophyll Concentration With Semisupervised Gaussian Process Regression , 2012, IEEE Transactions on Geoscience and Remote Sensing.

[54]  B. Datta,et al.  Genetic algorithm tuned fuzzy inference system to evolve optimal groundwater extraction strategies to control saltwater intrusion in multi-layered coastal aquifers under parameter uncertainty , 2017, Modeling Earth Systems and Environment.

[55]  Oscar Castillo,et al.  A new approach for time series prediction using ensembles of ANFIS models , 2012, Expert Syst. Appl..

[56]  B. Silverman,et al.  Nonparametric Regression and Generalized Linear Models: A roughness penalty approach , 1993 .

[57]  Shiv O. Prasher,et al.  APPLICATION OF MARS IN SIMULATING PESTICIDE CONCENTRATIONS IN SOIL , 2006 .

[58]  L. L. Rogers,et al.  Optimal field-scale groundwater remediation using neural networks and the genetic algorithm. , 1995, Environmental science & technology.

[59]  Hamed Ketabchi,et al.  Assessment of a parallel evolutionary optimization approach for efficient management of coastal aquifers , 2015, Environ. Model. Softw..

[60]  Jared D. McLaughlin Parallel Processing of Reactive Transport Models Using OpenMP , 2008 .

[61]  John R. Koza,et al.  Genetic programming as a means for programming computers by natural selection , 1994 .

[62]  Robert W. Blanning,et al.  The construction and implementation of metamodels , 1975 .

[63]  Edzer J. Pebesma,et al.  Latin Hypercube Sampling of Gaussian Random Fields , 1999, Technometrics.

[64]  David H. Wolpert,et al.  No free lunch theorems for optimization , 1997, IEEE Trans. Evol. Comput..

[65]  Thomas G. Dietterich An Experimental Comparison of Three Methods for Constructing Ensembles of Decision Trees: Bagging, Boosting, and Randomization , 2000, Machine Learning.

[66]  J. Friedman Multivariate adaptive regression splines , 1990 .

[67]  T. Ouarda,et al.  Flood frequency analysis at ungauged sites using artificial neural networks in canonical correlation analysis physiographic space , 2007 .

[68]  Günter Rudolph,et al.  Parallel Approaches for Multiobjective Optimization , 2008, Multiobjective Optimization.

[69]  R. Farmani,et al.  A simulation‐optimization model to control seawater intrusion in coastal aquifers using abstraction/recharge wells , 2012 .

[70]  Alexander I. J. Forrester,et al.  Engineering design applications of surrogate-assisted optimization techniques , 2014 .

[71]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[72]  Mustafa M. Aral,et al.  Multi-objective optimization of pumping rates and well placement in coastal aquifers , 2004 .

[73]  Fan Zhang,et al.  Application of a hybrid MPI/OpenMP approach for parallel groundwater model calibration using multi-core computers , 2010, Comput. Geosci..

[74]  George Kourakos,et al.  Simulation and Multi-Objective Management of Coastal Aquifers in Semi-Arid Regions , 2011 .

[75]  R. Haftka,et al.  Ensemble of surrogates , 2007 .

[76]  Mohammad Rajabi,et al.  Uncertainty-based simulation-optimization using Gaussian process emulation: Application to coastal groundwater management , 2017 .

[77]  Bithin Datta,et al.  Multivariate Adaptive Regression Spline Ensembles for Management of Multilayered Coastal Aquifers , 2017 .

[78]  I-Tung Yang,et al.  Parallel Computing Platform for Multiobjective Simulation Optimization of Bridge Maintenance Planning , 2012 .

[79]  Andy J. Keane,et al.  Constructing a Surrogate , 2008 .

[80]  Orazio Giustolisi,et al.  An evolutionary‐based data mining technique for assessment of civil engineering systems , 2008 .

[81]  Jin Lin,et al.  A comparative research of different ensemble surrogate models based on set pair analysis for the DNAPL-contaminated aquifer remediation strategy optimization. , 2017, Journal of contaminant hydrology.

[82]  Barbara S. Minsker,et al.  Optimal groundwater remediation design using an Adaptive Neural Network Genetic Algorithm , 2006 .

[83]  Bithin Datta,et al.  Optimal combined operation of production and barrier wells for the control of saltwater intrusion in coastal groundwater well fields , 2011 .

[84]  Bithin Datta,et al.  Fuzzy C-Mean Clustering Based Inference System for Saltwater Intrusion Processes Prediction in Coastal Aquifers , 2016, Water Resources Management.

[85]  H. Redkey,et al.  A new approach. , 1967, Rehabilitation record.

[86]  Chuntian Cheng,et al.  A comparison of performance of several artificial intelligence , 2009 .

[87]  Cécile Piret,et al.  Analytical and numerical advances in radial basis functions , 2007 .

[88]  Michio Sugeno,et al.  Fuzzy identification of systems and its applications to modeling and control , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[89]  R. Haftka,et al.  Multiple surrogates: how cross-validation errors can help us to obtain the best predictor , 2009 .

[90]  Alexander Y. Sun,et al.  Monthly streamflow forecasting using Gaussian Process Regression , 2014 .

[91]  Mohammad Rajabi,et al.  Optimal Management of a Freshwater Lens in a Small Island Using Surrogate Models and Evolutionary Algorithms , 2014 .

[92]  Ana I. González Acuña An experimental comparison of three methods for constructing ensembles of decision trees: Bagging, Boosting, and Randomization , 2012 .

[93]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[94]  Ioannis K. Nikolos,et al.  Application of linear programming and differential evolutionary optimization methodologies for the solution of coastal subsurface water management problems subject to environmental criteria , 2007 .

[95]  A. Javadi,et al.  A Cost-Effective Method to Control Seawater Intrusion in Coastal Aquifers , 2011 .

[96]  Anthony T. C. Goh,et al.  Multivariate adaptive regression splines and neural network models for prediction of pile drivability , 2016 .

[97]  Nelishia Pillay,et al.  An investigation into the use of genetic programming for the induction of novice procedural programming solution algorithms in intelligent programming tutors. , 2004 .

[98]  George Kourakos,et al.  Pumping optimization of coastal aquifers based on evolutionary algorithms and surrogate modular neural network models , 2009 .