Self-organizing map clustering technique for ANN-based spatiotemporal modeling of groundwater quality parameters

The present study integrates co-kriging as spatial estimator and self-organizing map (SOM) as clustering technique to identify spatially homogeneous clusters of groundwater quality data and to choose the most effective input data for feed-forward neural network (FFNN) model to simulate electrical conductivity (EC) and total dissolved solids (TDS) of groundwater. The methodology is presented in three stages. In the first stage, geostatistics approach of co-kriging is used to estimate groundwater quality parameters at locations where the groundwater levels are measured. In stage two, SOM clustering technique is used to identify spatially homogeneous clusters of groundwater quality data. The dominant input data, selected by spatial clustering and mutual information are then imposed into the FFNN model for one-step-ahead predictions of groundwater quality parameters at stage three. The performance of the newly proposed model is compared to conventional linear forecasting method of multiple linear regression (MLR). The results suggest that the proposed model decreases dimensionality of the input layer and consequently the complexity of the FFNN model with acceptable efficiency in spatiotemporal simulation of groundwater quality parameters. The application of FFNN for modeling EC and TDS parameters increases the accuracy of predictions respectively up to 84.5% and 17% on average with regard to the MLR model.

[1]  A. Dassargues,et al.  Exploratory data analysis and clustering of multivariate spatial hydrogeological data by means of GEO 3 DSOM , a variant of Kohonen ’ s Self-Organizing Map , 2006 .

[2]  Fernando Bação,et al.  Exploratory data analysis and clustering of multivariate spatial hydrogeological data by means of GEO3DSOM, a variant of Kohonen's Self-Organizing Map , 2006 .

[3]  Sarel van Vuuren,et al.  Relevance of time-frequency features for phonetic and speaker-channel classification , 2000, Speech Commun..

[4]  S. Arabia,et al.  Multivariate statistical analysis of groundwater quality in Wadi Ranyah, Saudi Arabia. , 2010 .

[5]  Mike Rees,et al.  5. Statistics for Spatial Data , 1993 .

[6]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[7]  Ozgur Kisi,et al.  Applications of hybrid wavelet–Artificial Intelligence models in hydrology: A review , 2014 .

[8]  D. Legates,et al.  Evaluating the use of “goodness‐of‐fit” Measures in hydrologic and hydroclimatic model validation , 1999 .

[9]  Bhoop Singh,et al.  Artificial neural network model as a potential alternative for groundwater salinity forecasting , 2011 .

[10]  Y. Pachepsky,et al.  Prediction of contamination potential of groundwater arsenic in Cambodia, Laos, and Thailand using artificial neural network. , 2011, Water research.

[11]  null null,et al.  Artificial Neural Networks in Hydrology. II: Hydrologic Applications , 2000 .

[12]  A. Panagopoulos,et al.  Multivariate Statistical Analysis in the Assessment of Hydrochemistry of the Northern Korinthia Prefecture Alluvial Aquifer System (Peloponnese, Greece) , 2000 .

[13]  null null,et al.  Review of Geostatistics in Geohydrology. II: Applications , 1990 .

[14]  David G. Kinniburgh,et al.  Geostatistical analysis of arsenic concentration in groundwater in Bangladesh using disjunctive kriging , 2003 .

[15]  T. Kowalczyk,et al.  Quantitative and qualitative assessment of agricultural water resources under variable climatic conditions of Silesian Lowlands (Southwest Poland) , 2014 .

[16]  Robert Haining,et al.  Statistics for spatial data: by Noel Cressie, 1991, John Wiley & Sons, New York, 900 p., ISBN 0-471-84336-9, US $89.95 , 1993 .

[17]  Teuvo Kohonen,et al.  The self-organizing map , 1990 .

[18]  Frank T.-C. Tsai,et al.  Supervised committee machine with artificial intelligence for prediction of fluoride concentration , 2013 .

[19]  Vahid Nourani,et al.  Conjunction of SOM-based feature extraction method and hybrid wavelet-ANN approach for rainfall–runoff modeling , 2013 .

[20]  FRANCISCO SÁNCHEZ-MARTOS,et al.  Assessment of Groundwater Quality by Means of Self-Organizing Maps: Application in a Semiarid Area , 2002, Environmental management.

[21]  E. Barca,et al.  Spatial evaluation of the risk of groundwater quality degradation. A comparison between disjunctive kriging and geostatistical simulation , 2008, Environmental monitoring and assessment.

[22]  Mohammad H. Aminfar,et al.  A combined neural-wavelet model for prediction of Ligvanchai watershed precipitation , 2009, Eng. Appl. Artif. Intell..

[23]  Gwo-Fong Lin,et al.  Time series forecasting by combining the radial basis function network and the self‐organizing map , 2005 .

[24]  Kui Chang,et al.  Water quality comprehensive evaluation method for large water distribution network based on clustering analysis , 2011 .

[25]  Turgay Partal,et al.  Estimation and forecasting of daily suspended sediment data using wavelet–neural networks , 2008 .

[26]  G. Panagopoulos,et al.  The use of multicomponent statistical analysis in hydrogeological environmental research. , 2004, Water research.

[27]  Hong-zhen Feng,et al.  Early diversification of ordovician graptolites in Jiangnan Slope, South China , 2010 .

[28]  Husam Baalousha,et al.  Assessment of a groundwater quality monitoring network using vulnerability mapping and geostatistics: a case study from Heretaunga Plains, New Zealand. , 2010 .

[29]  D. Myers Matrix formulation of co-kriging , 1982 .

[30]  D. S. K. Karunasingha,et al.  A simple clustering technique to extract subsets of data for function approximation , 2015 .

[31]  Vahid Nourani,et al.  Semi‐distributed flood runoff model at the subcontinental scale for southwestern Iran , 2007 .

[32]  Sheng-Tun Li,et al.  Clustering spatial-temporal precipitation data using wavelet transform and self-organizing map neural network , 2010 .

[33]  H. Maier,et al.  The Use of Artificial Neural Networks for the Prediction of Water Quality Parameters , 1996 .

[34]  Ji-hoon Kim,et al.  Application of cluster analysis for the hydrogeochemical factors of saline groundwater in Kimje, Korea , 2003 .

[35]  Jennifer Haegele,et al.  Conceptual Modeling of Networked Organizations: The Case of Aum Shinrikyo , 2013 .

[36]  Henry C. W. Lau,et al.  A fuzzy multi-criteria decision support procedure for enhancing information delivery in extended enterprise networks , 2003 .

[37]  Vahid Nourani,et al.  Integrated artificial neural network for spatiotemporal modeling of rainfall-runoff-sediment processes. , 2010 .

[38]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[39]  G. Matheron Principles of geostatistics , 1963 .

[40]  Yoon-Seok Timothy Hong,et al.  Intelligent characterisation and diagnosis of the groundwater quality in an urban fractured-rock aquifer using an artificial neural network , 2001 .

[41]  Gwo-Fong Lin,et al.  An improved neural network approach to the determination of aquifer parameters , 2006 .

[42]  S. Praveena,et al.  Statistical approaches and hydrochemical modelling of groundwater system in a small tropical island , 2012 .