Fuzzy c-means and K-means clustering with genetic algorithm for identification of homogeneous regions of groundwater quality

AbstractIn this study, two different clustering algorithms, fuzzy c-means (FCM) and K-means with genetic algorithm, were used to identify the homogeneous regions in terms of groundwater water quality. For this purpose, data of 14 hydrochemical parameters from 108 wells were sampled in 2016, Golestan province, northeast of Iran. The results showed that the optimal clusters of the K-means and FCM were 5 and 6, respectively. The evaluation of water quality by FCM for drinking uses showed that in terms of total dissolved solid (TDS) and chlorine (Cl) parameters, cluster 3 was in an unfavorable condition. Moreover, according to the K-means algorithm, cluster 1 was in inappropriate condition in terms of the TDS and Cl. Water quality assessment by FCM for agricultural use showed that in general, cluster 3 was not in a good condition, especially for the electrical conductivity (EC) parameter. Also, according to the K-means, in general, cluster 1 had an inappropriate state for the EC and sodium adsorption ratio parameters. Investigating the hydrochemical facies of clusters using the FCM and K-means showed that in the northern half of the Golestan province, most samples are Cl–Na and in the southern half, most of the samples are HCO3–Ca. In general, by comparing the results of clustering algorithms, it was found that the FCM algorithm has better results than the K-means clustering algorithm, mainly due to consideration of uncertainty conditions in determining the class boundary.

[1]  Investigation of hydrogeochemical factors and groundwater quality assessment in Marand Municipality, northwest of Iran: A multivariate statistical approach , 2009 .

[2]  Manish Kumar Goyal,et al.  Identification of Homogeneous Rainfall Regimes in Northeast Region of India using Fuzzy Cluster Analysis , 2014, Water Resources Management.

[3]  J. C. Dunn,et al.  A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters , 1973 .

[4]  T. Satapanajaru,et al.  Spatial Distribution of Cd, Zn and Hg in Groundwater at Rayong Province, Thailand , 2010 .

[5]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[6]  Siripen Wikaisuksakul,et al.  A multi-objective genetic algorithm with fuzzy c-means for automatic data clustering , 2014, Appl. Soft Comput..

[7]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[8]  Ashutosh Kumar Singh,et al.  Deciphering Groundwater Flow Systems in Oasis Valley, Nevada, Using Trace Element Chemistry, Multivariate Statistics, and Geographical Information System , 2000 .

[9]  W. Masamba,et al.  A study of fluoride groundwater occurrence in Nathenje, Lilongwe, Malawi , 2007 .

[10]  Roy E. Williams Statistical Identification of Hydraulic Connections Between the Surface of a Mountain and Internal Mineralized Sources , 1982 .

[11]  Anat Thapinta,et al.  Use of geographic information systems for assessing groundwater pollution potential by pesticides in Central Thailand. , 2003, Environment international.

[12]  K. Srinivasa Raju,et al.  Classification of microwatersheds based on morphological characteristics , 2011 .

[13]  Mahboube Ebrahimi,et al.  Cuckoo optimization algorithm in optimal water allocation and crop planning under various weather conditions (case study: Qazvin plain, Iran) , 2017, Neural Computing and Applications.

[14]  S. Chiotha,et al.  Levels of cadmium, manganese and lead in water and algae; Spirogyra aequinoctialis , 2008 .

[15]  J. Bezdek,et al.  Generalized k -nearest neighbor rules , 1986 .

[16]  Farooq Ahmad,et al.  Delineation of groundwater prospective resources by exploiting geo-spatial decision-making techniques for the Kingdom of Saudi Arabia , 2019, Neural Computing and Applications.

[17]  A. Ramachandra Rao,et al.  Regionalization of watersheds by fuzzy cluster analysis , 2006 .

[18]  K. Pond,et al.  Water Recreation and Disease: Plausibility of Associated Infections: Acute Effects, Sequelae and Mortality , 2005 .

[19]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[20]  C. S. James Analytical Chemistry of Foods , 2014 .

[21]  Witold Pedrycz,et al.  Advances in Fuzzy Clustering and its Applications , 2007 .

[22]  Ganeshbabu Oorkavalan,et al.  Cluster Analysis to Assess Groundwater Quality in Erode District, Tamil Nadu, India , 2016 .

[23]  A. Edet,et al.  Groundwater chemistry and quality of Nigeria: A status review , 2012 .

[24]  Xiaojing Wang,et al.  An Enhanced K-Means Algorithm for Water Quality Analysis of The Haihe River in China , 2015, International journal of environmental research and public health.

[25]  C. Güler,et al.  Delineation of hydrochemical facies distribution in a regional groundwater system by means of fuzzy c‐means clustering , 2004 .

[26]  Robert LIN,et al.  NOTE ON FUZZY SETS , 2014 .

[27]  T. Msagati,et al.  Fuzzy Set Approach–A Tool to Cluster Holy Samples of Groundwater Quality Parameters at Rameswaram, South India , 2013 .

[28]  Keith Turner,et al.  Evaluation of graphical and multivariate statistical methods for classification of water chemistry data , 2002 .

[29]  A. Farooqi,et al.  Toxic fluoride and arsenic contaminated groundwater in the Lahore and Kasur districts, Punjab, Pakistan and possible contaminant sources. , 2007, Environmental pollution.

[30]  V. Tsihrintzis,et al.  A groundwater flow model for water resources management in the Ismarida plain, North Greece , 2007 .

[31]  B. Tutmeza,et al.  Modelling electrical conductivity of groundwater using an adaptive neuro-fuzzy inference system , 2006 .

[32]  C. Akbulut,et al.  Assessment of the impact of anthropogenic activities on the groundwater hydrology and chemistry in Tarsus coastal plain (Mersin, SE Turkey) using fuzzy clustering, multivariate statistics and GIS techniques , 2012 .

[33]  Lobina G. Palamuleni,et al.  Effect of sanitation facilities, domestic solid waste disposal and hygiene practices on water quality in Malawi’s urban poor areas: a case study of South Lunzu Township in the city of Blantyre , 2002 .

[34]  Riccardo Gori,et al.  Towards A New Decision Support System for Design, Management and Operation of Wastewater Treatment Plants for the Reduction of Greenhouse Gases Emission , 2015 .

[35]  J. Bezdek,et al.  FCM: The fuzzy c-means clustering algorithm , 1984 .

[36]  Alain Dassargues,et al.  Groundwater flow modelling of the regional aquifer of the Pampa del Tamarugal, northern Chile , 2007 .

[37]  Costel Sârbu,et al.  Fuzzy hierarchical cross-clustering of data from abandoned mine site contaminated with heavy metals , 2014, Comput. Geosci..