Finding the number of natural clusters in groundwater data sets using the concept of equivalence class

Abstract Cluster analysis has numerous scientific and practical applications. This paper presents a computer program to find an adequate (natural) number of clusters and to isolate anomalous samples in a data set. The program incorporates an algorithm that is based on the mathematical concept of equivalence class and uses the framework of the graph theory to identify equivalence classes in multivariate data bases. This type of clustering algorithm is particularly useful when one is dealing with groundwater data sets, because anomalies are frequent in these sets, and because the number of groups that are present often are impossible to estimate; the number will depend on the combined effect of many factors, including geology, morphology, climate and pollution. As an example of the utility of this program, a set of groundwater samples is clustered, and the average chemistry of nine identified equivalence classes is related to weathering reactions of plagioclase in a Portuguese granitoid area.

[1]  C. E. Marshall,et al.  The use of heavy minerals in studies of the origin and development of soils , 1945 .

[2]  D. H. Khan STUDIES ON TRANSLOCATION OF CHEMICAL CONSTITUENTS IN SOME RED-BROWN SOILS, TERRA ROSSAS, AND RENDZINAS, USING ZIRCONIUM AS A WEATHERING INDEX , 1959 .

[3]  J. Edward Jackson,et al.  A User's Guide to Principal Components. , 1991 .

[4]  David J. Chittleborough,et al.  Genesis of a Xeralf on feldspathic sandstone, South Australia , 1989 .

[5]  M. Garrels Robert,et al.  Genesis of some ground waters from igneous rocks , 1967 .

[6]  R. Howie,et al.  Rock-forming minerals , 1962 .

[7]  Janet G. Hering,et al.  Principles and Applications of Aquatic Chemistry , 1993 .

[8]  Peter J. Rousseeuw,et al.  Clustering by means of medoids , 1987 .

[9]  C. C. Nikiforoff,et al.  GENESIS OF A CLAYPAN SOIL: II , 1943 .

[10]  J. Bezdek Cluster Validity with Fuzzy Sets , 1973 .

[11]  G. M. Young,et al.  Early Proterozoic climates and plate motions inferred from major element chemistry of lutites , 1982, Nature.

[12]  E. Forgy,et al.  Cluster analysis of multivariate data : efficiency versus interpretability of classifications , 1965 .

[13]  Antony Mellor,et al.  Variations in weathering processes and rates with time in a chronosequence of soils from Glen Feshie, Scotland , 1993 .

[14]  George H. Brimhall,et al.  Quantitative geochemical approach to pedogenesis: importance of parent material reduction, volumetric expansion, and eolian influx in lateritization , 1991 .

[15]  F. J. Pettijohn,et al.  Persistence of Heavy Minerals and Geologic Age , 1941, The Journal of Geology.

[16]  F. Pacheco,et al.  Weathering, Biomass Production and Groundwater Chemistry in an Area of Dominant Anthropogenic Influence, the Chaves-Vila Pouca de Aguiar Region, North of Portugal , 1999 .

[17]  Fernando António Leal Pacheco,et al.  Contributions of Water-Rock Interactions to the Composition of Groundwater in Areas with a Sizeable Anthropogenic Input: A Case Study of the Waters of the Fundão Area, Central Portugal , 1996 .

[18]  S. S. Goldich A Study in Rock-Weathering , 1938, The Journal of Geology.

[19]  David J. Chittleborough,et al.  Indices of weathering for soils and palaeosols formed on silicate rocks , 1991 .

[20]  M. L. Jackson,et al.  Chemical Weathering of Minerals in Soils , 1953 .

[21]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[22]  Brian Everitt,et al.  Cluster analysis , 1974 .

[23]  K Norrish,et al.  An accurate X-ray spectrographic method for the analysis of a wide range of geological samples , 1969 .

[24]  Rob Fitzpatrick,et al.  Field monitoring of solute and colloid mobility in a gneissic sub-catchment, South Australia , 1995 .

[25]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[26]  Y. Tardy,et al.  Characterization of the principal weathering types by the geochemistry of waters from some European and African crystalline massifs , 1971 .

[27]  P. Walker,et al.  Soil profile development in some alluvial deposits of eastern New South Wales , 1976 .

[28]  E. Kay,et al.  Graph Theory. An Algorithmic Approach , 1975 .

[29]  John A. Hartigan,et al.  Clustering Algorithms , 1975 .

[30]  Fernando António Leal Pacheco,et al.  Application of Correspondence Analysis in the Assessment of Groundwater Chemistry , 1998 .

[31]  Michael J. Singer,et al.  Field and laboratory procedures used in a soil chronosequence study , 1986 .

[32]  R. Brewer Fabric and mineral analysis of soils , 1980 .

[33]  L. Wilding,et al.  Elemental Distribution in the Light Mineral Isolate of Soil Separates1 , 1978 .

[34]  Albert Perry Brigham,et al.  Rocks, Rock-Weathering and Soils , 1897 .

[35]  C.R.M. Butt,et al.  Regolith exploration geochemistry in tropical and subtropical terrains , 1992 .