t-SNE Based Visualisation and Clustering of Geological Domain

Identification of geological domains and their boundaries plays a vital role in the estimation of mineral resources. Geologists are often interested in exploratory data analysis and visualization of geological data in two or three dimensions in order to detect quality issues or to generate new hypotheses. We compare PCA and some other linear and non-linear methods with a newer method, t-Distributed Stochastic Neighbor Embedding (t-SNE) for the visualization of large geochemical assay datasets. The t-SNE based reduced dimensions can then be used with clustering algorithm to extract well clustered geological regions using exploration and production datasets. Significant differences between the nonlinear method t-SNE and the state of the art methods were observed in two dimensional target spaces.