Toward Perception-Based Evaluation of Clustering Techniques for Visual Analytics

Automatic clustering techniques play a central role in Visual Analytics by helping analysts to discover interesting patterns in high-dimensional data. Evaluating these clustering techniques, however, is difficult due to the lack of universal ground truth. Instead, clustering approaches are usually evaluated based on a subjective visual judgment of low-dimensional scatterplots of different datasets. As clustering is an inherent human-in-the-loop task, we propose a more systematic way of evaluating clustering algorithms based on quantification of human perception of clusters in 2D scatterplots. The core question we are asking is in how far existing clustering techniques align with clusters perceived by humans. To do so, we build on a dataset from a previous study [1], in which 34 human subjects la-beled 1000 synthetic scatterplots in terms of whether they could see one or more than one cluster. Here, we use this dataset to benchmark state-of-the-art clustering techniques in terms of how far they agree with these human judgments. More specifically, we assess 1437 variants of K-means, Gaussian Mixture Models, CLIQUE, DBSCAN, and Agglomerative Clustering techniques on these benchmarks data. We get unexpected results. For instance, CLIQUE and DBSCAN are at best in slight agreement on this basic cluster counting task, while model-agnostic Agglomerative clustering can be up to a substantial agreement with human subjects depending on the variants. We discuss how to extend this perception-based clustering benchmark approach, and how it could lead to the design of perception-based clustering techniques that would better support more trustworthy and explainable models of cluster patterns.

[1]  Michalis Vazirgiannis,et al.  c ○ 2001 Kluwer Academic Publishers. Manufactured in The Netherlands. On Clustering Validation Techniques , 2022 .

[2]  D.M. Mount,et al.  An Efficient k-Means Clustering Algorithm: Analysis and Implementation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Enrico Bertini,et al.  Quality Metrics in High-Dimensional Data Visualization: An Overview and Systematization , 2011, IEEE Transactions on Visualization and Computer Graphics.

[4]  Greg Hamerly,et al.  Learning the k in k-means , 2003, NIPS.

[5]  Mostafa M. Abbas,et al.  ClustMe: A Visual Quality Measure for Ranking Monochrome Scatterplots based on Cluster Patterns , 2019, Comput. Graph. Forum.

[6]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[7]  Margarida G. M. S. Cardoso,et al.  Clustering stability and ground truth: Numerical experiments , 2015, 2015 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K).

[8]  Marcus A. Magnor,et al.  Perception-based visual quality measures , 2011, 2011 IEEE Conference on Visual Analytics Science and Technology (VAST).

[9]  Tamara Munzner,et al.  A Taxonomy of Visual Cluster Separation Factors , 2012, Comput. Graph. Forum.

[10]  Michaël Aupetit,et al.  Sanity check for class-coloring-based evaluation of dimension reduction techniques , 2014, BELIV.

[11]  Daniel A. Keim,et al.  The Role of Uncertainty, Awareness, and Trust in Visual Analytics , 2016, IEEE Transactions on Visualization and Computer Graphics.

[12]  Jean-Daniel Fekete,et al.  A multidimensional brush for scatterplot data analytics , 2014, 2014 IEEE Conference on Visual Analytics Science and Technology (VAST).

[13]  Jean-Daniel Fekete,et al.  A Principled Way of Assessing Visualization Literacy , 2014, IEEE Transactions on Visualization and Computer Graphics.

[14]  Adelin Albert,et al.  Agreement between an isolated rater and a group of raters , 2009 .

[15]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[16]  Tamara Munzner,et al.  Visualizing dimensionally-reduced data: interviews with analysts and a characterization of task sequences , 2014, BELIV.

[17]  Jian Peng,et al.  Variational Inference for Crowdsourcing , 2012, NIPS.

[18]  Ira Assent,et al.  Morpheus: interactive exploration of subspace clustering , 2008, KDD.

[19]  Luis Gustavo Nonato,et al.  Multidimensional Projection for Visual Analytics: Linking Techniques with Distortions, Tasks, and Layout Enrichment , 2019, IEEE Transactions on Visualization and Computer Graphics.

[20]  Bernhard Schölkopf,et al.  A Local Learning Approach for Clustering , 2006, NIPS.

[21]  Wei Chen,et al.  ScatterNet: A Deep Subjective Similarity Model for Visual Analysis of Scatterplots , 2020, IEEE Transactions on Visualization and Computer Graphics.

[22]  Fei Wang,et al.  Linear Time Maximum Margin Clustering , 2010, IEEE Transactions on Neural Networks.

[23]  Dimitrios Gunopulos,et al.  Automatic subspace clustering of high dimensional data for data mining applications , 1998, SIGMOD '98.

[24]  Ulrike von Luxburg,et al.  Clustering Stability: An Overview , 2010, Found. Trends Mach. Learn..

[25]  Gérard Govaert,et al.  Learning topology of a labeled data set with the supervised generative gaussian graph , 2008, ESANN.

[26]  Colin Ware,et al.  Information Visualization: Perception for Design , 2000 .

[27]  Andrew W. Moore,et al.  X-means: Extending K-means with Efficient Estimation of the Number of Clusters , 2000, ICML.

[28]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[29]  Michaël Aupetit,et al.  SepMe: 2002 New visual separation measures , 2016, 2016 IEEE Pacific Visualization Symposium (PacificVis).

[30]  Dorin Comaniciu,et al.  Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[31]  Michaël Aupetit,et al.  Data‐driven Evaluation of Visual Quality Measures , 2015, Comput. Graph. Forum.

[32]  Adrian E. Raftery,et al.  Model-Based Clustering, Discriminant Analysis, and Density Estimation , 2002 .

[33]  Joachim M. Buhmann,et al.  Clustering with the Connectivity Kernel , 2003, NIPS.

[34]  Osmar R. Zaïane,et al.  A General Clustering Agreement Index: For Comparing Disjoint and Overlapping Clusters , 2017, AAAI.