A Behavioral Investigation of Dimensionality Reduction

A Behavioral Investigation of Dimensionality Reduction Joshua M. Lewis Laurens van der Maaten Virginia R. de Sa josh@cogsci.ucsd.edu lvdmaaten@gmail.com desa@cogsci.ucsd.edu Department of Cognitive Science Pattern Recognition & Bio-informatics Lab Department of Cognitive Science University of California, San Diego Delft University of Technology University of California, San Diego wide range of techniques? 1 To answer this question, we need to evaluate whether humans are good at evaluating embed- dings. As there is no external authority we can appeal to, this is a daunting task. However, it is relatively easy to find out whether human data analysts are at least consistent in their evaluations, which is the first aim of this study. Consistency, across individuals and across a wide range of inputs, is a rea- sonable prerequisite for evaluation. Beyond investigating whether human data analysts are con- sistent when they evaluate embeddings, the second aim of this study is to investigate what humans are doing when they eval- uate embeddings. Such information could be useful for deter- mining whether humans are appropriate for an evaluation task with a known structure (e.g. if they naturally prefer embed- ding characteristics appropriate to the structure), or for devel- oping techniques that are tailored towards producing results that humans will find helpful (e.g. algorithms that selectively emphasize informative data structure). We can to some extent infer human strategies from the algorithms humans prefer, but we can also investigate those strategies by correlating embed- ding characteristics with human evaluations. Motivated by the two aims described above, we solicit em- bedding quality judgments from both novice and expert sub- jects in an effort to determine whether they are consistent in their ratings, and which embedding characteristics they find appealing. For the novice subjects, we manipulate dataset knowledge—half read a description and see samples from each dataset, and half do not. We hypothesize that provid- ing dataset information will increase consistency, as it should if the evaluative process is principled. The study consists of two experiments. The first presents subjects with a selection of embeddings derived from nine distinct dimensionality re- duction algorithms; the second uses embeddings from a sin- gle algorithm with several different parameter settings for a more controlled comparison between “clustered” and “grad- ual” embeddings. Abstract A cornucopia of dimensionality reduction techniques have emerged over the past decade, leaving data analysts with a wide variety of choices for reducing their data. Means of eval- uating and comparing low-dimensional embeddings useful for visualization, however, are very limited. When proposing a new technique it is common to simply show rival embeddings side-by-side and let human judgment determine which embed- ding is superior. This study investigates whether such human embedding evaluations are reliable, i.e., whether humans tend to agree on the quality of an embedding. We also investigate what types of embedding structures humans appreciate a pri- ori. Our results reveal that, although experts are reasonably consistent in their evaluation of embeddings, novices gener- ally disagree on the quality of an embedding. We discuss the impact of this result on the way dimensionality reduction re- searchers should present their results, and on applicability of dimensionality reduction outside of machine learning. Keywords: dimensionality reduction; unsupervised machine learning; psychophysics Introduction There is an evaluative vacuum in the dimensionality reduc- tion literature. In many other unsupervised machine learn- ing fields, such as density modeling, evaluation may be per- formed by measuring likelihoods of held-out test data. Al- ternatively, in domains such as topic modeling, human com- putation (Ahn, Maurer, McMillen, Abraham, & Blum, 2008) resources such as Amazon’s Mechanical Turk may be em- ployed to exploit the fact that humans are phenoms in evaluat- ing semantic structure (Chang, Boyd-Graber, Gerrish, Wang, & Blei, 2009). Human evaluations have also been used to assess image segmentation techniques (Martin, Fowlkes, Tal, & Malik, 2001). The field of dimensionality reduction, how- ever, lacks a standard evaluation measure (Venna, Peltonen, Nybo, Aidos, & Kaski, 2010), and is not as obvious a target for human intuition. Two or three dimensional embeddings can be visualized as scatter plots, but on what intuitive basis can we judge a 200 to 2-dimensional reduction to be good? In addition, Gestalt effects or simple rotations may bias human evaluations of scatter plots. Nevertheless, with no broadly agreed upon embedding quality measure (though a few have been proposed, see below), human judgment is often explic- itly and implicitly solicited in the literature. The most com- mon form of this solicitation consists of placing a scatter plot of the preferred embedding next to those of rival embeddings and inviting the reader to conclude that the preferred embed- ding is superior (e.g., (Maaten & Hinton, 2008)). If one is interested in applying a dimensionality reduction algorithm to visualize a dataset, is this a valid way to select from the Dimensionality reduction techniques Dimensionality reduction techniques can be subdivided into several categories: linear or non-linear, convex or non- convex, parametric or non-parametric, etc. (Lee & Verley- sen, 2007). Whilst many new techniques have been proposed over the last decade, data analysts still often resort to linear, convex, parametric techniques such as PCA to visualize their 1 Moreover, one should note that dimensionality reduction com- prises only a small part of the “visualization zoo” (Heer, Bostock, & Ogievetsky, 2010).

[1]  Peng Zhang,et al.  Nonlinear Dimensionality Reduction by Locally Linear Inlaying , 2009, IEEE Transactions on Neural Networks.

[2]  Lawrence K. Saul,et al.  Exploratory analysis and visualization of speech and music by locally linear embedding , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  E. Rosch Cognitive reference points , 1975, Cognitive Psychology.

[4]  J. Fleiss Measuring nominal scale agreement among many raters. , 1971 .

[5]  Neil D. Lawrence,et al.  Probabilistic Non-linear Principal Component Analysis with Gaussian Process Latent Variable Models , 2005, J. Mach. Learn. Res..

[6]  Jarkko Venna,et al.  Information Retrieval Perspective to Nonlinear Dimensionality Reduction for Data Visualization , 2010, J. Mach. Learn. Res..

[7]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[8]  John W. Sammon,et al.  A Nonlinear Mapping for Data Structure Analysis , 1969, IEEE Transactions on Computers.

[9]  Kilian Q. Weinberger,et al.  Mapping Uncharted Waters: Exploratory Analysis, Visualization, and Clustering of Oceanographic Data , 2008, 2008 Seventh International Conference on Machine Learning and Applications.

[10]  Jeffrey Heer,et al.  A Tour through the Visualization Zoo , 2010 .

[11]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[12]  Kilian Q. Weinberger,et al.  Graph Laplacian Regularization for Large-Scale Semidefinite Programming , 2006, NIPS.

[13]  Chong Wang,et al.  Reading Tea Leaves: How Humans Interpret Topic Models , 2009, NIPS.

[14]  Laurens van der Maaten,et al.  Learning a Parametric Embedding by Preserving Local Structure , 2009, AISTATS.

[15]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[16]  Jitendra Malik,et al.  A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[17]  H. Lilliefors On the Kolmogorov-Smirnov Test for Normality with Mean and Variance Unknown , 1967 .

[18]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[19]  Heikki Mannila,et al.  Random projection in dimensionality reduction: applications to image and text data , 2001, KDD '01.

[20]  Jarkko Venna,et al.  Visualizing gene interaction graphs with local multidimensional scaling , 2006, ESANN.

[21]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[22]  Michel Verleysen,et al.  Nonlinear Dimensionality Reduction , 2021, Computer Vision.

[23]  Manuel Blum,et al.  reCAPTCHA: Human-Based Character Recognition via Web Security Measures , 2008, Science.

[24]  John W. Tukey,et al.  A Projection Pursuit Algorithm for Exploratory Data Analysis , 1974, IEEE Transactions on Computers.

[25]  Erwin Beck,et al.  Nonlinear dimensionality reduction: Alternative ordination approaches for extracting and visualizing biodiversity patterns in tropical montane forest vegetation data , 2007, Ecol. Informatics.

[26]  Mikhail Belkin,et al.  Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering , 2001, NIPS.

[27]  David H. Wolpert,et al.  The Lack of A Priori Distinctions Between Learning Algorithms , 1996, Neural Computation.