Information Retrieval Perspective to Interactive Data Visualization

Dimensionality reduction for data visualization has recently been formulated as an information retrieval task with a well-defined objective function. The formulation was based on preserving similarity relationships defined by a metric in the input space, and explicitly revealed the need for a tradeoff between avoiding false neighbors and missing neighbors on the low-dimensional display. In the harder case when the metric is not known, the similarity relationships need to come from the user. We formulate interactive visualization as information retrieval under uncertainty about the true similarities, which depend on the user’s tacit knowledge and interests in the data. During the interaction the user points out misses and false positives on the display; based on the feedback the metric is gradually learned and the display converges to visualizing similarity relationships that correspond to the tacit knowledge of the user.

[1]  Samuel Kaski,et al.  Generative Modeling for Maximizing Precision and Recall in Information Visualization , 2011, AISTATS.

[2]  Mikhail Belkin,et al.  Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering , 2001, NIPS.

[3]  Geoffrey E. Hinton,et al.  Stochastic Neighbor Embedding , 2002, NIPS.

[4]  Rong Jin,et al.  Bayesian Active Distance Metric Learning , 2007, UAI.

[5]  Jarkko Venna,et al.  Information Retrieval Perspective to Nonlinear Dimensionality Reduction for Data Visualization , 2010, J. Mach. Learn. Res..

[6]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[7]  Chris North,et al.  Semantic interaction for visual text analytics , 2012, CHI.

[8]  Jarkko Venna,et al.  Comparison of Visualization Methods for an Atlas of Gene Expression Data Sets , 2007, Inf. Vis..

[9]  Daniel Asimov,et al.  The grand tour: a tool for viewing multidimensional data , 1985 .

[10]  Carla E. Brodley,et al.  Dis-function: Learning distance functions interactively , 2012, 2012 IEEE Conference on Visual Analytics Science and Technology (VAST).

[11]  Jarkko Venna,et al.  Nonlinear Dimensionality Reduction as Information Retrieval , 2007, AISTATS.

[12]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[13]  Chris North,et al.  Observation-level interaction with statistical models for visual analytics , 2011, 2011 IEEE Conference on Visual Analytics Science and Technology (VAST).