Observation-level interaction with statistical models for visual analytics

In visual analytics, sensemaking is facilitated through interactive visual exploration of data. Throughout this dynamic process, users combine their domain knowledge with the dataset to create insight. Therefore, visual analytic tools exist that aid sensemaking by providing various interaction techniques that focus on allowing users to change the visual representation through adjusting parameters of the underlying statistical model. However, we postulate that the process of sensemaking is not focused on a series of parameter adjustments, but instead, a series of perceived connections and patterns within the data. Thus, how can models for visual analytic tools be designed, so that users can express their reasoning on observations (the data), instead of directly on the model or tunable parameters? Observation level (and thus “observation”) in this paper refers to the data points within a visualization. In this paper, we explore two possible observation-level interactions, namely exploratory and expressive, within the context of three statistical methods, Probabilistic Principal Component Analysis (PPCA), Multidimensional Scaling (MDS), and Generative Topographic Mapping (GTM). We discuss the importance of these two types of observation level interactions, in terms of how they occur within the sensemaking process. Further, we present use cases for GTM, MDS, and PPCA, illustrating how observation level interaction can be incorporated into visual analytic tools.

[1]  Alfredo Vellido,et al.  Kernel generative topographic mapping , 2010, ESANN.

[2]  Michael A. West,et al.  Bayesian Forecasting and Dynamic Models (Springer Series in Statistics) , 2013 .

[3]  Alex Endert,et al.  A Bi-directional Visualization Pipeline that Enables Visual to Parametric Interation ( V 2 PI ) , 2010 .

[4]  Ian T. Jolliffe,et al.  Principal Component Analysis , 2002, International Encyclopedia of Statistical Science.

[5]  Cristina Conde,et al.  PCA vs low resolution images in face verification , 2003, 12th International Conference on Image Analysis and Processing, 2003.Proceedings..

[6]  Qian Du,et al.  Low-Complexity Principal Component Analysis for Hyperspectral Image Compression , 2008, Int. J. High Perform. Comput. Appl..

[7]  Alfredo Vellido Alcacena,et al.  Unfolding the Manifold in Generative Topographic Mapping , 2008 .

[8]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[9]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[10]  Forrest W. Young,et al.  Introduction to Multidimensional Scaling: Theory, Methods, and Applications , 1981 .

[11]  Ata Kabán A scalable generative topographic mapping for sparse data sequences , 2005, International Conference on Information Technology: Coding and Computing (ITCC'05) - Volume II.

[12]  Ingo Hotz,et al.  iPCA : An Interactive System for PCA-based Visual Analytics , 2008 .

[13]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[14]  Joost Broekens,et al.  Chapter 3 Object-Centered Interactive Multi-Dimensional Scaling : Ask the Expert , 2006 .

[15]  Karl Pearson F.R.S. LIII. On lines and planes of closest fit to systems of points in space , 1901 .

[16]  Samuel Kaski,et al.  Self organization of a massive document collection , 2000, IEEE Trans. Neural Networks Learn. Syst..

[17]  Christopher M. Bishop,et al.  GTM: The Generative Topographic Mapping , 1998, Neural Computation.

[18]  David J. Spiegelhalter,et al.  Sequential updating of conditional probabilities on directed graphical structures , 1990, Networks.

[19]  Ioannis G. Tollis,et al.  Algorithms for Drawing Graphs: an Annotated Bibliography , 1988, Comput. Geom..

[20]  John T. Stasko,et al.  Dust & Magnet: Multivariate Information Visualization Using a Magnet Metaphor , 2005, Inf. Vis..

[21]  Vijayan K. Asari,et al.  An improved face recognition technique based on modular PCA approach , 2004, Pattern Recognit. Lett..

[22]  Christopher Andrews,et al.  Space to think: large high-resolution displays for sensemaking , 2010, CHI.

[23]  Anatoli Torokhti,et al.  Towards theory of generic Principal Component Analysis , 2009, J. Multivar. Anal..

[24]  Ron Kimmel,et al.  Texture Mapping Using Surface Flattening via Multidimensional Scaling , 2002, IEEE Trans. Vis. Comput. Graph..

[25]  A. Buja,et al.  Local Multidimensional Scaling for Nonlinear Dimension Reduction, Graph Drawing, and Proximity Analysis , 2009 .

[26]  Deborah Lynn Guber,et al.  Getting What You Pay For The Debate Over Equity in Public School Expenditures , 2011 .

[27]  I.S. Bajwa,et al.  PCA based image classification of single-layered cloud types , 2005, Proceedings of the IEEE Symposium on Emerging Technologies, 2005..

[28]  Deborah F. Swayne,et al.  Data Visualization With Multidimensional Scaling , 2008 .

[29]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[30]  Michael E. Tipping,et al.  Probabilistic Principal Component Analysis , 1999 .

[31]  Deborah F. Swayne,et al.  Interactive Data Visualization with Multidimensional Scaling , 2022 .