A Visual Analytics Approach for Correlation, Classification, and Regression Analysis

New approaches that combine the strengths of humans and machines are necessary to equip analysts with the proper tools for exploring today's increasing complex, multivariate data sets. In this paper, a novel visual data mining framework, called the Multidimensional Data eXplorer (MDX), is described that addresses the challenges of today's data by combining automated statistical analytics with a highly interactive parallel coordinates based canvas. In addition to several intuitive interaction capabilities, this framework offers a rich set of graphical statistical indicators, interactive regression analysis, visual correlation mining, automated axis arrangements and filtering, and data classification techniques. The current work provides a detailed description of the system as well as a discussion of key design aspects and critical feedback from domain experts.

[1]  Peter Pirolli,et al.  Information Foraging , 2009, Encyclopedia of Database Systems.

[2]  Colin Ware,et al.  Information Visualization: Perception for Design , 2000 .

[3]  E. Wegman Hyperdimensional Data Analysis Using Parallel Coordinates , 1990 .

[4]  Helwig Hauser,et al.  Angular brushing of extended parallel coordinates , 2002, IEEE Symposium on Information Visualization, 2002. INFOVIS 2002..

[5]  Ping Guo,et al.  Visual Analysis of the Air Pollution Problem in Hong Kong , 2007, IEEE Transactions on Visualization and Computer Graphics.

[6]  James T. Enns,et al.  Perceptually based brush strokes for nonphotorealistic visualization , 2004, TOGS.

[7]  Harri Siirtola Direct manipulation of parallel coordinates , 2000, CHI Extended Abstracts.

[8]  Joseph L. Mundy,et al.  Change Detection , 2014, Computer Vision, A Reference Guide.

[9]  Robert L. Grossman,et al.  High-Dimensional Visual Analytics: Interactive Exploration Guided by Pairwise Views of Point Distributions , 2006, IEEE Transactions on Visualization and Computer Graphics.

[10]  R. H. Myers,et al.  Probability and Statistics for Engineers and Scientists , 1978 .

[11]  Patrick J. Fitzpatrick,et al.  Understanding and Forecasting Tropical Cyclone Intensity Change with the Typhoon Intensity Prediction Scheme (TIPS) , 1997 .

[12]  T. J. Jankun-Kelly,et al.  Guided analysis of hurricane trends using statistical processes integrated with interactive parallel coordinates , 2009, 2009 IEEE Symposium on Visual Analytics Science and Technology.

[13]  Matthew O. Ward,et al.  Hierarchical parallel coordinates for exploration of large datasets , 1999, Proceedings Visualization '99 (Cat. No.99CB37067).

[14]  Ben Shneiderman,et al.  A Rank-by-Feature Framework for Interactive Exploration of Multidimensional Data , 2005, Inf. Vis..

[15]  Ben Shneiderman,et al.  Interface and data architecture for query preview in networked information systems , 1999, TOIS.

[16]  Arthur Karp,et al.  The Elements of Color , 1970 .

[17]  M. Cooper,et al.  Revealing structure within clustered parallel coordinates displays , 2005, IEEE Symposium on Information Visualization, 2005. INFOVIS 2005..

[18]  Kristin A. Cook,et al.  Illuminating the Path: The Research and Development Agenda for Visual Analytics , 2005 .

[19]  Ben Shneiderman,et al.  The eyes have it: a task by data type taxonomy for information visualizations , 1996, Proceedings 1996 IEEE Symposium on Visual Languages.

[20]  M. Braga,et al.  Exploratory Data Analysis , 2018, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..

[21]  Almir Olivette Artero,et al.  Uncovering Clusters in Crowded Parallel Coordinates Visualizations , 2004 .

[22]  Alfred Inselberg,et al.  Parallel Coordinates: Interactive Visualisation for High Dimensions , 2009 .

[23]  T. J. Jankun-Kelly,et al.  An interactive parallel coordinates technique applied to a tropical cyclone climate analysis , 2009, Comput. Geosci..

[24]  Wolfgang Berger,et al.  Quantifying and Comparing Features in High-Dimensional Datasets , 2008, 2008 12th International Conference Information Visualisation.

[25]  T. J. Jankun-Kelly,et al.  Tropical Cyclone Trend Analysis Using Enhanced Parallel Coordinates and Statistical Analytics , 2009 .

[26]  Margo McCall,et al.  IEEE Computer Society , 2019, Encyclopedia of Software Engineering.

[27]  Alfred Inselberg,et al.  The plane with parallel coordinates , 1985, The Visual Computer.

[28]  Jeffrey Heer,et al.  Scented Widgets: Improving Navigation Cues with Embedded Visualizations , 2007, IEEE Transactions on Visualization and Computer Graphics.

[29]  Terry A. Slocum Thematic Cartography and Visualization , 1998 .

[30]  Pak Chung Wong,et al.  30 Years of Multidimensional Multivariate Visualization , 1994, Scientific Visualization.

[31]  Helwig Hauser,et al.  Outlier-Preserving Focus+Context Visualization in Parallel Coordinates , 2006, IEEE Transactions on Visualization and Computer Graphics.