Visual Analytics Towards Big Data

Visual analytics is an important method in big data analysis. Big data visual analytics aims to exploit human cognitive abilities in perceiving visualized information while using the computer's capability for automatic analysis. By combining the strengths of humans and computers through interactive analysis methods and interaction techniques, it can help people understand the information, knowledge, and wisdom behind big data directly and effectively. This article focuses on cognition, visualization, and human-computer interaction. It first analyzes the underlying theories, including cognition theory, information theory, interaction theory, and user interface theory. Based on this analysis, the article discusses the information visualization techniques used in mainstream big data applications: text visualization, network visualization, spatio-temporal visualization, and multi-dimensional visualization. It then reviews the interaction techniques that support visual analytics, including interface metaphors and interaction components, multi-scale/multi-focus/multi-facet interaction techniques, and natural interaction techniques for Post-WIMP interfaces. Finally, it discusses the bottlenecks and technical challenges of big data visual analytics.
