Envisioning knowledge: tightly coupling knowledge analysis and visualization

Analysis and visualization techniques share a symbiotic relationship when it comes to making sense of datasets. For large datasets in particular, coupling data analysis with data visualization is often beneficial. While a wide range of data analysis and visualization techniques exists, it is often difficult to identify which combination of techniques suits which task and provides maximum insight into a dataset. This dissertation introduces, exemplifies, and validates a Data Analysis and Visualization taxonomy, called the 'DA-Vis taxonomy', that guides the selection of complementary analysis and visualization techniques. The DA-Vis taxonomy is validated by demonstrating its utility in developing new visualizations for real-world applications. Further, the taxonomy is applied to systematically describe and classify couplings of data analysis and visualization techniques in prior work. A user study that evaluates the usability of the DA-Vis taxonomy was also conducted and is reported here. The intellectual contributions of this dissertation include a flexible DA-Vis layout schema that can be used to tightly couple complementary data analysis and visualization techniques. This dissertation also presents a visionary computational diagnostic tool developed for the analysis and visualization of clinical data, as well as techniques for generating meaningful knowledge management visualizations from a dataset. It concludes with a discussion of the broader impacts of the DA-Vis taxonomy, the computational diagnostic tool, and knowledge management maps.
