Visual Analytics of Heterogeneous Data Using Hypergraph Learning

In real-world learning tasks such as classification, graph-based models are commonly used to fuse information distributed across diverse data sources, which can be heterogeneous, redundant, and incomplete. These models represent the relations in different datasets as pairwise links. However, pairwise links cannot capture high-order relations that connect more than two objects (e.g., in public health datasets, several patient groups admitted to the same hospital in 2014). In this article, we propose a visual analytics approach to classification on heterogeneous datasets based on the hypergraph model. A hypergraph extends the traditional graph: a hyperedge can connect any number of vertices rather than exactly two. We model the various high-order relations in heterogeneous datasets as hyperedges and fuse the datasets into a unified hypergraph structure. We then use a hypergraph learning algorithm to predict missing labels in the datasets. To allow users to inject their domain knowledge into the model-learning process, we augment the traditional learning algorithm in several ways. We also propose a set of visualizations that enable the user to construct the hypergraph structure and adjust the parameters of the learning model interactively during the analysis. We demonstrate the capability of our approach through two real-world cases.
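
The idea of modeling high-order relations as weighted hyperedges and then predicting missing labels can be illustrated with a small sketch. The abstract does not state which hypergraph learning formulation is used or how it is augmented, so the code below assumes a common normalized label-propagation scheme for transductive learning on hypergraphs, iterating F ← αΘF + (1 − α)Y with Θ = D_v^{-1/2} H W D_e^{-1} Hᵀ D_v^{-1/2}. The function name, the toy data, the hyperedge weights, and the value of `alpha` are all hypothetical and chosen for illustration only.

```python
from math import sqrt


def hypergraph_label_propagation(n_vertices, hyperedges, weights, labels,
                                 n_classes, alpha=0.9, n_iter=200):
    """Predict a class for every vertex of a weighted hypergraph.

    hyperedges: list of vertex-index lists (each hyperedge may join any
                number of vertices)
    weights:    one positive weight per hyperedge (an assumption here; a
                user could tune these to encode domain knowledge)
    labels:     dict {vertex: class} for the vertices whose label is known
    """
    # Vertex degree d(v): sum of the weights of the hyperedges containing v.
    d_v = [0.0] * n_vertices
    for edge, w in zip(hyperedges, weights):
        for v in edge:
            d_v[v] += w

    # Initial label matrix Y: one-hot rows for labeled vertices, zeros otherwise.
    Y = [[0.0] * n_classes for _ in range(n_vertices)]
    for v, c in labels.items():
        Y[v][c] = 1.0

    F = [row[:] for row in Y]
    for _ in range(n_iter):
        # Start from the (1 - alpha) * Y term, then add alpha * Theta * F.
        new_F = [[(1.0 - alpha) * y for y in row] for row in Y]
        for edge, w in zip(hyperedges, weights):
            for c in range(n_classes):
                # Degree-normalized average score of the hyperedge's members.
                s = w * sum(F[u][c] / sqrt(d_v[u]) for u in edge) / len(edge)
                for v in edge:
                    new_F[v][c] += alpha * s / sqrt(d_v[v])
        F = new_F

    # Assign each vertex the class with the highest score
    # (a vertex with all-zero scores falls back to class 0).
    return [max(range(n_classes), key=lambda c: F[v][c])
            for v in range(n_vertices)]


# Toy example: six objects, two strong group relations and one weak link.
hyperedges = [[0, 1, 2], [3, 4, 5], [2, 3]]
weights = [1.0, 1.0, 0.2]
known = {0: 0, 5: 1}  # only two objects carry a label
predicted = hypergraph_label_propagation(6, hyperedges, weights, known, n_classes=2)
print(predicted)  # -> [0, 0, 0, 1, 1, 1]
```

In this toy setup, lowering the weight of the hyperedge `[2, 3]` weakens the coupling between the two groups, which is one simple way a user-adjusted parameter could influence the predicted labels.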
