Using Cliques with Higher-order Spectral Embeddings Improves Graph Visualizations

In the simplest setting, graph visualization is the problem of producing a set of two-dimensional coordinates for each node that meaningfully shows connections and latent structure in a graph. Among other uses, having a meaningful layout is often useful to help interpret the results from network science tasks such as community detection and link prediction. There are several existing graph visualization techniques in the literature that are based on spectral methods, graph embeddings, or optimizing graph distances. Despite the large number of methods, it is still often challenging or extremely time consuming to produce meaningful layouts of graphs with hundreds of thousands of vertices. Existing methods often either fail to produce a visualization in a meaningful time window, or produce a layout colorfully called a “hairball”, which does not illustrate any internal structure in the graph. Here, we show that adding higher-order information based on cliques to a classic eigenvector based graph visualization technique enables it to produce meaningful plots of large graphs. We further evaluate these visualizations along a number of graph visualization metrics and we find that it outperforms existing techniques on a metric that uses random walks to measure the local structure. Finally, we show many examples of how our algorithm successfully produces layouts of large networks. Code to reproduce our results is available.

[1]  Matthew A. Hibbs,et al.  Visualization of omics data for systems biology , 2010, Nature Methods.

[2]  David F. Gleich,et al.  Pairwise Link Prediction , 2019, 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[3]  Albert-László Barabási,et al.  Internet: Diameter of the World-Wide Web , 1999, Nature.

[4]  Jure Leskovec,et al.  Higher-order organization of complex networks , 2016, Science.

[5]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[6]  Christos Faloutsos,et al.  Graphs over time: densification laws, shrinking diameters and possible explanations , 2005, KDD '05.

[7]  Jure Leskovec,et al.  Defining and evaluating network communities based on ground-truth , 2012, Knowledge and Information Systems.

[8]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[9]  Stephen G. Kobourov,et al.  Spring Embedders and Force Directed Graph Drawing Algorithms , 2012, ArXiv.

[10]  Marco Rosa,et al.  Layered label propagation: a multiresolution coordinate-free ordering for compressing social networks , 2010, WWW.

[11]  Azadeh Iranmehr,et al.  Trust Management for Semantic Web , 2009, 2009 Second International Conference on Computer and Electrical Engineering.

[12]  Prateek Yadav,et al.  HyperGCN: Hypergraph Convolutional Networks for Semi-Supervised Classification , 2018, ArXiv.

[13]  Kevin W. Boyack,et al.  Dr.L: Distributed Recursive (Graph) Layout , 2007 .

[14]  Alessandro Epasto,et al.  Is a Single Embedding Enough? Learning Node Representations that Capture Multiple Social Contexts , 2019, WWW.

[15]  Sahin Albayrak,et al.  Spectral Analysis of Signed Graphs for Clustering, Prediction and Visualization , 2010, SDM.

[16]  Sebastiano Vigna,et al.  The webgraph framework I: compression techniques , 2004, WWW '04.

[17]  Mathieu Bastian,et al.  Gephi: An Open Source Software for Exploring and Manipulating Networks , 2009, ICWSM.

[18]  Fan Chung,et al.  Spectral Graph Theory , 1996 .

[19]  Edward M Marcotte,et al.  LGL: creating a map of protein function with an algorithm for visualizing very large biological networks. , 2004, Journal of molecular biology.

[20]  Inanç Birol,et al.  Hive plots - rational approach to visualizing networks , 2012, Briefings Bioinform..

[21]  Jian Pei,et al.  Arbitrary-Order Proximity Preserved Network Embedding , 2018, KDD.

[22]  Danny C. Sorensen,et al.  Deflation Techniques for an Implicitly Restarted Arnoldi Iteration , 1996, SIAM J. Matrix Anal. Appl..

[23]  Seshadhri Comandur,et al.  A Fast and Provable Method for Estimating Clique Counts Using Turán's Theorem , 2016, WWW.

[24]  U. Feige,et al.  Spectral Graph Theory , 2015 .

[25]  Ben Y. Zhao,et al.  User interactions in social networks and their implications , 2009, EuroSys '09.

[26]  Matthew Suderman,et al.  Tools for visually exploring biological networks , 2007, Bioinform..

[27]  M. Fiedler Laplacian of graphs and algebraic connectivity , 1989 .

[28]  Mason A. Porter,et al.  Social Structure of Facebook Networks , 2011, ArXiv.

[29]  Darren Edge,et al.  Trimming the Hairball: Edge Cutting Strategies for Making Dense Graphs Usable , 2018, 2018 IEEE International Conference on Big Data (Big Data).