Complex Network Metrics: Can Deep Learning Keep up With Tailor-Made Reference Algorithms?

Complex network metrics are used for ranking the importance of nodes in many different applications, e.g., spreader identification and percolation analysis. Examples for such node metrics include the degree centrality and betweenness centrality. The computation of some of these metrics is computationally expensive, e.g. the computation of betweenness takes $O(N^{3})$ of computation steps in the worst case, where $N$ is the number of nodes. In this study, we investigate the ability of deep learning via graph embedding to predict node centrality metrics in complex networks. Our study reveals that deep learning can identify vital nodes well for several of these metrics. Compared to exact computation with prohibitive costs, deep learning can get significant speedups, which scale up well with the size of the network. Further investigation on betweenness centrality reveals that the accuracy of deep learning is not as good as some tailor-made, tuned approximation algorithms. However, deep learning offers a nice trade-off between runtime and quality with its linear computational complexity regarding the network size, which makes it scalable on large networks and especially for these metrics with expensive computational costs. Our work is the first to explore the ability of deep learning on a wide range of networks metrics, and will hopefully induce future work on improved graph embeddings, tuned for specific network metrics.

[1]  M. Newman,et al.  Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[2]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[3]  Florian Linke,et al.  On the topology of air navigation route systems , 2017 .

[4]  Jure Leskovec,et al.  Inductive Representation Learning on Large Graphs , 2017, NIPS.

[5]  Yizhou Sun,et al.  Learning to Identify High Betweenness Centrality Nodes from Scratch: A Novel Graph Neural Network Approach , 2019, CIKM.

[6]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[7]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[8]  Bin Zhang,et al.  Regulation cooperative control for heterogeneous uncertain chaotic systems with time delay: A synchronization errors estimation framework , 2019, Autom..

[9]  Alessandro Epasto,et al.  Is a Single Embedding Enough? Learning Node Representations that Capture Multiple Social Contexts , 2019, WWW.

[10]  Samy Bengio,et al.  Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks , 2019, KDD.

[11]  Mingzhe Wang,et al.  LINE: Large-scale Information Network Embedding , 2015, WWW.

[12]  Reinhard Lipowsky,et al.  Network Brownian Motion: A New Method to Measure Vertex-Vertex Proximity and to Identify Communities and Subcommunities , 2004, International Conference on Computational Science.

[13]  Jian Li,et al.  NetSMF: Large-Scale Network Embedding as Sparse Matrix Factorization , 2019, WWW.

[14]  Mikhail Belkin,et al.  Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering , 2001, NIPS.

[15]  Christos Faloutsos,et al.  Estimating Node Importance in Knowledge Graphs Using Graph Neural Networks , 2019, KDD.

[16]  Alexander J. Smola,et al.  Distributed large-scale natural graph factorization , 2013, WWW.

[17]  Adriana Iamnitchi,et al.  Identifying high betweenness centrality nodes in large social networks , 2012, Social Network Analysis and Mining.

[18]  Kathleen M. Carley,et al.  k-Centralities: local approximations of global measures based on shortest paths , 2012, WWW.

[19]  Massimiliano Zanin,et al.  A comparative analysis of approaches to network-dismantling , 2018, Scientific Reports.

[20]  David A. Bader,et al.  Approximating Betweenness Centrality , 2007, WAW.

[21]  Leo Katz,et al.  A new status index derived from sociometric analysis , 1953 .

[22]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[23]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[24]  Harold Hotelling,et al.  Simplified calculation of principal components , 1936 .

[25]  Jörn Altmann,et al.  Identifying the effects of co-authorship networks on the performance of scholars: A correlation and regression analysis of performance measures and social network analysis measures , 2011, J. Informetrics.

[26]  Mark E. J. Newman A measure of betweenness centrality based on random walks , 2005, Soc. Networks.

[27]  Matthieu Latapy,et al.  Computing Communities in Large Networks Using Random Walks , 2004, J. Graph Algorithms Appl..

[28]  Falk Schreiber,et al.  Comparison of Centralities for Biological Networks , 2004, German Conference on Bioinformatics.

[29]  Ulrik Brandes,et al.  Centrality Estimation in Large Networks , 2007, Int. J. Bifurc. Chaos.

[30]  Palash Goyal,et al.  Graph Embedding Techniques, Applications, and Performance: A Survey , 2017, Knowl. Based Syst..

[31]  Michele Borassi,et al.  KADABRA is an ADaptive Algorithm for Betweenness via Random Approximation , 2016, ESA.

[32]  Martin Everett,et al.  Ego network betweenness , 2005, Soc. Networks.

[33]  Eli Upfal,et al.  ABRA: Approximating Betweenness Centrality in Static and Dynamic Graphs with Rademacher Averages , 2016, KDD.

[34]  L. Freeman Centrality in social networks conceptual clarification , 1978 .

[35]  Peter Sanders,et al.  Better Approximation of Betweenness Centrality , 2008, ALENEX.

[36]  Shashank Khandelwal,et al.  Exploring biological network structure with clustered random networks , 2009, BMC Bioinformatics.

[37]  Le Song,et al.  2 Common Formulation for Greedy Algorithms on Graphs , 2018 .

[38]  K. Goh,et al.  Universal behavior of load distribution in scale-free networks. , 2001, Physical review letters.

[39]  U. Brandes A faster algorithm for betweenness centrality , 2001 .

[40]  Anne-Marie Kermarrec,et al.  Second order centrality: Distributed assessment of nodes criticity in complex networks , 2011, Comput. Commun..

[41]  Jian Pei,et al.  Asymmetric Transitivity Preserving Graph Embedding , 2016, KDD.

[42]  M. Zelen,et al.  Rethinking centrality: Methods and examples☆ , 1989 .

[43]  Xiaoqian Sun,et al.  Worldwide Railway Skeleton Network: Extraction Methodology and Preliminary Analysis , 2017, IEEE Transactions on Intelligent Transportation Systems.

[44]  Evgenios M. Kornaropoulos,et al.  Fast approximation of betweenness centrality through sampling , 2014, Data Mining and Knowledge Discovery.

[45]  Haijun Zhou Distance, dissimilarity index, and network community structure. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.