Approximating Betweenness Centrality in Large Evolving Networks

Betweenness centrality ranks the importance of nodes by their participation in all shortest paths of the network. Therefore computing exact betweenness values is impractical in large networks. For static networks, approximation based on randomly sampled paths has been shown to be significantly faster in practice. However, for dynamic networks, no approximation algorithm for betweenness centrality is known that improves on static recomputation. We address this deficit by proposing two incremental approximation algorithms (for weighted and unweighted connected graphs) which provide a provable guarantee on the absolute approximation error. Processing batches of edge insertions, our algorithms yield significant speedups up to a factor of $10^4$ compared to restarting the approximation. This is enabled by investing memory to store and efficiently update shortest paths. As a building block, we also propose an asymptotically faster algorithm for updating the SSSP problem in unweighted graphs. Our experimental study shows that our algorithms are the first to make in-memory computation of a betweenness ranking practical for million-edge semi-dynamic networks. Moreover, our results show that the accuracy is even better than the theoretical guarantees in terms of absolutes errors and the rank of nodes is well preserved, in particular for those with high betweenness.

[1]  Thomas W. Reps,et al.  An Incremental Algorithm for a Generalization of the Shortest-Path Problem , 1996, J. Algorithms.

[2]  David A. Bader,et al.  A Fast Algorithm for Streaming Betweenness Centrality , 2012, 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing.

[3]  Matteo Pontecorvi,et al.  Betweenness Centrality - Incremental and Faster , 2013, MFCS.

[4]  Francesco Bonchi,et al.  Scalable online betweenness centrality in evolving graphs , 2016, ICDE.

[5]  Uri Zwick,et al.  On Dynamic Shortest Paths Problems , 2004, Algorithmica.

[6]  Kathleen M. Carley,et al.  Incremental algorithm for updating betweenness centrality in dynamically growing networks , 2013, 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013).

[7]  Peter Sanders,et al.  Better Approximation of Betweenness Centrality , 2008, ALENEX.

[8]  Leonard M. Freeman,et al.  A set of measures of centrality based upon betweenness , 1977 .

[9]  Ulrik Brandes,et al.  Centrality Estimation in Large Networks , 2007, Int. J. Bifurc. Chaos.

[10]  Daniele Frigioni,et al.  Semidynamic Algorithms for Maintaining Single-Source Shortest Path Trees , 1998, Algorithmica.

[11]  Mattia D'Emidio,et al.  Dynamically Maintaining Shortest Path Trees under Batches of Updates , 2013, SIROCCO.

[12]  Daniele Frigioni,et al.  Experimental analysis of dynamic algorithms for the single source shortest paths problem , 1998, JEAL.

[13]  Mattia D'Emidio,et al.  Experimental Evaluation of Dynamic Shortest Path Tree Algorithms on Homogeneous Batches , 2014, SEA.

[14]  Albert-László Barabási,et al.  Evolution of Networks: From Biological Nets to the Internet and WWW , 2004 .

[15]  David A. Bader,et al.  Approximating Betweenness Centrality , 2007, WAW.

[16]  Ryan H. Choi,et al.  QUBE: a quick algorithm for updating betweenness centrality , 2012, WWW.

[17]  Evgenios M. Kornaropoulos,et al.  Fast approximation of betweenness centrality through sampling , 2014, WSDM.

[18]  U. Brandes A faster algorithm for betweenness centrality , 2001 .

[19]  Christian Staudt,et al.  NetworKit: An Interactive Tool Suite for High-Performance Network Analysis , 2014, ArXiv.

[20]  Thomas W. Reps,et al.  On the Computational Complexity of Dynamic Graph Problems , 1996, Theor. Comput. Sci..

[21]  Daniele Frigioni,et al.  Fully dynamic output bounded single source shortest path problem , 1996, SODA '96.

[22]  Dorothea Wagner,et al.  Batch Dynamic Single-Source Shortest-Path Algorithms: An Experimental Study , 2009, SEA.