ACM SRC poster: a scalable group testing based algorithm for finding d-highest betweenness centrality vertices in large scale networks

We present a group testing approach to identify the first d vertices with the highest betweenness centrality. Betweenness centrality (BC) of a vertex is the ratio of shortest paths that pass through it and is an important metric in complex networks. The Brandes algorithm computes the BC cumulatively over all vertices. Approximate BC of a single vertex can be computed by selective vertex sampling. However, applications such as community detection require only the vertices with the first few highest BC values, which are not known a-priori. In our method we sample a set of vertices and compute their combined BC. It can be shown that for small values of d, the number of tests required to find the d-highest BC vertices is logarithmic in the total number of vertices. Our algorithm is highly scalable since the samples are independently selected and therefore each test can be performed in parallel.

[1]  V. V. Rykov,et al.  Superimposed distance codes , 1989 .

[2]  Leonard M. Freeman,et al.  A set of measures of centrality based upon betweenness , 1977 .

[3]  Wen Haw Chong,et al.  Efficient Extraction of High-Betweenness Vertices , 2010, 2010 International Conference on Advances in Social Networks Analysis and Mining.

[4]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[5]  David A. Bader,et al.  Parallel Algorithms for Evaluating Centrality Indices in Real-world Networks , 2006, 2006 International Conference on Parallel Processing (ICPP'06).

[6]  U. Brandes A faster algorithm for betweenness centrality , 2001 .

[7]  David A. Bader,et al.  Approximating Betweenness Centrality , 2007, WAW.