A penalty box approach for approximation betweenness and closeness centrality algorithms

Centrality metrics are used to find important nodes in social networks. In the days of ever-increasing social network sizes, it becomes more and more difficult to compute centrality scores of all nodes quickly. One of the ways to tackle this problem is to use approximation centrality algorithms using sampling techniques. Also, in situations where finding only high value individuals/important nodes is the primary objective, accuracy of rank ordering of nodes is especially important. We propose a new sampling method similar to Tabu search, called “penalty box approach,” which can be used for approximation closeness and betweenness algorithms. On a variety of graphs we experimentally demonstrate that this new method when combined with previously known methods, such as random sampling and linear scaling, produces better results. The evaluation is done based on two measures that assess quality of rank ordered lists of nodes when compared against the true lists based on their closeness and betweenness scores. Effects of graph characteristics on the parameters of the proposed method are also analyzed.

[1]  David A. Bader,et al.  Parallel Algorithms for Evaluating Centrality Indices in Real-world Networks , 2006, 2006 International Conference on Parallel Processing (ICPP'06).

[2]  Ulrik Brandes,et al.  Centrality Estimation in Large Networks , 2007, Int. J. Bifurc. Chaos.

[3]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[4]  Adriana Iamnitchi,et al.  Identifying high betweenness centrality nodes in large social networks , 2012, Social Network Analysis and Mining.

[5]  Fred W. Glover,et al.  Tabu Search - Part I , 1989, INFORMS J. Comput..

[6]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[7]  Rishi Ranjan Singh,et al.  An Efficient Estimation of a Node's Betweenness , 2015, CompleNet.

[8]  Jon Kleinberg,et al.  Maximizing the spread of influence through a social network , 2003, KDD '03.

[9]  John Skvoretz,et al.  Node centrality in weighted networks: Generalizing degree and shortest paths , 2010, Soc. Networks.

[10]  Florian Probst,et al.  Identifying Key Users in Online Social Networks: A PageRank Based Approach , 2010, ICIS.

[11]  Fred Glover,et al.  Tabu Search - Part II , 1989, INFORMS J. Comput..

[12]  Valdis E. Krebs,et al.  Mapping Networks of Terrorist Cells , 2001 .

[13]  Antoine Dutot,et al.  GraphStream: A Tool for bridging the gap between Complex Systems and Dynamic Graphs , 2008, ArXiv.

[14]  L. Freeman Centrality in social networks conceptual clarification , 1978 .

[15]  Jérôme Kunegis,et al.  KONECT: the Koblenz network collection , 2013, WWW.

[16]  Leon Peeters,et al.  Algorithms for Centrality Indices , 2004, Network Analysis.

[17]  Xiang-Yang Li,et al.  Ranking of Closeness Centrality for Large-Scale Social Networks , 2008, FAW.

[18]  Peter Sanders,et al.  Better Approximation of Betweenness Centrality , 2008, ALENEX.

[19]  Evimaria Terzi,et al.  A divide-and-conquer algorithm for betweenness centrality , 2015, SDM 2015.

[20]  Jure Leskovec,et al.  {SNAP Datasets}: {Stanford} Large Network Dataset Collection , 2014 .

[21]  Evgenios M. Kornaropoulos,et al.  Fast approximation of betweenness centrality through sampling , 2014, WSDM.

[22]  M. Kendall A NEW MEASURE OF RANK CORRELATION , 1938 .

[23]  J. van Engelen,et al.  Disseminating educational innovations in health care practice: training versus social networks. , 2010, Social science & medicine.

[24]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[25]  U. Brandes A faster algorithm for betweenness centrality , 2001 .

[26]  Bildungswesen Advanced School for Computing and Imaging , 2010 .

[27]  David A. Bader,et al.  Approximating Betweenness Centrality , 2007, WAW.

[28]  Johan Pouwelse,et al.  Efficient Approximate Computation of Betweenness Centrality , 2010 .

[29]  David Eppstein,et al.  Fast approximation of centrality , 2000, SODA '01.

[30]  Azer Bestavros,et al.  A Divide-and-Conquer Algorithm for Betweenness Centrality , 2014, SDM.

[31]  W. Hoeffding Probability Inequalities for sums of Bounded Random Variables , 1963 .