Finding k most influential edges on flow graphs

In this paper, we formulate a novel question on maximum flow queries. Specifically, this problem aims to find which k edges would have the largest impact on a maximum flow query on a network. This problem has important applications in areas like social network and network planning. We show the inapproximability of the problems and present our heuristic algorithms. Experimental evaluations are carried out on real datasets and results show that our algorithms are scalable and return high quality solutions. HighlightsWe propose two graph problems: the k Most Beneficial New Edges (kMBNE), and the k Most Lethal Existing Edges (kMLEE).First, we prove that kMBNE and kMLEE are inapproximable. It is hard to find even an approximate solution (with constant approximation ratio), let alone find the exact solution.For both kMBNE and kMLEE, we develop polynomial-time heuristic algorithms that give high-quality solutions on real flow graphs. Moreover, we propose several pruning and optimization techniques to speedup our proposed algorithms.

[1]  Fang Zhou,et al.  Network Simplification with Minimal Loss of Connectivity , 2010, 2010 IEEE International Conference on Data Mining.

[2]  Özlem Ergun,et al.  The Maximum Flow Network Interdiction Problem: Valid inequalities, integrality gaps, and approximability , 2010, Oper. Res. Lett..

[3]  Aristides Gionis,et al.  Sparsification of influence networks , 2011, KDD.

[4]  Yixun Lin,et al.  Computation of the Reverse Shortest-Path Problem , 2003, J. Glob. Optim..

[5]  Michalis Faloutsos,et al.  Gelling, and melting, large graphs by edge manipulation , 2012, CIKM.

[6]  Rajmohan Rajaraman,et al.  Bounded budget connection (BBC) games or how to make friends and influence people, on a budget , 2008, PODC '08.

[7]  Morteza Zadimoghaddam,et al.  Minimizing the Diameter of a Network Using Shortcut Edges , 2010, SWAT.

[8]  Kazuyuki Aihara,et al.  A large-scale study of link spam detection by graph algorithms , 2007, AIRWeb '07.

[9]  Qi He,et al.  Boosting social network connectivity with link revival , 2010, CIKM '10.

[10]  Aristides Gionis,et al.  Suggesting ghost edges for a smaller world , 2011, CIKM '11.

[11]  Fang Zhou,et al.  A Framework for Path-Oriented Network Simplification , 2010, IDA.

[12]  R. Kevin Wood,et al.  Deterministic network interdiction , 1993 .

[13]  S. Kumar,et al.  An Incremental Algorithm for the Maximum Flow Problem , 2003, J. Math. Model. Algorithms.

[14]  Jure Leskovec,et al.  Statistical properties of community structure in large social and information networks , 2008, WWW.

[15]  Cynthia A. Phillips,et al.  The network inhibition problem , 1993, STOC.

[16]  R. Carr,et al.  A Decomposition-Based Pseudoapproximation Algorithm for Network Flow Inhibition , 2003 .

[17]  Robert Krauthgamer,et al.  Polylogarithmic inapproximability , 2003, STOC '03.

[18]  Jeffrey Xu Yu,et al.  High efficiency and quality: large graphs matching , 2011, CIKM.

[19]  Jianzhon Zhang,et al.  Inverse maximum flow and minimum cut problems , 1997 .

[20]  Dong Li,et al.  Link recommendation for promoting information diffusion in social networks , 2013, WWW.

[21]  Yunjun Gao,et al.  Optimal-Location-Selection Query Processing in Spatial Databases , 2009, IEEE Transactions on Knowledge and Data Engineering.

[22]  Rajeev Rastogi,et al.  Recommendations to boost content spread in social networks , 2012, WWW.

[23]  Johannes O. Royset,et al.  Solving the Bi-Objective Maximum-Flow Network-Interdiction Problem , 2007, INFORMS J. Comput..

[24]  C. Lee Giles,et al.  Efficient identification of Web communities , 2000, KDD '00.

[25]  Masaru Kitsuregawa,et al.  Effects of maximum flow algorithm on identifying web community , 2002, WIDM '02.

[26]  Lakshminarayanan Subramanian,et al.  Sybil-Resilient Online Content Voting , 2009, NSDI.

[27]  Adam Meyerson,et al.  Minimizing Average Shortest Path Distances via Shortcut Edge Addition , 2009, APPROX-RANDOM.

[28]  R. K. Shyamasundar,et al.  Introduction to algorithms , 1996 .

[29]  Wei Jin,et al.  SAPPER: Subgraph Indexing and Approximate Matching in Large Graphs , 2010, Proc. VLDB Endow..

[30]  Philip S. Yu,et al.  GConnect: A Connectivity Index for Massive Disk-resident Graphs , 2009, Proc. VLDB Endow..

[31]  Masaru Kitsuregawa,et al.  Finding a Web community by maximum flow algorithm with HITS score based capacity , 2003, Eighth International Conference on Database Systems for Advanced Applications, 2003. (DASFAA 2003). Proceedings..

[32]  Mita Nasipuri,et al.  Fault Detection and Localization Scheme for Multiple Failures in Optical Network , 2008, ICDCN.

[33]  Ryutaro Ichise,et al.  Similarity search on supergraph containment , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).