The Most Reliable Subgraph Problem

We introduce the problem of finding the most reliable subgraph: given a probabilistic graph Gsubject to random edge failures, a set of terminal vertices, and an integer Kfind a subgraph Hi¾? Ghaving Kfewer edges than G, such that the probability of connecting the terminals in His maximized. The solution has applications in link analysis and visualization. We begin by formally defining the problem in a general form, after which we focus on a two-terminal, undirected case. Although the problem is most likely computationally intractable, we give a polynomial-time algorithm for a special case where Gis seriesparallel. For the general case, we propose a computationally efficient greedy heuristic. Our experiments on simulated graphs illustrate the usefulness of the concept of most reliable subgraph, and suggest that the heuristic for the general case is quite competitive.

[1]  Francis D. Gibbons,et al.  Predicting protein complex membership using probabilistic network reliability. , 2004, Genome research.

[2]  Charles J. Colbourn,et al.  The Combinatorics of Network Reliability , 1987 .

[3]  L. Page,et al.  Reliability polynomials and link importance in networks , 1994 .

[4]  Leslie G. Valiant,et al.  The Complexity of Enumeration and Reliability Problems , 1979, SIAM J. Comput..

[5]  Lise Getoor,et al.  Link mining: a survey , 2005, SKDD.

[6]  C. Lie,et al.  Joint reliability-importance of two edges in an undirected network , 1993 .

[7]  Christos Faloutsos,et al.  Fast discovery of connection subgraphs , 2004, KDD.

[8]  Luc De Raedt,et al.  Compressing probabilistic Prolog programs , 2007, Machine Learning.

[9]  Z W Birnbaum,et al.  ON THE IMPORTANCE OF DIFFERENT COMPONENTS IN A MULTICOMPONENT SYSTEM , 1968 .

[10]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[11]  Michael O. Ball,et al.  Complexity of network reliability computations , 1980, Networks.

[12]  Richard M. Karp,et al.  Monte-Carlo Approximation Algorithms for Enumeration Problems , 1989, J. Algorithms.

[13]  Eugene L. Lawler,et al.  The Recognition of Series Parallel Digraphs , 1982, SIAM J. Comput..

[14]  Amit P. Sheth,et al.  Discovering informative connection subgraphs in multi-relational graphs , 2005, SKDD.

[15]  Shou-De Lin,et al.  Unsupervised link discovery in multi-relational data via rarity analysis , 2003, Third IEEE International Conference on Data Mining.

[16]  Hannu Toivonen,et al.  Link Discovery in Graphs Derived from Biological Databases , 2006, DILS.