Fast Discovery of Reliable k-terminal Subgraphs

We present a novel and efficient algorithm for solving the most reliable subgraph problem with multiple query nodes on undirected random graphs. Reliable subgraphs are useful for summarizing connectivity between given query nodes. Formally, we are given a graph G=(V, E), a set of query (or terminal) nodes Q⊂V, and a positive integer B. The objective is to find a subgraph H⊂G containing Q, such that H has at most B edges, and the probability that H is connected is maximized. Previous algorithms for the problem are either computationally demanding, or restricted to only two query nodes. Our algorithm extends a previous algorithm to handle k query nodes, where 2≤k≤|V|. We demonstrate experimentally the usefulness of reliable k-terminal subgraphs, and the accuracy, efficiency and scalability of the proposed algorithm on real graphs derived from public biological databases.
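To make the objective concrete, the following is a minimal sketch of how the reliability of a candidate subgraph H could be estimated by Monte Carlo sampling. It is not the algorithm proposed in the paper. It assumes that each edge exists independently with a known probability, and that "connected" means all query nodes in Q end up in the same component. The function name `estimate_terminal_reliability`, the `(u, v, p)` edge format, and the sampling parameters are hypothetical choices made for illustration.

```python
import random


def estimate_terminal_reliability(edges, terminals, samples=10000, seed=0):
    """Estimate the probability that all terminals are mutually connected.

    edges     -- list of (u, v, p): undirected edge u-v present with probability p
                 (assumption: edges are independent)
    terminals -- the query nodes Q
    samples   -- number of random realizations to draw (hypothetical default)
    """
    rng = random.Random(seed)
    terminals = list(terminals)
    hits = 0
    for _ in range(samples):
        parent = {}  # union-find forest for this realization

        def find(x):
            parent.setdefault(x, x)
            while parent[x] != x:
                parent[x] = parent[parent[x]]  # path halving
                x = parent[x]
            return x

        # Keep each edge with its own probability and merge its endpoints.
        for u, v, p in edges:
            if rng.random() < p:
                parent[find(u)] = find(v)

        # The realization counts if every terminal shares one component.
        root = find(terminals[0])
        if all(find(t) == root for t in terminals):
            hits += 1
    return hits / samples


# Example: a path a-b-c where each edge is present with probability 0.5.
# Both edges must survive for a and c to be connected, so the exact value is 0.25.
path = [("a", "b", 0.5), ("b", "c", 0.5)]
estimate = estimate_terminal_reliability(path, ["a", "c"], samples=20000)
```

A search procedure for the problem would compare such estimates across candidate subgraphs with at most B edges. The sketch covers only the evaluation step, not the search.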
