Approximation Hardness of the Cross-Species Conserved Active Modules Detection Problem

Biological network comparison is an essential but algorithmically challenging approach for the analysis of underlying data. A typical example is looking for certain subgraphs in a given network, such as subgraphs that maximize some function of their nodes’ weights. However, the corresponding maximum-weight connected subgraph (mwcs) problem is known to be hard to approximate. In this contribution, we consider the problem of the simultaneous discovery of maximum weight subgraphs in two networks, whose nodes are matched by a mapping: the maximum-weight cross-connected subgraphs (mwccs) problem. We provide inapproximability results for this problem. These results indicate that the complexity of the problem is conditioned both by the nature of the mapping function and by the topologies of the two networks. In particular, we show that the problem is inapproximable even when the mapping is an injective function and the input graphs are two binary trees. We also prove that it remains hard to approximate when the mapping is a bijective function and the input graphs are a graph and a binary tree. We further analyze a variant of the mwcs problem where the networks’ nodes are assigned both a weight and a contribution value, that we call maximum-weight ratio-bounded connected subgraph (mwrbcs). We provide a polynomial time algorithm for bounded-degree trees and an efficient dynamic programming solution for cycles. These algorithms allow us to derive a polynomial solution for mwccs applicable when (i) mwrbcs is polynomially solvable for one of the graphs and (ii) the set of connected induced subgraphs of the other graph is polynomially enumerable.

[1]  Marek Karpinski,et al.  On Some Tighter Inapproximability Results (Extended Abstract) , 1999, ICALP.

[2]  Damian Szklarczyk,et al.  STRING v9.1: protein-protein interaction networks, with increased coverage and integration , 2012, Nucleic Acids Res..

[3]  Lodewyk F. A. Wessels,et al.  A Critical Evaluation of Network and Pathway-Based Classifiers for Outcome Prediction in Breast Cancer , 2011, PloS one.

[4]  Richard Bonneau,et al.  Multi-species integrative biclustering , 2010, Genome Biology.

[5]  T. Ideker,et al.  Integrative approaches for finding modular structure in biological networks , 2013, Nature Reviews Genetics.

[6]  Johannes Goll,et al.  Protein interaction data curation: the International Molecular Exchange (IMEx) consortium , 2012, Nature Methods.

[7]  Tobias Müller,et al.  Identifying functional modules in protein–protein interaction networks: an integrated exact approach , 2008, ISMB.

[8]  Robin Milner,et al.  On Observing Nondeterminism and Concurrency , 1980, ICALP.

[9]  Benno Schwikowski,et al.  Discovering regulatory and signalling circuits in molecular interaction networks , 2002, ISMB.

[10]  Petra Mutzel,et al.  The Maximum Weight Connected Subgraph Problem , 2013 .

[11]  Mihalis Yannakakis,et al.  Optimization, approximation, and complexity classes , 1991, STOC '88.

[12]  Petra Mutzel,et al.  The Rooted Maximum Node-Weight Connected Subgraph Problem , 2013, CPAIOR.

[13]  Ziv Bar-Joseph,et al.  Cross Species Expression Analysis of Innate Immune Response , 2009, J. Comput. Biol..