Finding approximate and constrained motifs in graphs

One of the most relevant topics in the analysis of biological networks is the identification of functional motifs inside a network. A recent approach introduced in literature, called Graph Motif, represents the network as a vertex-colored graph, and the motif M as a multiset of colors. An occurrence of a motif M in a vertex-colored graph G is a connected induced subgraph of G whose vertex set is colored exactly as M. In this paper we investigate three different variants of the Graph Motif problem. The first two variants, Minimum Adding Motif (Min-Add Graph Motif) and Minimum Substitution Motif (Min-Sub Graph Motif), deal with approximate occurrences of a motif in the graph, while the third variant, Constrained Graph Motif (CGM), constrains the motif to contain a given set of vertices. We investigate the computational and parameterized complexity of the three problems. We show that Min-Add Graph Motifand Min-Sub Graph Motifare both NP-hard, even when M is a set, and the graph is a tree with maximum degree 4 in which each color appears at most twice. Then, we show that Min-Sub Graph Motifis fixed-parameter tractable when parameterized by the size of M. Finally, we consider the parameterized complexity of the CGMproblem; we give a fixed-parameter algorithm for graphs of bounded treewidth, and show that the problem is W[2]-hard when parameterized by |M|, even if the input graph has diameter 2.

[1]  Wojciech Szpankowski,et al.  Pairwise Local Alignment of Protein Interaction Networks Guided by Models of Evolution , 2005, RECOMB.

[2]  Marco Cesati,et al.  Compendium of Parameterized Problems , 2006 .

[3]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[4]  Wojciech Szpankowski,et al.  Pairwise Alignment of Protein Interaction Networks , 2006, J. Comput. Biol..

[5]  R. Karp,et al.  Conserved pathways within bacteria and yeast as revealed by global protein network alignment , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Riccardo Dondi,et al.  Maximum Motif Problem in Vertex-Colored Graphs , 2009, CPM.

[7]  Michael R. Fellows,et al.  Parameterized Complexity , 1998 .

[8]  Florian Rasche,et al.  Annotating Fragmentation Patterns , 2009, WABI.

[9]  R. Karp,et al.  From the Cover : Conserved patterns of protein interaction in multiple species , 2005 .

[10]  Riccardo Dondi,et al.  Finding Approximate and Constrained Motifs in Graphs , 2011, CPM.

[11]  Roded Sharan,et al.  Torque: topology-free querying of protein interaction networks , 2009, Nucleic Acids Res..

[12]  Roded Sharan,et al.  Efficient Algorithms for Detecting Signaling Pathways in Protein Interaction Networks , 2006, J. Comput. Biol..

[13]  Roded Sharan,et al.  Topology-Free Querying of Protein Interaction Networks , 2009, RECOMB.

[14]  Michael R. Fellows,et al.  Upper and lower bounds for finding connected motifs in vertex-colored graphs , 2011, J. Comput. Syst. Sci..

[15]  Cristina G. Fernandes,et al.  Motif Search in Graphs: Application to Metabolic Networks , 2006, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[16]  Romeo Rizzi,et al.  Some Results on more Flexible Versions of Graph Motif , 2012, CSR.

[17]  Riccardo Dondi,et al.  Complexity issues in vertex-colored graph pattern matching , 2011, J. Discrete Algorithms.

[18]  Roded Sharan,et al.  Identification of protein complexes by comparative analysis of yeast and bacterial protein interaction data , 2004, J. Comput. Biol..

[19]  Mam Riess Jones Color Coding , 1962, Human factors.

[20]  Geevarghese Philip,et al.  On the Kernelization Complexity of Colorful Motifs , 2010, IPEC.

[21]  Michael R. Fellows,et al.  Sharp Tractability Borderlines for Finding Connected Motifs in Vertex-Colored Graphs , 2007, ICALP.

[22]  Christian Komusiewicz,et al.  Parameterized Algorithmics for Finding Connected Motifs in Biological Networks , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[23]  Shlomo Moran,et al.  Non Deterministic Polynomial Optimization Problems and their Approximations , 1977, Theor. Comput. Sci..

[24]  Viggo Kann,et al.  Some APX-completeness results for cubic graphs , 2000, Theor. Comput. Sci..

[25]  Dimitrios M. Thilikos,et al.  Invitation to fixed-parameter algorithms , 2007, Comput. Sci. Rev..

[26]  Christian Komusiewicz,et al.  Parameterized Algorithms and Hardness Results for Some Graph Motif Problems , 2008, CPM.

[27]  O Mason,et al.  Graph theory and networks in Biology. , 2006, IET systems biology.

[28]  Florian Sikora,et al.  Aspects algorithmiques de la comparaison d'éléments biologiques. (Algorithmics aspects of biological entities comparison) , 2011 .

[29]  Riccardo Dondi,et al.  Weak pattern matching in colored graphs: Minimizing the number of connected components , 2007, ICTCS.

[30]  Sylvain Guillemot,et al.  Finding and Counting Vertex-Colored Subtrees , 2010, Algorithmica.