Weak pattern matching in colored graphs: Minimizing the number of connected components

In the context of metabolic network analysis, Lacroix et al.11 introduced the problem of finding occurrences of motifs in vertex-colored graphs, where a motif is a multiset of colors and an occurrence of a motif is a subset of connected vertices which are colored by all colors of the motif. We consider in this paper the above-mentioned problem in one of its natural optimization forms, referred hereafter as the Min-CC problem: Find an occurrence of a motif in a vertex-colored graph, called the target graph, that induces a minimum number of connected components. Our results can be summarized as follows. We prove the Min-CC problem to be APX–hard even in the extremal case where the motif is a set and the target graph is a path. We complement this result by giving a polynomial-time algorithm in case the motif is built upon a fixed number of colors and the target graph is a path. Also, extending recent research8 , we prove the Min- CC problem to be fixed-parameter tractable when parameterized by the size of the motif, and we give a faster algorithm in case the target graph is a tree. Furthermore, we prove the Min-CC problem for trees not to be approximable within ratio c log n for some constant c > 0, where n is the order of the target graph, and to be W[2]–hard when parameterized by the number of connected components in the occurrence of the motif. Finally, we give an exact efficient exponential-time algorithm for the Min-CC problem in case the target graph is a tree.

[1]  Reinhard Diestel,et al.  Graph Theory , 1997 .

[2]  Michael R. Fellows,et al.  Sharp Tractability Borderlines for Finding Connected Motifs in Vertex-Colored Graphs , 2007, ICALP.

[3]  Roded Sharan,et al.  Efficient Algorithms for Detecting Signaling Pathways in Protein Interaction Networks , 2006, J. Comput. Biol..

[4]  Cristina G. Fernandes,et al.  Motif Search in Graphs: Application to Metabolic Networks , 2006, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[5]  Paul S. Bonsma,et al.  Complexity results on restricted instances of a paint shop problem for words , 2006, Discret. Appl. Math..

[6]  Yves Deville,et al.  An Overview of Data Models for the Analysis of Biochemical Pathways , 2003, CMSB.

[7]  Ran Raz,et al.  A sub-constant error-probability low-degree test, and a sub-constant error-probability PCP characterization of NP , 1997, STOC '97.

[8]  Shlomo Moran,et al.  Non Deterministic Polynomial Optimization Problems and their Approximations , 1977, Theor. Comput. Sci..

[9]  Winfried Hochstättler,et al.  Complexity results on a paint shop problem , 2004, Discret. Appl. Math..

[10]  Noga Alon,et al.  Color-coding , 1995, JACM.

[11]  P. Bonsma Complexity results for restricted instances of a paint shop problem , 2003 .

[12]  Jörg Flum,et al.  Parameterized Complexity Theory , 2006, Texts in Theoretical Computer Science. An EATCS Series.

[13]  Mihalis Yannakakis,et al.  Optimization, approximation, and complexity classes , 1991, STOC '88.

[14]  Michael R. Fellows,et al.  Parameterized Complexity , 1998 .

[15]  R. K. Shyamasundar,et al.  Introduction to algorithms , 1996 .