Tractable cases of (*,2)-bounded parsimony haplotyping

Parsimony haplotyping is the problem of finding a minimum-cardinality set of haplotypes that explains a given set of genotypes, where a genotype is explained by two haplotypes if it can be obtained as a combination of the two. The problem is NP-complete in general, but polynomially solvable for (k, l)-bounded instances for certain k and l. Here, k denotes the maximum number of ambiguous sites in any genotype, and l denotes the maximum number of genotypes that are ambiguous at the same site. Only the complexity of the (*, 2)-bounded problem remains unknown, where * denotes no restriction. It has been proved that (*, 2)-bounded instances have compatibility graphs that can be constructed from cliques and circuits by pasting along an edge. In this paper, we give a constructive proof that (*, 2)-bounded instances are polynomially solvable if the compatibility graph is constructed by pasting cliques, trees and circuits along a bounded number of edges. We obtain this proof by solving a slightly generalized problem on circuits, trees and cliques, respectively, and arguing that all possible combinations of optimal solutions for these graphs, pasted along a bounded number of edges, can be enumerated efficiently.
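To make the definitions concrete, the following is a minimal sketch, not the paper's algorithm. It assumes the common encoding in which haplotypes are 0/1 vectors and genotypes are vectors over {0, 1, 2}, with 2 marking an ambiguous site; the abstract does not fix an encoding. The function names `explains`, `bounds` and `compatible` are hypothetical, and the compatibility test (two genotypes agree on every site where neither is ambiguous) is likewise an assumed reading of "compatibility graph".

```python
from typing import Sequence, Tuple

# Assumed encoding (not fixed by the abstract):
#   haplotype entries are 0 or 1; genotype entries are 0, 1, or 2,
#   where 2 marks an ambiguous site.
Haplotype = Sequence[int]
Genotype = Sequence[int]


def explains(h1: Haplotype, h2: Haplotype, g: Genotype) -> bool:
    """Return True if the pair (h1, h2) explains genotype g."""
    if not (len(h1) == len(h2) == len(g)):
        return False
    for a, b, s in zip(h1, h2, g):
        if s == 2:
            # Ambiguous site: the two haplotypes must differ here.
            if a == b:
                return False
        elif not (a == b == s):
            # Unambiguous site: both haplotypes must equal the genotype value.
            return False
    return True


def bounds(genotypes: Sequence[Genotype]) -> Tuple[int, int]:
    """Return (k, l): k is the maximum number of ambiguous sites in any
    genotype, l is the maximum number of genotypes ambiguous at one site."""
    if not genotypes:
        return 0, 0
    n_sites = len(genotypes[0])
    k = max(sum(1 for s in g if s == 2) for g in genotypes)
    l = max(
        (sum(1 for g in genotypes if g[j] == 2) for j in range(n_sites)),
        default=0,
    )
    return k, l


def compatible(g1: Genotype, g2: Genotype) -> bool:
    """Assumed compatibility test: the genotypes agree on every site
    where neither of them is ambiguous."""
    return all(a == b or a == 2 or b == 2 for a, b in zip(g1, g2))


genotypes = [(2, 0, 2), (0, 2, 1), (1, 0, 2)]
k, l = bounds(genotypes)
```

In this toy instance the first genotype has two ambiguous sites and the third site is ambiguous in two genotypes, so the instance is (2, 2)-bounded and in particular (*, 2)-bounded. The pair (0, 0, 1), (1, 0, 0) explains the genotype (2, 0, 2).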
