Towards a Characterisation of the Generalised Cladistic Character Compatibility Problem for Non-branching Character Trees

In [3,2], the authors introduced the Generalised Cladistic Character Compatibility (GCCC) Problem which generalises a variant of the Perfect Phylogeny Problem in order to model better experiments in molecular biology showing that genes contain information for currently unexpressed traits, e.g., having teeth. In [3], the authors show that this problem is NP-complete and give some special cases which are polynomial. The authors also pose an open case of this problem where each character has only one generalised state, and each character tree is non-branching, a case that models these experiments particularly closely, which we call the Benham-Kannan-Warnow (BKW) Case. In [18], the authors study the complexity of a set of cases of the GCCC Problem for non-branching character trees when the phylogeny tree that is a solution to this compatibility problem is restricted to be either a tree, path or single-branch tree. In particular, they show that if the phylogeny tree must have only one branch, the BKW Case is polynomial-time solvable, by giving a novel algorithm based on PQ-trees used for the consecutive-ones property of binary matrices. In this work, we characterise the complexity of the remainder of the cases considered in [18] for the single-branch tree and the path. We show that some of the open cases are polynomial-time solvable, one by using an algorithm based on directed paths in the character trees similar to the algorithm in [2], and the second by showing that this case can be reduced to a polynomial-time solvable case of [18]. On the other hand, we will show that other open cases are NP-complete using an interesting variation of the ordering problems we study here. In particular, we show that the BKW Case for the path is NP-complete.

[1]  Robin Milner,et al.  On Observing Nondeterminism and Concurrency , 1980, ICALP.

[2]  Dan Gusfield,et al.  Efficient algorithms for inferring evolutionary trees , 1991, Networks.

[3]  J. Trowsdale,et al.  Genomic structure and function in the MHC. , 1993, Trends in genetics : TIG.

[4]  Sampath Kannan,et al.  Inferring Evolutionary History from DNA Sequences , 1994, SIAM J. Comput..

[5]  David Fernández-Baca,et al.  A Polynomial-Time Algorithm for the Perfect Phylogeny Problem when the Number of Character States is Fixed , 1994 .

[6]  João Meidanis,et al.  On the Consecutive Ones Property , 1998, Discret. Appl. Math..

[7]  Sampath Kannan,et al.  Of Chicken Teeth and Mouse Eyes, or Generalized Character Compatibility , 1995, CPM.

[8]  Michael R. Fellows,et al.  Two Strikes Against Perfect Phylogeny , 1992, ICALP.

[9]  Fred R. McMorris,et al.  Triangulating vertex colored graphs , 1994, SODA '93.

[10]  Tandy J. Warnow,et al.  Tree compatibility and inferring evolutionary history , 1994, SODA '93.

[11]  E. Kollar,et al.  Tooth induction in chick epithelium: expression of quiescent genes for enamel synthesis. , 1980, Science.

[12]  Ron Shamir,et al.  Incomplete Directed Perfect Phylogeny , 2000, CPM.

[13]  Kellogg S. Booth,et al.  Testing for the Consecutive Ones Property, Interval Graphs, and Graph Planarity Using PQ-Tree Algorithms , 1976, J. Comput. Syst. Sci..

[14]  Jaroslav Opatrny,et al.  Total Ordering Problem , 1979, SIAM J. Comput..

[15]  Roded Sharan,et al.  Haplotyping with missing data via perfect path phylogenies , 2007, Discret. Appl. Math..

[16]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[17]  M. Steel The complexity of reconstructing trees from qualitative characters and subtrees , 1992 .

[18]  Ján Manuch,et al.  On the Generalised Character Compatibility Problem for Non-branching Character Trees , 2009, COCOON.

[19]  Sampath Kannan,et al.  Hen's Teeth and Whale's Feet: Generalized Characters and Their Compatibility , 1995, J. Comput. Biol..

[20]  Sampath Kannan,et al.  A fast algorithm for the computation and enumeration of perfect phylogenies when the number of character states is fixed , 1995, SODA '95.

[21]  J. Felsenstein Numerical Methods for Inferring Evolutionary Trees , 1982, The Quarterly Review of Biology.

[22]  D. R. Fulkerson,et al.  Incidence matrices and interval graphs , 1965 .

[23]  Dan Gusfield,et al.  The Multi-State Perfect Phylogeny Problem with Missing and Removable Data: Solutions via Integer-Programming and Chordal Graph Theory , 2009, RECOMB.

[24]  F. McMorris,et al.  When is one estimate of evolutionary relationships a refinement of another? , 1980 .

[25]  P. Patel,et al.  Mapping of the congenital generalized hypertrichosis locus to chromosome Xq24–q27.1 , 1995, Nature Genetics.