Unique perfect phylogeny is intractable

A phylogeny is a tree capturing evolution and ancestral relationships of a set of taxa (e.g., species). Reconstructing phylogenies from molecular data plays an important role in many areas of contemporary biological research. A phylogeny is perfect if (in rough terms) it correctly captures all input data. Determining if a perfect phylogeny exists was shown to be intractable in 1992 by Mike Steel (Steel, 1992) [32] and independently by Bodlaender et al. (Bodlaender et al., 1992) [4]. In light of this, a related problem was proposed by Steel (1992) [32]: given a perfect phylogeny, determine if it is the unique perfect phylogeny for the given dataset, where the dataset is provided as a set of quartet (4-leaf) trees. It was suggested that this problem may be more tractable (Steel, 1992) [32], and determining its complexity became known as the Quartet Challenge (Steel, 2012) [33]. In this paper, we resolve this question by showing that the problem is CoNP-complete. We prove this by relating perfect phylogenies to satisfying assignments of Boolean formulae. To this end, we cast the question as a chordal sandwich problem. As a particular consequence of our method, we show that the unique minimal chordal sandwich problem is CoNP-complete, and counting minimal chordal sandwiches is #P-complete.

[1]  Martin Charles Golumbic,et al.  Graph Sandwich Problems , 1995, J. Algorithms.

[2]  Ron Shamir,et al.  Complexity and algorithms for reasoning about time: a graph-theoretic approach , 1993, JACM.

[3]  Laurent Juban,et al.  Dichotomy Theorem for the Generalized Unique Satisfiability Problem , 1999, FCT.

[4]  G. F. Estabrook,et al.  An algebraic analysis of cladistic characters , 1976, Discret. Math..

[5]  G. Dirac On rigid circuit graphs , 1961 .

[6]  Dan Gusfield,et al.  Generalizing the Splits Equivalence Theorem and Four Gamete Condition: Perfect Phylogeny on Three-State Characters , 2009, SIAM J. Discret. Math..

[7]  E. Wilson A Consistency Test for Phylogenies Based on Contemporaneous Species , 1965 .

[8]  Sampath Kannan,et al.  Triangulating 3-Colored Graphs , 1992, SIAM J. Discret. Math..

[9]  Michel Habib,et al.  Competitive graph searches , 2008, Theor. Comput. Sci..

[10]  Charles Semple,et al.  A characterization for a set of partial partitions to define an X-tree , 2002, Discret. Math..

[11]  Alfred V. Aho,et al.  The Design and Analysis of Computer Algorithms , 1974 .

[12]  A. D. Gordon Consensus supertrees: The synthesis of rooted trees containing overlapping sets of labeled leaves , 1986 .

[13]  Le Quesne,et al.  The Uniquely Evolved Character Concept , 1977 .

[14]  Celina M. H. de Figueiredo,et al.  On the complexity of the sandwich problems for strongly chordal graphs and chordal bipartite graphs , 2007, Theor. Comput. Sci..

[15]  Dan Gusfield,et al.  Generalizing the Four Gamete Condition and Splits Equivalence Theorem: Perfect Phylogeny on Three State Characters , 2009, WABI.

[16]  G. Estabrook,et al.  An idealized concept of the true cladistic character , 1975 .

[17]  Walter J. Lequesne Further Studies Based on the Uniquely Derived Character Concept , 1972 .

[18]  R. Sokal,et al.  A METHOD FOR DEDUCING BRANCHING SEQUENCES IN PHYLOGENY , 1965 .

[19]  Michel Habib,et al.  Unique Perfect Phylogeny Is NP-Hard , 2010, CPM.

[20]  Fred R. McMorris,et al.  Triangulating vertex colored graphs , 1994, SODA '93.

[21]  Kathryn Fraughnaugh,et al.  Introduction to graph theory , 1973, Mathematical Gazette.

[22]  Robert E. Tarjan,et al.  Algorithmic Aspects of Vertex Elimination on Graphs , 1976, SIAM J. Comput..

[23]  Mike A. Steel,et al.  Algorithmic Aspects of Tree Amalgamation , 2000, J. Algorithms.

[24]  G. Estabrook,et al.  Cladistic Methodology: A Discussion of the Theoretical Basis for the Induction of Evolutionary History , 1972 .

[25]  F. McMorris,et al.  A Mathematical Foundation for the Analysis of Cladistic Character Compatibility , 1976 .

[26]  M. Steel The complexity of reconstructing trees from qualitative characters and subtrees , 1992 .

[27]  David Fernández-Baca,et al.  A Polynomial-Time Algorithm for the Perfect Phylogeny Problem when the Number of Character States is Fixed , 1993, FOCS.

[28]  W. J. Quesne The Uniquely Evolved Character Concept and its Cladistic Application , 1974 .

[29]  Dan Gusfield,et al.  Efficient algorithms for inferring evolutionary trees , 1991, Networks.

[30]  Nadia Creignou,et al.  Complexity of Generalized Satisfiability Counting Problems , 1996, Inf. Comput..

[31]  Simone Linz,et al.  The Complexity of Finding Multiple Solutions to Betweenness and Quartet Compatibility , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[32]  Peter Buneman,et al.  A characterisation of rigid circuit graphs , 1974, Discret. Math..

[33]  Michael R. Fellows,et al.  Two Strikes Against Perfect Phylogeny , 1992, ICALP.