Fast algorithms for computing the tripartition-based distance between phylogenetic networks

Consider two phylogenetic networks $${\cal N}$$ and $${\cal N}$$’ of size n. The tripartition-based distance finds the proportion of tripartitions which are not shared by $${\cal N}$$ and $${\cal N}$$’. This distance is proposed by Moret et al. (2004) and is a generalization of Robinson-Foulds distance, which is orginally used to compare two phylogenetic trees. This paper gives an $$O(\min \{k n \log n, n \log n + hn\})$$-time algorithm to compute this distance, where h is the number of hybrid nodes in $${\cal N}$$ and $${\cal N}$$’ while k is the maximum number of hybrid nodes among all biconnected components in $${\cal N}$$ and $${\cal N}$$’. Note that $k \ll h \ll n$ in a phylogenetic network. In addition, we propose algorithms for comparing galled-trees, which are an important, biological meaningful special case of phylogenetic network. We give an $O(n)$-time algorithm for comparing two galled-trees. We also give an $$O(n + kh)$$-time algorithm for comparing a galled-tree with another general network, where h and k are the number of hybrid nodes in the latter network and its biggest biconnected component respectively.

[1]  Doolittle Wf Phylogenetic Classification and the Universal Tree , 1999 .

[2]  Dan Gusfield,et al.  Efficient reconstruction of phylogenetic networks with constrained recombination , 2003, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003.

[3]  Wing-Kin Sung,et al.  The Maximum Agreement of Two Nested Phylogenetic Networks , 2004, ISAAC.

[4]  Wing-Kin Sung,et al.  Constructing a Smallest Refining Galled Phylogenetic Network , 2005, RECOMB.

[5]  Wing-Kin Sung,et al.  Inferring a Level-1 Phylogenetic Network from a Dense Set of Rooted Triplets , 2004, COCOON.

[6]  Dan Gusfield,et al.  A Fundamental Decomposition Theory for Phylogenetic Networks and Incompatible Characters , 2005, RECOMB.

[7]  Kaizhong Zhang,et al.  Perfect Phylogenetic Networks with Recombination , 2001, J. Comput. Biol..

[8]  Siu-Ming Yiu,et al.  Non-shared edges and nearest neighbor interchanges revisited , 2004, Inf. Process. Lett..

[9]  Wing-Kin Sung,et al.  Algorithms for Combining Rooted Triplets into a Galled Phylogenetic Network , 2006, SIAM J. Comput..

[10]  Ronald L. Rivest,et al.  Introduction to Algorithms , 1990 .

[11]  D. Robinson,et al.  Comparison of phylogenetic trees , 1981 .

[12]  J. Hein Reconstructing evolution of sequences subject to recombination using parsimony. , 1990, Mathematical biosciences.

[13]  Robert E. Tarjan,et al.  Fast Algorithms for Finding Nearest Common Ancestors , 1984, SIAM J. Comput..

[14]  Tandy J. Warnow,et al.  Towards the Development of Computational Tools for Evaluating Phylogenetic Network Reconstruction Methods , 2002, Pacific Symposium on Biocomputing.

[15]  Vincent Moulton,et al.  NeighborNet: An Agglomerative Method for the Construction of Planar Phylogenetic Networks , 2002, WABI.

[16]  Daniel H. Huson,et al.  Reconstruction of Reticulate Networks from Gene Trees , 2005, RECOMB.

[17]  L. Orgel,et al.  Phylogenetic Classification and the Universal Tree , 1999 .

[18]  Daniel H. Huson,et al.  Phylogenetic Super-Networks from Partial Trees , 2004, IEEE ACM Trans. Comput. Biol. Bioinform..

[19]  Kunihiko Sadakane,et al.  Computing the Maximum Agreement of Phylogenetic Networks , 2004, CATS.

[20]  Tandy J. Warnow,et al.  Phylogenetic networks: modeling, reconstructibility, and accuracy , 2004, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[21]  K. Crandall,et al.  Intraspecific gene genealogies: trees grafting into networks. , 2001, Trends in ecology & evolution.

[22]  Tandy J. Warnow,et al.  Reconstructing reticulate evolution in species: theory and practice , 2004, RECOMB.