Path lengths in tree-child time consistent hybridization networks

Hybridization networks are representations of evolutionary histories that allow for the inclusion of reticulate events like recombinations, hybridizations, or lateral gene transfers. The recent growth in the number of hybridization network reconstruction algorithms has led to an increasing interest in the definition of metrics for their comparison that can be used to assess the accuracy or robustness of these methods. In this paper we establish some basic results that make it possible the generalization to tree-child time consistent (TCTC) hybridization networks of some of the oldest known metrics for phylogenetic trees: those based on the comparison of the vectors of path lengths between leaves. More specifically, we associate to each hybridization network a suitably defined vector of 'splitted' path lengths between its leaves, and we prove that if two TCTC hybridization networks have the same such vectors, then they must be isomorphic. Thus, comparing these vectors by means of a metric for real-valued vectors defines a metric for TCTC hybridization networks. We also consider the case of fully resolved hybridization networks, where we prove that simpler, 'non-splitted' vectors can be used.

[1]  Daniel H. Huson,et al.  Beyond Galled Trees - Decomposition and Computation of Galled Networks , 2007, RECOMB.

[2]  Luay Nakhleh,et al.  Phylogenetic networks , 2004 .

[3]  Doolittle Wf Phylogenetic Classification and the Universal Tree , 1999 .

[4]  Tandy J. Warnow,et al.  Towards the Development of Computational Tools for Evaluating Phylogenetic Network Reconstruction Methods , 2002, Pacific Symposium on Biocomputing.

[5]  Tandy J. Warnow,et al.  Reconstructing reticulate evolution in species: theory and practice , 2004, RECOMB.

[6]  D. Huson,et al.  Application of phylogenetic networks in evolutionary studies. , 2006, Molecular biology and evolution.

[7]  J. Farris A Successive Approximations Approach to Character Weighting , 1969 .

[8]  S. J. Willson Reconstruction of certain phylogenetic networks from the genomes at their leaves. , 2008, Journal of theoretical biology.

[9]  Maurizio Naldi,et al.  Evolutionary design of oriented-tree networks using Cayley-type encodings , 2009, Inf. Sci..

[10]  Dong-Guk Shin,et al.  Nodal distance algorithm: calculating a phylogenetic tree comparison metric , 2003, Third IEEE Symposium on Bioinformatics and Bioengineering, 2003. Proceedings..

[11]  W. T. Williams,et al.  ON THE COMPARISON OF TWO CLASSIFICATIONS OF THE SAME SET OF ELEMENTS , 1971 .

[12]  Gabriel Cardona,et al.  A distance metric for a class of tree-sibling phylogenetic networks , 2008, Bioinform..

[13]  Olivier Gascuel,et al.  Reconstructing evolution : new mathematical and computational advances , 2007 .

[14]  Ye.A Smolenskii A method for the linear recording of graphs , 1963 .

[15]  L. Orgel,et al.  Phylogenetic Classification and the Universal Tree , 1999 .

[16]  K. Crandall,et al.  A Comparison of Phylogenetic Network Methods Using Computer Simulation , 2008, PloS one.

[17]  J. Farris On Comparing the Shapes of Taxonomic Trees , 1973 .

[18]  Charles Semple,et al.  Hybrids in real time. , 2006, Systematic biology.

[19]  Gabriel Cardona,et al.  Nodal distances for rooted phylogenetic trees , 2008, Journal of mathematical biology.

[20]  Carsten Wiuf,et al.  Gene Genealogies, Variation and Evolution - A Primer in Coalescent Theory , 2004 .

[21]  Prashant Batra,et al.  Newton's method and the Computational Complexity of the Fundamental Theorem of Algebra , 2008, CCA.

[22]  G. Valiente,et al.  Metrics for Phylogenetic Networks I: Generalizations of the Robinson-Foulds Metric , 2009, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[23]  Kaizhong Zhang,et al.  Perfect phylogenetic networks with recombination , 2001, J. Comput. Biol..

[24]  M. Steel,et al.  Distributions of Tree Comparison Metrics—Some New Results , 1993 .

[25]  Dan Gusfield,et al.  The Fine Structure of Galls in Phylogenetic Networks , 2004, INFORMS J. Comput..

[26]  Kunihiko Sadakane,et al.  Computing the Maximum Agreement of Phylogenetic Networks , 2004, CATS.

[27]  Dan Gusfield,et al.  Optimal, Efficient Reconstruction of Phylogenetic Networks with Constrained Recombination , 2004, J. Bioinform. Comput. Biol..

[28]  Horst Bunke,et al.  Automatic learning of cost functions for graph edit distance , 2007, Inf. Sci..

[29]  Yun S. Song,et al.  Constructing Minimal Ancestral Recombination Graphs , 2005, J. Comput. Biol..

[30]  Gabriel Cardona,et al.  Comparison of Tree-Child Phylogenetic Networks , 2007, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[31]  G. Valiente,et al.  Metrics for Phylogenetic Networks II: Nodal and Triplets Metrics , 2009, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[32]  Tandy J. Warnow,et al.  Phylogenetic networks: modeling, reconstructibility, and accuracy , 2004, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[33]  Gabriel Cardona,et al.  Tripartitions do not always discriminate phylogenetic networks , 2008, Mathematical biosciences.