Phylogenetic networks do not need to be complex: using fewer reticulations to represent conflicting clusters

Phylogenetic trees are widely used to display estimates of how groups of species are evolved. Each phylogenetic tree can be seen as a collection of clusters, subgroups of the species that evolved from a common ancestor. When phylogenetic trees are obtained for several datasets (e.g. for different genes), then their clusters are often contradicting. Consequently, the set of all clusters of such a dataset cannot be combined into a single phylogenetic tree. Phylogenetic networks are a generalization of phylogenetic trees that can be used to display more complex evolutionary histories, including reticulate events, such as hybridizations, recombinations and horizontal gene transfers. Here, we present the new Cass algorithm that can combine any set of clusters into a phylogenetic network. We show that the networks constructed by Cass are usually simpler than networks constructed by other available methods. Moreover, we show that Cass is guaranteed to produce a network with at most two reticulations per biconnected component, whenever such a network exists. We have implemented Cass and integrated it into the freely available Dendroscope software. Contact: l.j.j.v.iersel@gmail.com Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  Luay Nakhleh,et al.  Phylogenetic networks , 2004 .

[2]  Michel Habib,et al.  Level-k Phylogenetic Networks Are Constructable from a Dense Triplet Set in Polynomial Time , 2009, CPM.

[3]  Daniel H. Huson,et al.  Computing galled networks from real data , 2009, Bioinform..

[4]  Daniel H. Huson,et al.  Summarizing Multiple Gene Trees Using Cluster Networks , 2008, WABI.

[5]  Wing-Kin Sung,et al.  Constructing a Smallest Refining Galled Phylogenetic Network , 2005, RECOMB.

[6]  Kunihiko Sadakane,et al.  Computing the Maximum Agreement of Phylogenetic Networks , 2004, CATS.

[7]  L. Nakhleh Evolutionary Phylogenetic Networks: Models and Issues , 2010 .

[8]  Daniel H. Huson,et al.  Beyond Galled Trees - Decomposition and Computation of Galled Networks , 2007, RECOMB.

[9]  Charles Semple,et al.  Computing the minimum number of hybridization events for a consistent evolutionary history , 2007, Discret. Appl. Math..

[10]  Simone Linz,et al.  A Reduction Algorithm for Computing The Hybridization Number of Two Trees , 2007, Evolutionary bioinformatics online.

[11]  L. Stougie,et al.  Constructing Level-2 Phylogenetic Networks from Triplets , 2007, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[12]  C. Semple,et al.  Hybridization in Nonbinary Trees , 2009, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[13]  Daniel H. Huson,et al.  Dendroscope: An interactive viewer for large phylogenetic trees , 2007, BMC Bioinformatics.

[14]  Heiko A. Schmidt,et al.  Phylogenetic trees from large datasets , 2003 .

[15]  Yun S. Song,et al.  A Decomposition Theory for Phylogenetic Networks and Incompatible Characters , 2007, J. Comput. Biol..

[16]  Simone Linz,et al.  Quantifying Hybridization in Realistic Time , 2011, J. Comput. Biol..

[17]  Steven Kelk,et al.  CASS: Combining phylogenetic trees into a phylogenetic network , 2009 .

[18]  Wing-Kin Sung,et al.  Inferring a Level-1 Phylogenetic Network from a Dense Set of Rooted Triplets , 2004, COCOON.

[19]  Jerrold I. Davis,et al.  Phylogeny and subfamilial classification of the grasses (Poaceae) , 2001 .

[20]  Olivier Gascuel,et al.  Reconstructing evolution : new mathematical and computational advances , 2007 .

[21]  Dan Gusfield,et al.  An efficiently computed lower bound on the number of recombinations in phylogenetic networks: Theory and empirical study , 2007, Discret. Appl. Math..

[22]  Wing-Kin Sung,et al.  Algorithms for combining rooted triplets into a galled phylogenetic network , 2005, SODA '05.

[23]  Leo van Iersel,et al.  Constructing the Simplest Possible Phylogenetic Network from Triplets , 2008, Algorithmica.