Lnetwork: an efficient and effective method for constructing phylogenetic networks

MOTIVATION: The evolutionary history of species is traditionally represented with a rooted phylogenetic tree. Each tree comprises a set of clusters, i.e. subsets of the species that are descended from a common ancestor. When rooted phylogenetic trees are built from several different datasets (e.g. from different genes), the resulting clusters often conflict. Conflicting clusters cannot be expressed in a single phylogenetic tree; however, they can be expressed in a phylogenetic network. Phylogenetic networks are a generalization of phylogenetic trees that can account for processes such as hybridization, horizontal gene transfer and recombination, which are difficult to represent in standard tree-like models of evolutionary history. A large body of current research aims to develop methods for constructing phylogenetic networks from cluster sets. The Cass algorithm can construct a much simpler network than other available methods, but it is extremely slow for large datasets and for datasets that require many reticulate nodes. The networks constructed by Cass also depend strongly on the order of the input data, i.e. Cass generally produces different phylogenetic networks for the same dataset when the input order changes.

RESULTS: In this study, we introduce an improved Cass algorithm, Lnetwork, which can construct a phylogenetic network for a given set of clusters. We show that Lnetwork is significantly faster than Cass and effectively weakens the influence of input data order. Moreover, we show that Lnetwork can construct a much simpler network than most of the other available methods.

AVAILABILITY: Lnetwork has been built as a Java software package and is freely available at http://nclab.hit.edu.cn/~wangjuan/Lnetwork/.

CONTACT: maozuguo@hit.edu.cn

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
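The notion of conflicting clusters described in the motivation can be illustrated with a small sketch. It is not part of Lnetwork or Cass; it assumes the standard definition that two clusters are compatible when they are disjoint or one contains the other, and it represents rooted trees as nested Python tuples. The function names (`clusters_of`, `compatible`, `conflicting_pairs`) and the two toy gene trees are hypothetical and chosen only for illustration.

```python
from itertools import combinations


def clusters_of(tree):
    """Collect the clusters of a rooted tree given as nested tuples.

    A leaf is a string (taxon name); an internal node is a tuple of children.
    Each cluster is the frozenset of taxa below one node (leaves and root included).
    """
    found = set()

    def leaves(node):
        if isinstance(node, str):
            below = frozenset([node])
        else:
            below = frozenset().union(*(leaves(child) for child in node))
        found.add(below)
        return below

    leaves(tree)
    return found


def compatible(a, b):
    """Assumed definition: disjoint, or one cluster contains the other."""
    return a.isdisjoint(b) or a <= b or b <= a


def conflicting_pairs(clusters):
    """Return every pair of clusters that cannot sit together in one rooted tree."""
    return [(a, b) for a, b in combinations(clusters, 2) if not compatible(a, b)]


# Two hypothetical gene trees on the same four taxa.
gene_tree_1 = ((("a", "b"), "c"), "d")
gene_tree_2 = (("a", ("b", "c")), "d")

all_clusters = clusters_of(gene_tree_1) | clusters_of(gene_tree_2)
conflicts = conflicting_pairs(all_clusters)
```

In this toy input the clusters of either tree alone are pairwise compatible, but the union contains the pair {a, b} and {b, c}, which overlap without nesting. No single rooted tree can contain both, which is the situation a network-construction method such as Cass or Lnetwork is meant to handle.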
