The prevalence of multifurcations in tree-space and their implications for tree-search.

Phylogenetic tree-search is a major aspect of many evolutionary studies. Several tree rearrangement algorithms are available for tree-search, but it is hard to draw general conclusions about their relative performance because many effects are data set specific and can be highly dependent on individual implementations (e.g., RAxML or phyml). Using only the structure of the rearrangements proposed by the Nearest Neighbor Interchange (NNI) algorithm, we show tree-search can prematurely terminate if it encounters multifurcating trees. We validate the relevance of this result by demonstrating that in real data the majority of possible bifurcating trees potentially encountered during tree-search are actually multifurcations, which suggests NNI would be expected to perform poorly. We also show that the star-decomposition algorithm is a special case of two other popular tree-search algorithms, subtree pruning and regrafting (SPR) and tree bisection and reconnection (TBR), which means that these two algorithms can efficiently escape when they encounter multifurcations. We caution against the use of the NNI algorithm and for most applications we recommend the use of more robust tree-search algorithms, such as SPR and TBR.

[1]  Chris Field,et al.  Estimation of rates-across-sites distributions in phylogenetic substitution models. , 2003, Systematic biology.

[2]  K. Voigt,et al.  Revision of the genus Absidia (Mucorales, Zygomycetes) based on physiological, phylogenetic, and morphological characters; thermotolerant Absidia spp. form a coherent group, Mycocladiaceae fam. nov. , 2007, Mycological research.

[3]  V. Morell The Roots of Phylogeny , 1996 .

[4]  Moriya Ohkuma,et al.  Comparison of four protein phylogeny of parabasalian symbionts in termite guts. , 2007, Molecular phylogenetics and evolution.

[5]  H. Philippe,et al.  Suppression of long-branch attraction artefacts in the animal phylogeny using a site-heterogeneous model , 2007, BMC Evolutionary Biology.

[6]  Simon Whelan,et al.  New approaches to phylogenetic tree search and their application to large numbers of protein alignments. , 2007, Systematic biology.

[7]  S. Hedges,et al.  Tetrapod phylogeny inferred from 18S and 28S ribosomal RNA sequences and a review of the evidence for amniote relationships. , 1990, Molecular biology and evolution.

[8]  S. Carroll,et al.  Genome-scale approaches to resolving incongruence in molecular phylogenies , 2003, Nature.

[9]  A. Tartar,et al.  Phylogenetic analysis identifies the invertebrate pathogen Helicosporidium sp. as a green alga (Chlorophyta). , 2002, International journal of systematic and evolutionary microbiology.

[10]  B. Swalla,et al.  Coloniality has evolved once in Stolidobranch Ascidians. , 2006, Integrative and comparative biology.

[11]  Ziheng Yang,et al.  Fair-balance paradox, star-tree paradox, and Bayesian phylogenetics. , 2007, Molecular biology and evolution.

[12]  F. Delsuc,et al.  Phylogenomics and the reconstruction of the tree of life , 2005, Nature Reviews Genetics.

[13]  M. Steel,et al.  Subtree Transfer Operations and Their Induced Metrics on Evolutionary Trees , 2001 .

[14]  K. Misawa,et al.  Phylogenetic positions of Glaucophyta, green plants (Archaeplastida) and Haptophyta (Chromalveolata) as deduced from slowly evolving nuclear genes. , 2009, Molecular phylogenetics and evolution.

[15]  A. Coleman BIOGEOGRAPHY AND SPECIATION IN THE PANDORINA/VOLVULINA (CHLOROPHYTA) SUPERCLADE , 2001 .

[16]  H. Philippe,et al.  Computing Bayes factors using thermodynamic integration. , 2006, Systematic biology.

[17]  K. Holsinger,et al.  Polytomies and Bayesian phylogenetic inference. , 2005, Systematic biology.