Error in Phylogenetic Estimation for Bushes in the Tree of Life

Many rapid radiations, or bushes, throughout the Tree of Life remain unresolved. Here, we investigated how the shape of a bush interacts with two key processes - coalescence and mutation - that can lead to errors in phylogenetic inference under specific conditions. For this study, we focused on the tradeoff between sampling more individuals per species and sampling more loci as well as the utility of a species tree method based upon gene tree reconciliation and the concatenation of multiple loci for resolving bushes. We examined different bush shapes, varying both the speciation rate during the radiation and the depth of the radiation, to encompass a broad range of situations. Using simulations based upon parameters derived from empirical studies, we investigated the performance of phylogenetic analyses under different conditions to identify approaches with the greatest potential to resolve difficult phylogenies. Sampling a single individual for more loci outperformed sampling multiple individuals for one locus in all cases except the most recent radiations. We found that error due to homoplastic mutations increased with depth, while error due to the coalescent process remained unchanged. These simulations also revealed that, for certain ancient bushes, analyses of concatenated data matrices surprisingly resulted in more accurate phylogenies than gene tree reconciliation. The poor performance of gene tree reconciliation in this study appeared to reflect the poor estimation of gene trees, not the superiority of concatenation per se. Our results suggest concatenation remains a useful approximate method for species tree estimation, even for rapid evolutionary radiations. However, improved estimation of gene trees combined with use of gene tree reconciliation has the greatest potential for resolving the remaining bushes of the Tree of Life.

[1]  M. Sanderson Estimating absolute rates of molecular evolution and divergence times: a penalized likelihood approach. , 2002, Molecular biology and evolution.

[2]  E. Braun,et al.  From Reptilian Phylogenomics to Reptilian Genomes: Analyses of c-Jun and DJ-1 Proto-Oncogenes , 2010, Cytogenetic and Genome Research.

[3]  Liang Liu,et al.  Phybase: an R package for species tree analysis , 2010, Bioinform..

[4]  M. Holder,et al.  Evaluating the robustness of phylogenetic methods to among-site variability in substitution processes , 2008, Philosophical Transactions of the Royal Society B: Biological Sciences.

[5]  Scott V Edwards,et al.  Coalescent methods for estimating phylogenetic trees. , 2009, Molecular phylogenetics and evolution.

[6]  M. Batzer,et al.  SINEs of a nearly perfect character. , 2006, Systematic biology.

[7]  J. Brosius,et al.  A universal method for the study of CR1 retroposons in nonmodel bird genomes. , 2012, Molecular biology and evolution.

[8]  R. Page,et al.  How should species phylogenies be inferred from sequence data? , 1999, Systematic biology.

[9]  C. J-F,et al.  THE COALESCENT , 1980 .

[10]  Travis C. Glenn,et al.  A Phylogeny of Birds Based on Over 1,500 Loci Collected by Target Enrichment and High-Throughput Sequencing , 2012, PloS one.

[11]  Chad D. Brock,et al.  Nine exceptional radiations plus high turnover explain species diversity in jawed vertebrates , 2009, Proceedings of the National Academy of Sciences.

[12]  J. Huelsenbeck,et al.  Bayesian phylogenetic analysis of combined data. , 2004, Systematic biology.

[13]  L. Kubatko Identifying hybridization events in the presence of coalescence via model selection. , 2009, Systematic biology.

[14]  J. Cracraft,et al.  Phylogenetic relationships among modern birds (Neornithes): towards an avian tree of life , 2004 .

[15]  B. Larget,et al.  Bayesian estimation of concordance among gene trees. , 2006, Molecular biology and evolution.

[16]  Tandy J. Warnow,et al.  Fast and accurate methods for phylogenomic analyses , 2011, BMC Bioinformatics.

[17]  Colin N. Dewey,et al.  BUCKy: Gene tree/species tree reconciliation with Bayesian concordance analysis , 2010, Bioinform..

[18]  S. Carroll,et al.  Animal Evolution and the Molecular Signature of Radiations Compressed in Time , 2005, Science.

[19]  Sen Song,et al.  Resolving conflict in eutherian mammal phylogeny using phylogenomics and the multispecies coalescent model , 2012, Proceedings of the National Academy of Sciences.

[20]  M. Braun,et al.  Are transposable element insertions homoplasy free?: an examination using the avian tree of life. , 2011, Systematic biology.

[21]  N. Rosenberg,et al.  Discordance of Species Trees with Their Most Likely Gene Trees , 2006, PLoS genetics.

[22]  L. Kubatko,et al.  Inconsistency of phylogenetic estimates from concatenated data under coalescence. , 2007, Systematic biology.

[23]  Bryan C Carstens,et al.  Estimating species phylogeny from gene-tree probabilities despite incomplete lineage sorting: an example from Melanoplus grasshoppers. , 2007, Systematic biology.

[24]  T. J. Robinson,et al.  Indel evolution of mammalian introns and the utility of non-coding nuclear markers in eutherian phylogenetics. , 2007, Molecular phylogenetics and evolution.

[25]  H. Philippe,et al.  How good are deep phylogenetic trees? , 1998, Current opinion in genetics & development.

[26]  Bin Ma,et al.  From Gene Trees to Species Trees , 2000, SIAM J. Comput..

[27]  D. Swofford,et al.  Should we use model-based methods for phylogenetic inference when we know that assumptions about among-site rate variation and nucleotide substitution pattern are violated? , 2001, Systematic biology.

[28]  K. Peterson,et al.  MicroRNAs and metazoan phylogeny: big trees from little genes , 2009 .

[29]  J. Gordon Burleigh,et al.  Assessing Parameter Identifiability in Phylogenetic Models Using Data Cloning , 2012, Systematic biology.

[30]  Nicholas G. Crawford,et al.  LSU Digital Commons LSU Digital Commons Ultraconserved elements are novel phylogenomic markers that Ultraconserved elements are novel phylogenomic markers that resolve placental mammal phylogeny when combined with resolve placental mammal phylogeny when combined with species-tree analysis species-tr , 2022 .

[31]  M. Gouy,et al.  Genome-scale coestimation of species and gene trees , 2013, Genome research.

[32]  M. Braun,et al.  A well-tested set of primers to amplify regions spread across the avian genome , 2009 .

[33]  R. Hudson Gene genealogies and the coalescent process. , 1990 .

[34]  H. Ellegren,et al.  Genomics of natural bird populations: a gene‐based set of reference markers evenly spread across the avian genome , 2007, Molecular ecology.

[35]  P. Lockhart,et al.  Deciphering ancient rapid radiations. , 2007, Trends in ecology & evolution.

[36]  Ole Seehausen,et al.  African cichlid fish: a model system in adaptive radiation research , 2006, Proceedings of the Royal Society B: Biological Sciences.

[37]  Edward L. Braun,et al.  Phylogenomic evidence for multiple losses of flight in ratite birds , 2008, Proceedings of the National Academy of Sciences.

[38]  Ingo Ebersberger,et al.  Rooted triple consensus and anomalous gene trees , 2008, BMC Evolutionary Biology.

[39]  N. Takahata Gene genealogy in three related populations: consistency probability between gene and population trees. , 1989, Genetics.

[40]  Andrew Rambaut,et al.  Seq-Gen: an application for the Monte Carlo simulation of DNA sequence evolution along phylogenetic trees , 1997, Comput. Appl. Biosci..

[41]  M. Nei,et al.  Relationships between gene trees and species trees. , 1988, Molecular biology and evolution.

[42]  Liran Carmel,et al.  Homoplasy in genome-wide analysis of rare amino acid replacements: the molecular-evolutionary basis for Vavilov's law of homologous series , 2008, Biology Direct.

[43]  D. Penny Rewriting Evolution—“Been There, Done That” , 2013, Genome biology and evolution.

[44]  Liang Liu,et al.  BEST: Bayesian estimation of species trees under the coalescent model , 2008, Bioinform..

[45]  Benjamin J. Raphael,et al.  Microinversions in mammalian evolution , 2006, Proceedings of the National Academy of Sciences.

[46]  Shigenori Maruyama,et al.  Retroposon analysis and recent geological data suggest near-simultaneous divergence of the three superorders of mammals , 2009, Proceedings of the National Academy of Sciences.

[47]  Korbinian Strimmer,et al.  APE: Analyses of Phylogenetics and Evolution in R language , 2004, Bioinform..

[48]  H. Philippe,et al.  The new phylogeny of eukaryotes. , 2000, Current opinion in genetics & development.

[49]  Steven Poe,et al.  BIRDS IN A BUSH: FIVE GENES INDICATE EXPLOSIVE EVOLUTION OF AVIAN ORDERS , 2004, Evolution; international journal of organic evolution.

[50]  Bruce Rannala,et al.  The accuracy of species tree estimation under simulation: a comparison of methods. , 2011, Systematic biology.

[51]  D. J. Funk,et al.  Species-Level Paraphyly and Polyphyly: Frequency, Causes, and Consequences, with Insights from Animal Mitochondrial DNA , 2003 .

[52]  E. Braun,et al.  POLYTOMIES, THE POWER OF PHYLOGENETIC INFERENCE, AND THE STOCHASTIC NATURE OF MOLECULAR EVOLUTION: A COMMENT ON WALSH ET AL. (1999) , 2001, Evolution; international journal of organic evolution.

[53]  John E McCormack,et al.  Maximum likelihood estimates of species trees: how accuracy of phylogenetic inference depends upon the divergence history and sampling design. , 2009, Systematic biology.

[54]  G. Yule,et al.  A Mathematical Theory of Evolution Based on the Conclusions of Dr. J. C. Willis, F.R.S. , 1925 .

[55]  J. Felsenstein Cases in which Parsimony or Compatibility Methods will be Positively Misleading , 1978 .

[56]  W. A. Cox,et al.  A Phylogenomic Study of Birds Reveals Their Evolutionary History , 2008, Science.

[57]  H. Philippe,et al.  Resolving Difficult Phylogenetic Questions: Why More Sequences Are Not Enough , 2011, PLoS biology.

[58]  E. Braun,et al.  Testing hypotheses about the sister group of the passeriformes using an independent 30-locus data set. , 2012, Molecular biology and evolution.

[59]  Sarah A. Teichmann,et al.  Is There a Phylogenetic Signal in Prokaryote Proteins? , 1999, Journal of Molecular Evolution.

[60]  Edward L. Braun,et al.  Parsimony and Model-Based Analyses of Indels in Avian Nuclear Genes Reveal Congruent and Incongruent Phylogenetic Signals , 2013, Biology.

[61]  Oliver A Ryder,et al.  Phylogenetic utility of nuclear introns in interfamilial relationships of Caniformia (order Carnivora). , 2011, Systematic biology.

[62]  Caitlin A. Kuczynski,et al.  Phylogeny of iguanian lizards inferred from 29 nuclear loci, and a comparison of concatenated and species-tree approaches for an ancient, rapid radiation. , 2011, Molecular phylogenetics and evolution.

[63]  Y. Zhuravlev,et al.  NUCLEAR LOCI AND COALESCENT METHODS SUPPORT ANCIENT HYBRIDIZATION AS CAUSE OF MITOCHONDRIAL PARAPHYLY BETWEEN GADWALL AND FALCATED DUCK (ANAS SPP.) , 2007, Evolution; international journal of organic evolution.

[64]  R. Copley,et al.  Acoelomorph flatworms are deuterostomes related to Xenoturbella , 2011, Nature.

[65]  Noah A Rosenberg,et al.  Gene tree discordance, phylogenetic inference and the multispecies coalescent. , 2009, Trends in ecology & evolution.

[66]  J. Oliver MICROEVOLUTIONARY PROCESSES GENERATE PHYLOGENOMIC DISCORDANCE AT ANCIENT DIVERGENCES , 2013, Evolution; international journal of organic evolution.

[67]  H. Shaffer,et al.  Turtle phylogeny: insights from a novel nuclear intron. , 2004, Molecular phylogenetics and evolution.

[68]  W. Maddison,et al.  Inferring phylogeny despite incomplete lineage sorting. , 2006, Systematic biology.

[69]  Elchanan Mossel,et al.  On the Impossibility of Reconstructing Ancestral Data and Phylogenies , 2003, J. Comput. Biol..

[70]  Alexandros Stamatakis,et al.  RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models , 2006, Bioinform..

[71]  S. Edwards IS A NEW AND GENERAL THEORY OF MOLECULAR SYSTEMATICS EMERGING? , 2009, Evolution; international journal of organic evolution.

[72]  Noah A Rosenberg,et al.  The probability of topological concordance of gene trees and species trees. , 2002, Theoretical population biology.

[73]  Olivier Gascuel,et al.  Genomics, biogeography, and the diversification of placental mammals , 2007, Proceedings of the National Academy of Sciences.

[74]  Liran Carmel,et al.  Ecdysozoan clade rejected by genome-wide analysis of rare amino acid replacements. , 2007, Molecular biology and evolution.

[75]  E. Braun,et al.  Introns outperform exons in analyses of basal avian phylogeny using clathrin heavy chain genes. , 2008, Gene.

[76]  S. Carroll,et al.  Genome-scale approaches to resolving incongruence in molecular phylogenies , 2003, Nature.

[77]  A. Meyer,et al.  Origin of the Superflock of Cichlid Fishes from Lake Victoria, East Africa , 2003, Science.

[78]  Laura Salter Kubatko,et al.  STEM: species tree estimation using maximum likelihood for gene trees under coalescence , 2009, Bioinform..

[79]  W. Murphy,et al.  Resolution of the Early Placental Mammal Radiation Using Bayesian Phylogenetics , 2001, Science.

[80]  Liang Liu,et al.  Estimating Species Trees Using Multiple-Allele DNA Sequence Data , 2008, Evolution; international journal of organic evolution.

[81]  A. Drummond,et al.  Bayesian Inference of Species Trees from Multilocus Data , 2009, Molecular biology and evolution.

[82]  B. King,et al.  MicroRNAs support a turtle + lizard clade , 2012, Biology Letters.

[83]  M. Braun,et al.  Homoplastic microinversions and the avian tree of life , 2011, BMC Evolutionary Biology.

[84]  D. Pearl,et al.  High-resolution species trees without concatenation , 2007, Proceedings of the National Academy of Sciences.

[85]  R. Debry,et al.  NUCLEAR INTRON SEQUENCES FOR PHYLOGENETICS OF CLOSELY RELATED MAMMALS: AN EXAMPLE USING THE PHYLOGENY OF MUS , 2001 .

[86]  R. Nichols,et al.  Gene trees and species trees are not the same. , 2001, Trends in ecology & evolution.

[87]  J. Brosius,et al.  Retroposon insertion patterns of neoavian birds: strong evidence for an extensive incomplete lineage sorting era. , 2012, Molecular biology and evolution.

[88]  F. Delsuc,et al.  Phylogenomics: the beginning of incongruence? , 2006, Trends in genetics : TIG.

[89]  Scott V Edwards,et al.  SPECIATIONAL HISTORY OF AUSTRALIAN GRASS FINCHES (POEPHILA) INFERRED FROM THIRTY GENE TREES* , 2005, Evolution; international journal of organic evolution.

[90]  R. C. Thomson,et al.  Testing avian, squamate, and mammalian nuclear markers for cross amplification in turtles , 2010, Conservation Genetics Resources.

[91]  P. Houde,et al.  PARALLEL RADIATIONS IN THE PRIMARY CLADES OF BIRDS , 2004, Evolution; international journal of organic evolution.

[92]  Edward L. Braun,et al.  A multigene phylogeny of Galliformes supports a single origin of erectile ability in non-feathered facial traits , 2008 .

[93]  Daniel E. Warren,et al.  The western painted turtle genome, a model for the evolution of extreme physiological adaptations in a slowly evolving lineage , 2013, Genome Biology.

[94]  S. Carroll,et al.  Bushes in the Tree of Life , 2006, PLoS biology.

[95]  Qixin He,et al.  Sources of error inherent in species-tree estimation: impact of mutational and coalescent effects on accuracy and implications for choosing among different methods. , 2010, Systematic biology.

[96]  A. Yoder,et al.  Multiple nuclear loci reveal patterns of incomplete lineage sorting and complex species history within western mouse lemurs (Microcebus). , 2007, Molecular phylogenetics and evolution.

[97]  D. Robinson,et al.  Comparison of phylogenetic trees , 1981 .

[98]  J. Kim,et al.  Slicing hyperdimensional oranges: the geometry of phylogenetic estimation. , 2000, Molecular phylogenetics and evolution.

[99]  Jordan V Smith,et al.  Ratite nonmonophyly: independent evidence from 40 novel Loci. , 2013, Systematic biology.

[100]  C. Ané,et al.  Comparing two Bayesian methods for gene tree/species tree reconstruction: simulations with incomplete lineage sorting and horizontal gene transfer. , 2011, Systematic biology.

[101]  E. Braun,et al.  Turtle isochore structure is intermediate between amphibians and other amniotes. , 2008, Integrative and comparative biology.

[102]  W. Moore,et al.  Chapter 4 - The Window of Taxonomic Resolution for Phylogenies Based on Mitochondrial Cytochrome b , 1997 .

[103]  Daniel L Rabosky,et al.  LASER: A Maximum Likelihood Toolkit for Detecting Temporal Shifts in Diversification Rates From Molecular Phylogenies , 2006, Evolutionary bioinformatics online.

[104]  M. Kiefmann,et al.  Mesozoic retroposons reveal parrots as the closest living relatives of passerine birds , 2011, Nature communications.