Rooted triple consensus and anomalous gene trees

Background: Anomalous gene trees (AGTs) are gene trees whose topology differs from that of the species tree but that are more likely to be observed than congruent gene trees. In this paper we propose a rooted triple approach to finding the correct species tree in the presence of AGTs.

Results: Based on simulated data, we show that our method outperforms the extended majority rule consensus strategy while still resolving the species tree. Applying both methods to a metazoan data set of 216 genes, we tested whether AGTs substantially interfere with the reconstruction of the metazoan phylogeny.

Conclusion: Evidence of AGTs was not found in this data set, suggesting that erroneously reconstructed gene trees are the most significant challenge in reconstructing phylogenetic relationships among species with current data. The new method does, however, rule out the erroneous reconstruction of deep or poorly resolved splits in the presence of lineage sorting.
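The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea behind a rooted triple consensus, not the authors' implementation. It relies on the known coalescent result that, for any three taxa, the most probable rooted gene tree topology matches the species tree, so a majority vote per triple should not be misled by AGTs. The nested-tuple tree encoding and the function names `leaves`, `resolve_triple`, and `majority_triples` are assumptions made for illustration; assembling the winning triples into a full species tree is left out.

```python
from collections import Counter
from itertools import combinations


def leaves(tree):
    """Return the set of taxon names below a node (a leaf is a plain string)."""
    if isinstance(tree, str):
        return frozenset([tree])
    return frozenset().union(*(leaves(child) for child in tree))


def resolve_triple(tree, triple):
    """Return the pair of taxa in `triple` that are sister to each other in
    `tree`, or None if the triple is unresolved or not fully present."""
    if not triple <= leaves(tree):
        return None
    node = tree
    while not isinstance(node, str):
        hits = [leaves(child) & triple for child in node]
        # If one child still holds all three taxa, descend into it.
        full = [child for child, h in zip(node, hits) if len(h) == 3]
        if full:
            node = full[0]
            continue
        # Otherwise the child holding exactly two taxa defines the cherry.
        pairs = [h for h in hits if len(h) == 2]
        return frozenset(pairs[0]) if pairs else None
    return None


def majority_triples(gene_trees):
    """For every taxon triple, pick the sister pair seen in most gene trees.
    Ties are broken by first occurrence (an arbitrary choice for this sketch)."""
    taxa = sorted(frozenset().union(*(leaves(t) for t in gene_trees)))
    result = {}
    for triple in combinations(taxa, 3):
        votes = Counter()
        for tree in gene_trees:
            pair = resolve_triple(tree, frozenset(triple))
            if pair is not None:
                votes[pair] += 1
        if votes:
            result[triple] = votes.most_common(1)[0][0]
    return result


# Hypothetical toy input: two gene trees support ((A,B),C), one supports ((A,C),B).
gene_trees = [(("A", "B"), "C"), (("A", "B"), "C"), (("A", "C"), "B")]
consensus = majority_triples(gene_trees)
```

In this toy input, `consensus` maps the triple `("A", "B", "C")` to the pair `{"A", "B"}`, since two of the three gene trees place A and B as sisters.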
