Quest for Orthologs Entails Quest for Tree of Life: In Search of the Gene Stream

Quest for Orthologs (QfO) is a community effort with the goal to improve and benchmark orthology predictions. As quality assessment assumes prior knowledge on species phylogenies, we investigated the congruency between existing species trees by comparing the relationships of 147 QfO reference organisms from six Tree of Life (ToL)/species tree projects: The National Center for Biotechnology Information (NCBI) taxonomy, Opentree of Life, the sequenced species/species ToL, the 16S ribosomal RNA (rRNA) database, and trees published by Ciccarelli et al. (Ciccarelli FD, et al. 2006. Toward automatic reconstruction of a highly resolved tree of life. Science 311:1283–1287) and by Huerta-Cepas et al. (Huerta-Cepas J, Marcet-Houben M, Gabaldon T. 2014. A nested phylogenetic reconstruction approach provides scalable resolution in the eukaryotic Tree Of Life. PeerJ PrePrints 2:223) Our study reveals that each species tree suggests a different phylogeny: 87 of the 146 (60%) possible splits of a dichotomous and rooted tree are congruent, while all other splits are incongruent in at least one of the species trees. Topological differences are observed not only at deep speciation events, but also within younger clades, such as Hominidae, Rodentia, Laurasiatheria, or rosids. The evolutionary relationships of 27 archaea and bacteria are highly inconsistent. By assessing 458,108 gene trees from 65 genomes, we show that consistent species topologies are more often supported by gene phylogenies than contradicting ones. The largest concordant species tree includes 77 of the QfO reference organisms at the most. Results are summarized in the form of a consensus ToL (http://swisstree.vital-it.ch/species_tree) that can serve different benchmarking purposes.

[1]  Purificación López-García,et al.  Extending the conserved phylogenetic core of archaea disentangles the evolution of the third domain of life. , 2015, Molecular biology and evolution.

[2]  Md. Shamsuzzoha Bayzid,et al.  Whole-genome analyses resolve early branches in the tree of life of modern birds , 2014, Science.

[3]  Thomas K. F. Wong,et al.  Phylogenomics resolves the timing and pattern of insect evolution , 2014, Science.

[4]  Peer Bork,et al.  A Phylogeny-Based Benchmarking Test for Orthology Inference Reveals the Limitations of Function-Based Validation , 2014, PloS one.

[5]  Hong Ma,et al.  Resolution of deep angiosperm phylogeny using conserved nuclear genes and estimates of early divergence times , 2014, Nature Communications.

[6]  Maria Jesus Martin,et al.  Big data and other challenges in the quest for orthologs , 2014, Bioinform..

[7]  Andrew S. Burrell,et al.  Primate phylogenetic relationships and divergence dates inferred from complete mitochondrial genomes. , 2014, Molecular phylogenetics and evolution.

[8]  S. Baldauf,et al.  An Alternative Root for the Eukaryote Tree of Life , 2014, Current Biology.

[9]  J. Huerta-Cepas,et al.  A nested phylogenetic reconstruction approach provides scalable resolution in the eukaryotic Tree Of Life , 2014 .

[10]  Pelin Yilmaz,et al.  The SILVA and “All-species Living Tree Project (LTP)” taxonomic frameworks , 2013, Nucleic Acids Res..

[11]  Salvador Capella-Gutiérrez,et al.  PhylomeDB v4: zooming into the plurality of evolutionary histories of a genome , 2013, Nucleic Acids Res..

[12]  S. Baldauf,et al.  Did Terrestrial Diversification of Amoebas (Amoebozoa) Occur in Synchrony with Land Plants? , 2013, PloS one.

[13]  P. Bork,et al.  Accurate and universal delineation of prokaryotic species , 2013, Nature Methods.

[14]  Alexandros Stamatakis,et al.  A daily-updated tree of (sequenced) life as a reference for genome research , 2013, Scientific Reports.

[15]  Jonathan A. Eisen,et al.  Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees and Supermatrices , 2013, PloS one.

[16]  M. Maldonado,et al.  Deep metazoan phylogeny: when different genes tell different stories. , 2013, Molecular phylogenetics and evolution.

[17]  Yu Lin,et al.  A Metric for Phylogenetic Trees Based on Matching , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[18]  P. Keeling,et al.  The evolutionary history of haptophytes and cryptophytes: phylogenomic evidence for separate origins , 2012, Proceedings of the Royal Society B: Biological Sciences.

[19]  T. Gabaldón,et al.  Phylogenomics supports microsporidia as the earliest diverging clade of sequenced fungi , 2012, BMC Biology.

[20]  Javier Herrero,et al.  Toward community standards in the quest for orthologs , 2012, Bioinform..

[21]  Scott Federhen,et al.  The NCBI Taxonomy database , 2011, Nucleic Acids Res..

[22]  A. von Haeseler,et al.  A Consistent Phylogenetic Backbone for the Fungi , 2011, Molecular biology and evolution.

[23]  Xuming Zhou,et al.  Phylogenomic Analysis Resolves the Interordinal Relationships and Rapid Diversification of the Laurasiatherian Mammals , 2011, Systematic biology.

[24]  P. Bork,et al.  Orthology prediction methods: A quality assessment using curated protein families , 2011, BioEssays : news and reviews in molecular, cellular and developmental biology.

[25]  Gary W. Jones,et al.  Reconstructing the Fungal Tree of Life Using Phylogenomics and a Preliminary Investigation of the Distribution of Yeast Prion-Like Proteins in the Fungal Kingdom , 2011, Journal of Molecular Evolution.

[26]  Peer Bork,et al.  Universally Distributed Single-Copy Genes Indicate a Constant Rate of Horizontal Transfer , 2011, PloS one.

[27]  Daniel J. G. Lahr,et al.  Comprehensive Phylogenetic Reconstruction of Amoebozoa Based on Concatenated Analyses of SSU-rDNA and Actin Genes , 2011, PloS one.

[28]  Raul Munoz,et al.  Release LTPs104 of the All-Species Living Tree. , 2011, Systematic and applied microbiology.

[29]  Oliver Eulenstein,et al.  Genome-scale phylogenetics: inferring the plant tree of life from 18,896 gene trees. , 2011, Systematic biology.

[30]  Toni Gabaldón,et al.  TreeKO: a duplication-aware algorithm for the comparison of phylogenetic trees , 2011, Nucleic acids research.

[31]  E. Rocha,et al.  Horizontal Transfer, Not Duplication, Drives the Expansion of Protein Families in Prokaryotes , 2011, PLoS genetics.

[32]  L. Mcdaniel,et al.  High Frequency of Horizontal Gene Transfer in the Oceans , 2010, Science.

[33]  Evgeny M. Zdobnov,et al.  The Newick utilities: high-throughput phylogenetic tree processing in the Unix shell , 2010, Bioinform..

[34]  Omar E. Cornejo,et al.  On the Diversity of Malaria Parasites in African Apes and the Origin of Plasmodium falciparum from Bonobos , 2010, PLoS pathogens.

[35]  Joaquín Dopazo,et al.  ETE: a python Environment for Tree Exploration , 2010, BMC Bioinformatics.

[36]  P. Fabre,et al.  Patterns of macroevolution among Primates inferred from a supermatrix of mitochondrial and nuclear DNA. , 2009, Molecular phylogenetics and evolution.

[37]  Albert J. Vilella,et al.  Joining forces in the quest for orthologs , 2009, Genome Biology.

[38]  L. Hug,et al.  Phylogenomic analyses support the monophyly of Excavata and resolve relationships among eukaryotic “supergroups” , 2009, Proceedings of the National Academy of Sciences.

[39]  D. Soltis,et al.  Rosid radiation and the rapid rise of angiosperm-dominated forests , 2009, Proceedings of the National Academy of Sciences.

[40]  Nicholas H. Putnam,et al.  The Trichoplax genome and the nature of placozoans , 2008, Nature.

[41]  Nicholas H. Putnam,et al.  The amphioxus genome and the evolution of the chordate karyotype , 2008, Nature.

[42]  Kamran Shalchian-Tabrizi,et al.  Multigene Phylogeny of Choanozoa and the Origin of Animals , 2008, PloS one.

[43]  D. Hillis,et al.  Taxon sampling and the accuracy of phylogenetic analyses , 2008 .

[44]  David Q. Matus,et al.  Broad phylogenomic sampling improves resolution of the animal tree of life , 2008, Nature.

[45]  G. Petsko My worries are no longer behind me , 2007, Genome Biology.

[46]  J. Dopazo,et al.  The human phylome , 2007, Genome Biology.

[47]  Yan Boucher,et al.  Use of 16S rRNA and rpoB Genes as Molecular Markers for Microbial Ecology Studies , 2006, Applied and Environmental Microbiology.

[48]  B. Snel,et al.  Toward Automatic Reconstruction of a Highly Resolved Tree of Life , 2006, Science.

[49]  Yuji Inagaki,et al.  Comprehensive multigene phylogenies of excavate protists reveal the evolutionary positions of "primitive" eukaryotes. , 2006, Molecular biology and evolution.

[50]  F. Delsuc,et al.  Tunicates and not cephalochordates are the closest living relatives of vertebrates , 2006, Nature.

[51]  Vivek Gowri-Shankar,et al.  Consideration of RNA secondary structure significantly improves likelihood-based estimates of phylogeny: examples from the bilateria. , 2005, Molecular biology and evolution.

[52]  M. Steel,et al.  Phylogenetic Super-Networks from Partial Trees , 2004 .

[53]  Daniel H. Huson,et al.  Phylogenetic super-networks from partial trees , 2004, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[54]  Yan Boucher,et al.  Phylogenetic reconstruction and lateral gene transfer. , 2004, Trends in microbiology.

[55]  Susanne Schulmeister,et al.  Inconsistency of maximum parsimony revisited. , 2004, Systematic biology.

[56]  N. Grishin,et al.  Genome trees constructed using five different approaches suggest new major bacterial clades , 2001, BMC Evolutionary Biology.

[57]  P. Lio’,et al.  Molecular phylogenetics: state-of-the-art methods for looking into the past. , 2001, Trends in genetics : TIG.

[58]  E. Koonin,et al.  Horizontal gene transfer in prokaryotes: quantification and classification. , 2001, Annual review of microbiology.

[59]  Hervé Philippe,et al.  Early–branching or fast–evolving eukaryotes? An answer based on slowly evolving positions , 2000, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[60]  J. Lake,et al.  Evidence from 18S ribosomal DNA that the lophophorates are protostome animals , 1995, Science.

[61]  D. Robinson,et al.  Comparison of phylogenetic trees , 1981 .

[62]  C. Woese,et al.  Phylogenetic structure of the prokaryotic domain: The primary kingdoms , 1977, Proceedings of the National Academy of Sciences of the United States of America.