Harvesting evolutionary signals in a forest of prokaryotic gene trees.

Phylogenomic studies produce increasingly large phylogenetic forests of trees with patchy taxonomical sampling. Typically, prokaryotic data generate thousands of gene trees of all sizes that are difficult, if not impossible, to root. Their topologies do not match the genealogy of lineages, as they are influenced not only by duplication, losses, and vertical descent but also by lateral gene transfer (LGT) and recombination. Because this complexity in part reflects the diversity of evolutionary processes, the study of phylogenetic forests is thus a great opportunity to improve our understanding of prokaryotic evolution. Here, we show how the rich evolutionary content of such novel phylogenetic objects can be exploited through the development of new approaches designed specifically for extracting the multiple evolutionary signals present in the forest of life, that is, by slicing up trees into remarkable bits and pieces: clans, slices, and clips. We harvested a forest of 6,901 unrooted gene trees comprising up to 100 prokaryotic genomes (41 archaea and 59 bacteria) to search for evolutionary events that a species tree would not account for. We identified 1) trees and partitions of trees that reflected the lifestyle of organisms rather than their taxonomy, 2) candidate lifestyle-specific genetic modules, used by distinct unrelated organisms to adapt to the same environment, 3) gene families, nonrandomly distributed in the functional space, that were frequently exchanged between archaea and bacteria, sometimes without major changes in their sequences. Finally, 4) we reconstructed polarized networks of genetic partnerships between archaea and bacteria to describe some of the rules affecting LGT between these two Domains.

[1]  M. Santana,et al.  The adaptive genome of Desulfovibrio vulgaris Hildenborough. , 2006, FEMS microbiology letters.

[2]  J. Peter Gogarten,et al.  Intertwined Evolutionary Histories of Marine Synechococcus and Prochlorococcus marinus , 2009, Genome biology and evolution.

[3]  François-Joseph Lapointe,et al.  Clanistics: a multi-level perspective for harvesting unrooted gene trees. , 2010, Trends in microbiology.

[4]  H. Matsuda,et al.  Biased biological functions of horizontally transferred genes in prokaryotic genomes , 2004, Nature Genetics.

[5]  Søren J. Sørensen,et al.  Conjugative plasmids: vessels of the communal gene pool , 2009, Philosophical Transactions of the Royal Society B: Biological Sciences.

[6]  T. Sekizuka,et al.  Cloning, sequencing and characterization of a urease gene operon from urease‐positive thermophilic Campylobacter (UPTC) , 2007, Journal of applied microbiology.

[7]  Alfons J. M. Stams,et al.  Atypical one-carbon metabolism of an acetogenic and hydrogenogenic Moorella thermoacetica strain , 2009, Archives of Microbiology.

[8]  J. Overmann,et al.  Ultrastructural Characterization of the Prokaryotic Symbiosis in “Chlorochromatium aggregatum” , 2008, Journal of bacteriology.

[9]  J. Lawrence,et al.  Phylogenetic incongruence arising from fragmented speciation in enteric bacteria , 2010, Proceedings of the National Academy of Sciences.

[10]  G. Moreno-Hagelsieb,et al.  Beyond the bounds of orthology: functional inference from metagenomic context. , 2010, Molecular bioSystems.

[11]  Sarita Ranjan,et al.  Prediction of DtxR regulon: Identification of binding sites and operons controlled by Diphtheria toxin repressor in Corynebacterium diphtheriae , 2004, BMC Microbiology.

[12]  R. Overbeek,et al.  The genome of Methanosarcina mazei: evidence for lateral gene transfer between bacteria and archaea. , 2002, Journal of molecular microbiology and biotechnology.

[13]  W. Doolittle,et al.  Lateral gene transfer and the origins of prokaryotic groups. , 2003, Annual review of genetics.

[14]  John Boyle,et al.  Cytoscape: a community-based framework for network modeling. , 2009, Methods in molecular biology.

[15]  Florent E. Angly,et al.  Comparative Metagenomics Reveals Host Specific Metavirulomes and Horizontal Gene Transfer Elements in the Chicken Cecum Microbiome , 2008, PloS one.

[16]  H. Ochman,et al.  Lateral gene transfer and the nature of bacterial innovation , 2000, Nature.

[17]  W. Doolittle,et al.  The genome of Salinibacter ruber: convergence and gene exchange among hyperhalophilic bacteria and archaea. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[18]  T. Gaasterland,et al.  Microbial genescapes: phyletic and functional patterns of ORF distribution among prokaryotes. , 1998, Microbial & comparative genomics.

[19]  Eric Bapteste,et al.  On the need for integrative phylogenomics, and some steps toward its creation , 2010 .

[20]  Pietro Liò,et al.  Analysis of plasmid genes by phylogenetic profiling and visualization of homology relationships using Blast2Network , 2008, BMC Bioinformatics.

[21]  D. Eisenberg,et al.  Assigning protein functions by comparative genome analysis: protein phylogenetic profiles. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Vincent J. Denef,et al.  Strain-resolved community proteomics reveals recombining genomes of acidophilic bacteria , 2007, Nature.

[23]  I. Paulsen,et al.  Coastal Synechococcus metagenome reveals major roles for horizontal gene transfer and plasmids in population diversity. , 2009, Environmental microbiology.

[24]  Hongwei Wu,et al.  Association analysis of the general environmental conditions and prokaryotes' gene distributions in various functional groups. , 2010, Genomics.

[25]  S. Sonea,et al.  Evolution of the genomic systems of prokaryotes and its momentous consequences , 2001, International microbiology : the official journal of the Spanish Society for Microbiology.

[26]  A. Walsby,et al.  Gas vesicles , 1994, Microbiological reviews.

[27]  M. Ragan,et al.  Lateral genetic transfer: open issues , 2009, Philosophical Transactions of the Royal Society B: Biological Sciences.

[28]  Otto X. Cordero,et al.  The impact of long-distance horizontal gene transfer on prokaryotic genome size , 2009, Proceedings of the National Academy of Sciences.

[29]  Robert Gentleman,et al.  Using GOstats to test gene lists for GO term association , 2007, Bioinform..

[30]  Damian Szklarczyk,et al.  eggNOG v2.0: extending the evolutionary genealogy of genes with enhanced non-supervised orthologous groups, species and functional annotations , 2009, Nucleic Acids Res..

[31]  E. Koonin,et al.  Search for a 'Tree of Life' in the thicket of the phylogenetic forest , 2009, Journal of biology.

[32]  Paul Wilmes,et al.  The dynamic genetic repertoire of microbial communities , 2009, FEMS microbiology reviews.

[33]  W. Martin,et al.  Getting a better picture of microbial evolution en route to a network of genomes , 2009, Philosophical Transactions of the Royal Society B: Biological Sciences.

[34]  Michael Shmoish,et al.  Potential photosynthesis gene recombination between Prochlorococcus and Synechococcus via viral intermediates. , 2005, Environmental microbiology.

[35]  Mihai Pop,et al.  Microbiome Metagenomic Analysis of the Human Distal Gut , 2009 .

[36]  W. Doolittle,et al.  Lateral gene transfer , 2011, Current Biology.

[37]  E. Koonin,et al.  The phylogenetic forest and the quest for the elusive tree of life. , 2009, Cold Spring Harbor symposia on quantitative biology.

[38]  R. O'HARA,et al.  Population thinking and tree thinking in systematics , 1997 .

[39]  S. Salzberg,et al.  Evidence for lateral gene transfer between Archaea and Bacteria from genome sequence of Thermotoga maritima , 1999, Nature.

[40]  Eric Bapteste,et al.  Network analyses structure genetic diversity in independent genetic worlds , 2009, Proceedings of the National Academy of Sciences.

[41]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[42]  Konstantinos Mavromatis,et al.  Microbial co-habitation and lateral gene transfer: what transposases can tell us , 2009, Genome Biology.

[43]  Robert G. Beiko,et al.  Distinguishing Microbial Genome Fragments Based on Their Composition: Evolutionary and Comparative Genomic Perspectives , 2010, Genome biology and evolution.

[44]  J. Shaffer Multiple Hypothesis Testing , 1995 .

[45]  W. Hennig Phylogenetic Systematics , 2002 .

[46]  Eric Bapteste,et al.  INAUGURAL ARTICLE by a Recently Elected Academy Member:Pattern pluralism and the Tree of Life hypothesis , 2007 .

[47]  Andrew C. Tolonen,et al.  Transfer of photosynthesis genes to and from Prochlorococcus viruses. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[48]  J. Lawrence,et al.  The interplay of homologous recombination and horizontal gene transfer in bacterial speciation. , 2009, Methods in molecular biology.

[49]  Maureen A. O’Malley,et al.  Prokaryotic evolution and the tree of life are two different things , 2009, Biology Direct.

[50]  W. Doolittle,et al.  The practice of classification and the theory of evolution, and what the demise of Charles Darwin's tree of life hypothesis means for both of them , 2009, Philosophical Transactions of the Royal Society B: Biological Sciences.

[51]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[52]  Manesh Shah,et al.  Genome divergence in two Prochlorococcus ecotypes reflects oceanic niche differentiation , 2003, Nature.

[53]  M. Pop,et al.  Metagenomic Analysis of the Human Distal Gut Microbiome , 2006, Science.

[54]  W. Doolittle,et al.  Eradicating typological thinking in prokaryotic systematics and evolution. , 2009, Cold Spring Harbor symposia on quantitative biology.

[55]  W. Martin,et al.  The tree of one percent , 2006, Genome Biology.

[56]  E. Nevo,et al.  Different Clustering of Genomes Across Life Using the A-T-C-G and Degenerate R-Y Alphabets: Early and Late Signaling on Genome Evolution? , 2007, Journal of Molecular Evolution.

[57]  Rainer Merkl,et al.  A Comparative Categorization of Protein Function Encoded in Bacterial or Archeal Genomic Islands , 2004, Journal of Molecular Evolution.

[58]  Olga Zhaxybayeva,et al.  On the chimeric nature, thermophilic origin, and phylogenetic placement of the Thermotogales , 2009, Proceedings of the National Academy of Sciences.

[59]  L. Mcdaniel,et al.  High Frequency of Horizontal Gene Transfer in the Oceans , 2010, Science.

[60]  B. Snel,et al.  Function prediction and protein networks. , 2003, Current opinion in cell biology.

[61]  Tal Dagan,et al.  Modular networks and cumulative impact of lateral transfer in prokaryote genome evolution , 2008, Proceedings of the National Academy of Sciences.

[62]  Matthias E. Futschik,et al.  Genome-wide expression dynamics of a marine virus and host reveal features of co-evolution , 2007, Nature.

[63]  D. Hull Are Species Really Individuals , 1976 .

[64]  Mark Wilkinson,et al.  Of clades and clans: terms for phylogenetic relationships in unrooted trees. , 2007, Trends in ecology & evolution.

[65]  Henk Bolhuis,et al.  Environmental genomics of "Haloquadratum walsbyi" in a saltern crystallizer indicates a large pool of accessory genes in an otherwise coherent species , 2006, BMC Genomics.

[66]  Douglas A. Wolfe,et al.  Nonparametrics: Statistical Methods Based on Ranks and Its Impact on the Field of Nonparametric Statistics , 2012 .

[67]  Klaus Peter Schliep,et al.  phangorn: phylogenetic analysis in R , 2010, Bioinform..

[68]  D. Marco Metagenomics : theory, methods and applications , 2010 .

[69]  W. Doolittle,et al.  Lateral genomics. , 1999, Trends in cell biology.

[70]  E. Koonin,et al.  The Tree and Net Components of Prokaryote Evolution , 2010, Genome biology and evolution.

[71]  Gipsi Lima-Mendez,et al.  Reticulate representation of evolutionary and functional relationships between phage genomes. , 2008, Molecular biology and evolution.

[72]  Maureen L. Coleman,et al.  Ecosystem-specific selection pressures revealed through comparative population genomics , 2010, Proceedings of the National Academy of Sciences.

[73]  Rick L. Stevens,et al.  Functional metagenomic profiling of nine biomes , 2008, Nature.

[74]  J. Overmann,et al.  Identification and analysis of four candidate symbiosis genes from 'Chlorochromatium aggregatum', a highly developed bacterial symbiosis. , 2008, Environmental microbiology.

[75]  D. Lipman,et al.  A genomic perspective on protein families. , 1997, Science.

[76]  Yan Boucher,et al.  Lateral gene transfer challenges principles of microbial systematics. , 2008, Trends in microbiology.

[77]  S. Lucas,et al.  Whole-Genome Analysis of the Methyl tert-Butyl Ether-Degrading Beta-Proteobacterium Methylibium petroleiphilum PM1 , 2006, Journal of bacteriology.

[78]  Jillian F. Banfield,et al.  Genome dynamics in a natural archaeal population , 2007, Proceedings of the National Academy of Sciences.

[79]  R. Sokal,et al.  Numerical Taxonomy: The Principles and Practice of Numerical Classification. , 1975 .

[80]  J. McInerney,et al.  The prokaryotic tree of life: past, present... and future? , 2008, Trends in ecology & evolution.