Tree of Life Based on Genome Context Networks

Efforts in phylogenomics have greatly improved our understanding of the backbone tree of life. However, due to the systematic error in sequence data, a sequence-based phylogenomic approach leads to well-resolved but statistically significant incongruence. Thus, independent test of current phylogenetic knowledge is required. Here, we have devised a distance-based strategy to reconstruct a highly resolved backbone tree of life, on the basis of the genome context networks of 195 fully sequenced representative species. Along with strongly supporting the monophylies of three superkingdoms and most taxonomic sub-divisions, the derived tree also suggests some intriguing results, such as high G+C gram positive origin of Bacteria, classification of Symbiobacterium thermophilum and Alcanivorax borkumensis in Firmicutes. Furthermore, simulation analyses indicate that addition of more gene relationships with high accuracy can greatly improve the resolution of the phylogenetic tree. Our results demonstrate the feasibility of the reconstruction of highly resolved phylogenetic tree with extensible gene networks across all three domains of life. This strategy also implies that the relationships between the genes (gene network) can define what kind of species it is.

[1]  M. Gouy,et al.  A nonhyperthermophilic common ancestor to extant life forms. , 1999, Science.

[2]  W. Doolittle,et al.  Prokaryotic evolution in light of gene transfer. , 2002, Molecular biology and evolution.

[3]  M. P. Cummings PHYLIP (Phylogeny Inference Package) , 2004 .

[4]  J. Lobry,et al.  Relationships Between Genomic G+C Content, RNA Secondary Structures, and Optimal Growth Temperature in Prokaryotes , 1997, Journal of Molecular Evolution.

[5]  Lindell Bromham,et al.  Molecular dating when rates vary. , 2005, Trends in ecology & evolution.

[6]  Matteo Pellegrini,et al.  Prolinks: a database of protein functional linkages derived from coevolution , 2004, Genome Biology.

[7]  Runsheng Chen,et al.  Phylophenetic properties of metabolic pathway topologies as revealed by global analysis , 2006, BMC Bioinformatics.

[8]  Ivo L. Hofacker,et al.  Vienna RNA secondary structure server , 2003, Nucleic Acids Res..

[9]  C. Woese,et al.  Bacterial evolution , 1987, Microbiological reviews.

[10]  Doolittle Wf Phylogenetic Classification and the Universal Tree , 1999 .

[11]  A. L. Koch Were Gram-positive rods the first bacteria? , 2003, Trends in microbiology.

[12]  B. Snel,et al.  Toward Automatic Reconstruction of a Highly Resolved Tree of Life , 2006, Science.

[13]  L. Mai The Delphic boat: What genomes tell us , 2003 .

[14]  Alan Saghatelian,et al.  Erratum: Corrigendum: Assignment of protein function in the postgenomic era , 2005, Nature chemical biology.

[15]  C. Woese A New Biology for a New Century , 2004, Microbiology and Molecular Biology Reviews.

[16]  J. Whitfield Linnaeus at 300: We are family , 2007, Nature.

[17]  C. Kurland,et al.  What tangled web: barriers to rampant horizontal gene transfer. , 2005, BioEssays : news and reviews in molecular, cellular and developmental biology.

[18]  G. Caetano-Anollés,et al.  Global phylogeny determined by the combination of protein domains in proteomes. , 2006, Molecular biology and evolution.

[19]  M. Gouy,et al.  HOBACGEN: database system for comparative genomics in bacteria. , 2000, Genome research.

[20]  James R. Brown Ancient horizontal gene transfer , 2003, Nature Reviews Genetics.

[21]  Aleksey Y. Ogurtsov,et al.  Analysis of internal loops within the RNA secondary structure in almost quadratic time , 2006, Bioinform..

[22]  Junhyong Kim,et al.  The Cobweb of Life Revealed by Genome-Scale Estimates of Horizontal Gene Transfer , 2005, PLoS biology.

[23]  N. Saitou,et al.  The neighbor-joining method: a new method for reconstructing phylogenetic trees. , 1987, Molecular biology and evolution.

[24]  T. Cavalier-smith,et al.  Rooting the Eukaryote Tree by Using a Derived Gene Fusion , 2002, Science.

[25]  F. Delsuc,et al.  Phylogenomics and the reconstruction of the tree of life , 2005, Nature Reviews Genetics.

[26]  Paramvir S. Dehal,et al.  Two Rounds of Whole Genome Duplication in the Ancestral Vertebrate , 2005, PLoS biology.

[27]  Joshua M. Stuart,et al.  A Gene-Coexpression Network for Global Discovery of Conserved Genetic Modules , 2003, Science.

[28]  D. Eisenberg,et al.  A combined algorithm for genome-wide prediction of protein function , 1999, Nature.

[29]  D. Segrè,et al.  Supporting Online Material Materials and Methods Tables S1 and S2 References the Effect of Oxygen on Biochemical Networks and the Evolution of Complex Life , 2022 .

[30]  J. Andersson,et al.  Insights into the evolutionary process of genome degradation. , 1999, Current opinion in genetics & development.

[31]  K. Timmis,et al.  Genome sequence of the ubiquitous hydrocarbon-degrading marine bacterium Alcanivorax borkumensis , 2006, Nature Biotechnology.

[32]  M. Hattori,et al.  Genome sequence of Symbiobacterium thermophilum, an uncultivable bacterium that depends on microbial commensalism. , 2004, Nucleic acids research.

[33]  R. Lewontin Directions in evolutionary biology. , 2002, Annual review of genetics.

[34]  N. Grishin,et al.  Genome trees and the tree of life. , 2002, Trends in genetics : TIG.

[35]  C. Kurland,et al.  Horizontal gene transfer: A critical view , 2003 .

[36]  D. Eisenberg,et al.  Protein function in the post-genomic era , 2000, Nature.

[37]  W. Martin,et al.  Out of Thin Air , 2008, Science.

[38]  S. Garcia-Vallvé,et al.  Horizontal gene transfer in bacterial and archaeal complete genomes. , 2000, Genome research.

[39]  David P. Kreil,et al.  Identification of thermophilic species by the amino acid compositions deduced from their genomes. , 2001, Nucleic acids research.

[40]  J. Wootton,et al.  Analysis of compositionally biased regions in sequence databases. , 1996, Methods in enzymology.

[41]  A. Barabasi,et al.  Comparable system-level organization of Archaea and Eukaryotes , 2001, Nature Genetics.

[42]  P. Holland,et al.  Rare genomic changes as a tool for phylogenetics. , 2000, Trends in ecology & evolution.

[43]  Antoine Danchin,et al.  The Delphic Boat: What Genomes Tell Us , 2002 .

[44]  Ambuj K. Singh,et al.  Deriving phylogenetic trees from the similarity analysis of metabolic pathways , 2003, ISMB.

[45]  F. Delsuc,et al.  Phylogenomics: the beginning of incongruence? , 2006, Trends in genetics : TIG.

[46]  Dieter Söll,et al.  The genome of Nanoarchaeum equitans: Insights into early archaeal evolution and derived parasitism , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[47]  M. J. Park,et al.  Symbiobacterium thermophilum gen. nov., sp. nov., a symbiotic thermophile that depends on co-culture with a Bacillus strain for growth. , 2000, International journal of systematic and evolutionary microbiology.

[48]  J. Lake,et al.  Deriving the genomic tree of life in the presence of horizontal gene transfer: conditioned reconstruction. , 2004, Molecular biology and evolution.

[49]  Michael J. Stanhope,et al.  Universal trees based on large combined protein sequence data sets , 2001, Nature Genetics.

[50]  Shailesh V. Date,et al.  A Probabilistic Functional Network of Yeast Genes , 2004, Science.

[51]  L. Orgel,et al.  Phylogenetic Classification and the Universal Tree , 1999 .