Horizontal gene transfers as metagenomic gene duplications.

While it is well accepted that horizontal gene transfer plays an important role in the evolution and the diversification of prokaryotic genomes, many questions remain open regarding its functional mechanisms of action and its interplay with the extant genome. This study addresses the relationship between proteome innovation by horizontal gene transfer and genome content in Proteobacteria. We characterize the transferred genes, focusing on the protein domain compositions and their relationships with the existing protein domain superfamilies in the genome. In agreement with previous observations, we find that the protein domain architectures of horizontally transferred genes are significantly shorter than the genomic average. Furthermore, protein domains that are more common in the total pool of genomes appear to have a proportionally higher chance to be transferred. This suggests that transfer events behave as if they were drawn randomly from a cross-genomic community gene pool, much like gene duplicates are drawn from a genomic gene pool. Finally, horizontally transferred genes carry domains of exogenous families less frequently for larger genomes, although they might do it more than expected by chance.

[1]  E. Koonin,et al.  The Impact of Comparative Genomics on Our Understanding of Evolution , 2000, Cell.

[2]  E. Koonin,et al.  Genomics of bacteria and archaea: the emerging dynamic view of the prokaryotic world , 2008, Nucleic acids research.

[3]  C. Pál,et al.  Adaptive evolution of bacterial metabolic networks by horizontal gene transfer , 2005, Nature Genetics.

[4]  Cyrus Chothia,et al.  The SUPERFAMILY database in 2007: families and functions , 2006, Nucleic Acids Res..

[5]  M. Huynen,et al.  The frequency distribution of gene family sizes in complete genomes. , 1998, Molecular biology and evolution.

[6]  Erik van Nimwegen,et al.  The evolution of domain-content in bacterial genomes , 2008, Biology Direct.

[7]  Tatiana A. Tatusova,et al.  The National Center for Biotechnology Information's Protein Clusters Database , 2008, Nucleic Acids Res..

[8]  E. Koonin,et al.  Horizontal gene transfer in prokaryotes: quantification and classification. , 2001, Annual review of microbiology.

[9]  R. Stein,et al.  On the need for widespread horizontal gene transfers under genome size constraint , 2009, Biology Direct.

[10]  M. Gerstein,et al.  The dominance of the population by a selected few: power-law behaviour applies to a wide variety of genomic properties , 2002, Genome Biology.

[11]  H. Ochman,et al.  Lateral gene transfer and the nature of bacterial innovation , 2000, Nature.

[12]  L. Holm,et al.  The Pfam protein families database , 2005, Nucleic Acids Res..

[13]  Uri Gophna,et al.  Complexity, connectivity, and duplicability as barriers to lateral gene transfer , 2007, Genome Biology.

[14]  Santiago Garcia-Vallvé,et al.  HGT-DB: a database of putative horizontally transferred genes in prokaryotic complete genomes , 2003, Nucleic Acids Res..

[15]  Eugene V. Koonin,et al.  Are There Laws of Genome Evolution? , 2011, PLoS Comput. Biol..

[16]  Tim J. P. Hubbard,et al.  SCOP database in 2004: refinements integrate structure and sequence family data , 2004, Nucleic Acids Res..

[17]  Cheryl P. Andam,et al.  Biased gene transfer in microbial evolution , 2011, Nature Reviews Microbiology.

[18]  J. Lake,et al.  Genomic evidence for two functionally distinct gene classes. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[19]  A. Elofsson,et al.  What properties characterize the hub proteins of the protein-protein interaction network of Saccharomyces cerevisiae? , 2006, Genome Biology.

[20]  C. Pál,et al.  Integration of horizontally transferred genes into regulatory interaction networks takes many million years. , 2008, Molecular biology and evolution.

[21]  C. Chothia,et al.  Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure. , 2001, Journal of molecular biology.

[22]  J. Townsend,et al.  Horizontal gene transfer, genome innovation and evolution , 2005, Nature Reviews Microbiology.

[23]  Bruno Bassetti,et al.  Universal features in the genome-level evolution of protein domains , 2008, Genome Biology.

[24]  Rajeev K. Azad,et al.  Detecting laterally transferred genes: use of entropic clustering methods and genome position , 2007, Nucleic acids research.

[25]  E. Rocha,et al.  Horizontal Transfer, Not Duplication, Drives the Expansion of Protein Families in Prokaryotes , 2011, PLoS genetics.

[26]  J. Lake,et al.  Horizontal gene transfer among genomes: the complexity hypothesis. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[27]  E. Nimwegen Scaling Laws in the Functional Content of Genomes , 2003, physics/0307001.

[28]  Robert D. Finn,et al.  The Pfam protein families database , 2004, Nucleic Acids Res..

[29]  C. Orengo,et al.  Protein families and their evolution-a structural perspective. , 2005, Annual review of biochemistry.

[30]  A. Emili,et al.  Global Functional Atlas of Escherichia coli Encompassing Previously Uncharacterized Proteins , 2009, PLoS biology.

[31]  M. Gerstein,et al.  Protein family and fold occurrence in genomes: power-law behaviour and evolutionary model. , 2001, Journal of molecular biology.