Persistence drives gene clustering in bacterial genomes

BackgroundGene clustering plays an important role in the organization of the bacterial chromosome and several mechanisms have been proposed to explain its extent. However, the controversies raised about the validity of each of these mechanisms remind us that the cause of this gene organization remains an open question. Models proposed to explain clustering did not take into account the function of the gene products nor the likely presence or absence of a given gene in a genome. However, genomes harbor two very different categories of genes: those genes present in a majority of organisms – persistent genes – and those present in very few organisms – rare genes.ResultsWe show that two classes of genes are significantly clustered in bacterial genomes: the highly persistent and the rare genes. The clustering of rare genes is readily explained by the selfish operon theory. Yet, genes persistently present in bacterial genomes are also clustered and we try to understand why. We propose a model accounting specifically for such clustering, and show that indispensability in a genome with frequent gene deletion and insertion leads to the transient clustering of these genes. The model describes how clusters are created via the gene flux that continuously introduces new genes while deleting others. We then test if known selective processes, such as co-transcription, physical interaction or functional neighborhood, account for the stabilization of these clusters.ConclusionWe show that the strong selective pressure acting on the function of persistent genes, in a permanent state of flux of genes in bacterial genomes, maintaining their size fairly constant, that drives persistent genes clustering. A further selective stabilization process might contribute to maintaining the clustering.

[1]  S. R. Jammalamadaka,et al.  Topics in Circular Statistics , 2001 .

[2]  B. Snel,et al.  Conservation of gene order: a fingerprint of proteins that physically interact. , 1998, Trends in biochemical sciences.

[3]  Csaba Pál,et al.  Evidence against the selfish operon theory. , 2004, Trends in genetics : TIG.

[4]  Maria J Martin,et al.  Comparing bacterial genomes through conservation profiles. , 2003, Genome research.

[5]  Eduardo P C Rocha,et al.  Replication‐associated gene dosage effects shape the genomes of fast‐growing bacteria but only for transcription and translation genes , 2006, Molecular microbiology.

[6]  E. Rocha DNA repeats lead to the accelerated loss of gene order in bacteria. , 2003, Trends in genetics : TIG.

[7]  J. Monod,et al.  Genetic regulatory mechanisms in the synthesis of proteins. , 1961, Journal of Molecular Biology.

[8]  H. Mori,et al.  Evolutionary instability of operon structures disclosed by sequence comparisons of complete microbial genomes. , 1999, Molecular biology and evolution.

[9]  S. G. Stephens Possible significances of duplication in evolution. , 1951, Advances in Genetics.

[10]  P. Bork,et al.  Identification and analysis of evolutionarily cohesive functional modules in protein networks. , 2006, Genome research.

[11]  Julio Collado-Vides,et al.  RegulonDB (version 5.0): Escherichia coli K-12 transcriptional regulatory network, operon organization, and growth conditions , 2005, Nucleic Acids Res..

[12]  Hiroshi Mizoguchi,et al.  Cell size and nucleoid organization of engineered Escherichia coli cells with a reduced genome , 2004, Molecular microbiology.

[13]  D. Lipman,et al.  A genomic perspective on protein families. , 1997, Science.

[14]  A. Emili,et al.  Interaction network containing conserved and essential protein complexes in Escherichia coli , 2005, Nature.

[15]  Hongwei Wu,et al.  Detecting uber-operons in prokaryotic genomes , 2006, Nucleic acids research.

[16]  Katherine H. Huang,et al.  Operon formation is driven by co-regulation and not by horizontal gene transfer. , 2005, Genome research.

[17]  J. Monod,et al.  [Operon: a group of genes with the expression coordinated by an operator]. , 1960, Comptes rendus hebdomadaires des seances de l'Academie des sciences.

[18]  P Guerdoux-Jamet,et al.  Indigo: a World-Wide-Web review of genomes and gene functions. , 1998, FEMS microbiology reviews.

[19]  E. Rocha Inference and analysis of the relative stability of bacterial chromosomes. , 2006, Molecular biology and evolution.

[20]  P Bork,et al.  Gene context conservation of a higher order than operons. , 2000, Trends in biochemical sciences.

[21]  J R Roth,et al.  Selfish operons: horizontal transfer may drive the evolution of gene clusters. , 1996, Genetics.

[22]  Florence Tama,et al.  Structure of the E. coli protein-conducting channel bound to a translating ribosome , 2005, Nature.

[23]  E. Lewis Pseudoallelism and gene evolution. , 1951, Cold Spring Harbor symposia on quantitative biology.

[24]  J. H. Zar,et al.  Biostatistical Analysis (5th Edition) , 1984 .

[25]  H. Ochman,et al.  Evolutionary dynamics of full genome content in Escherichia coli , 2000, The EMBO journal.

[26]  Antoine Danchin,et al.  How essential are nonessential genes? , 2005, Molecular biology and evolution.

[27]  Temple F. Smith,et al.  Operons in Escherichia coli: genomic analyses and predictions. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[28]  N. Moran,et al.  Evolutionary Origins of Genomic Repertoires in Bacteria , 2005, PLoS biology.

[29]  E. Koonin,et al.  Genome alignment, evolution of prokaryotic genome organization, and prediction of gene function using genomic context. , 2001, Genome research.

[30]  S. Eriksson,et al.  Bacterial genome size reduction by experimental evolution. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[31]  J. W. Campbell,et al.  Experimental Determination and System Level Analysis of Essential Genes in Escherichia coli MG1655 , 2003, Journal of bacteriology.

[32]  M. Riley,et al.  Organization of the bacterial chromosome , 1990, Microbiological reviews.

[33]  S. Ehrlich,et al.  Essential Bacillus subtilis genes , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[34]  J. Changeux,et al.  Selective stabilisation of developing synapses as a mechanism for the specification of neuronal networks , 1976, Nature.

[35]  S. G. Stephens Possible Significance of Duplication in Evolution , 1951 .

[36]  A. Danchin,et al.  Specialized microbial databases for inductive exploration of microbial genome sequences , 2005, BMC Genomics.

[37]  Eugene V Koonin,et al.  Connected gene neighborhoods in prokaryotic genomes. , 2002, Nucleic acids research.

[38]  Eduardo P C Rocha,et al.  Order and disorder in bacterial genomes. , 2004, Current opinion in microbiology.

[39]  George M. Church,et al.  Estimating and improving protein interaction error rates , 2004 .

[40]  J. Parkhill,et al.  Comparative genomic structure of prokaryotes. , 2004, Annual review of genetics.

[41]  Warren C. Lathe,et al.  Predicting protein function by genomic context: quantitative evaluation and qualitative inferences. , 2000, Genome research.

[42]  Javier Tamames,et al.  Evolution of gene order conservation in prokaryotes , 2001, Genome Biology.

[43]  John C. Wyngaard,et al.  Structure of the PBL , 1988 .

[44]  P. Bork,et al.  Analysis of genomic context: prediction of functional associations from conserved bidirectionally transcribed gene pairs , 2004, Nature Biotechnology.

[45]  P. Bork,et al.  Measuring genome evolution. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[46]  Claudio Agostinelli,et al.  circular: Circular Statistics, from "Topics in circular Statistics" (2001) S. Rao Jammalamadaka and A. SenGupta, World Scientific. , 2004 .

[47]  R. Overbeek,et al.  The use of gene clusters to infer functional coupling. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[48]  S. Andersson,et al.  Microbial genome evolution: sources of variability. , 2002, Current opinion in microbiology.

[49]  Michael Y. Galperin,et al.  Who's your neighbor? New computational approaches for functional genomics , 2000, Nature Biotechnology.

[50]  A. Valencia,et al.  Analysis of the Cellular Functions of Escherichia coli Operons and Their Conservation in Bacillus subtilis , 2002, Journal of Molecular Evolution.

[51]  S. Salzberg,et al.  Prediction of transcription terminators in bacterial genomes. , 2000, Journal of molecular biology.