Genomic organization of evolutionarily correlated genes in bacteria: limits and strategies.

The need for efficient molecular interplay in time and space within a cell imposes strong constraints that could be partially relaxed if relative gene positions along chromosomes were appropriate. Comparative genomics studies have demonstrated the short-scale conservation of gene proximity along bacterial chromosomes. Additionally, the long-range periodic positioning of evolutionarily correlated genes within Escherichia coli has recently been highlighted. To gain further insight into these different genetic organizations, we examined the compromise between chromosomal proximity and periodicity for all available eubacterial genomes by evaluating groups of evolutionarily correlated genes from a benchmark data set. In enterobacteria, strict chromosomal proximity is found to be limited to groups under 20 genes, whereas periodicity is significant in all groups over 50. The E. coli K12 genome bears 511 periodic genes (12% of the genome), whose orthologs are found to be periodic in all eubacterial phyla. These periodic genes predominantly function in macromolecular synthesis and spatial organization of cellular components. They are enriched in essential and housekeeping genes and tend to often be constitutively expressed. On this basis, it is argued that chromosomal proximity and periodicity are ubiquitous complementary genomic strategies that favor the build-up of local concentrations of co-functional molecules. In particular, the periodic layout may facilitate chromosome folding to spatially organize the construction of major cell components. The transition at 20 genes is reminiscent of the size of the longest operons and of topological microdomains. The range for which DNA neighborhood optimizes biochemical interactions might therefore be defined by DNA topology.

[1]  Ivan Junier,et al.  Spatial and Topological Organization of DNA Chains Induced by Gene Co-localization , 2010, PLoS Comput. Biol..

[2]  Cédric Vaillant,et al.  Transcription-Based Solenoidal Model of Chromosomes , 2004, Complexus.

[3]  Jeremy D. Glasner,et al.  Genome-Scale Analysis of the Uses of the Escherichia coli Genome: Model-Driven Analysis of Heterogeneous Data Sets , 2003, Journal of bacteriology.

[4]  Katherine H. Huang,et al.  The MicrobesOnline Web site for comparative genomics. , 2005, Genome research.

[5]  Paul J. Choi,et al.  Quantifying E. coli Proteome and Transcriptome with Single-Molecule Sensitivity in Single Cells , 2010, Science.

[6]  B. Müller-Hill,et al.  High local protein concentrations at promoters: strategies in prokaryotic and eukaryotic cells. , 2001, BioEssays : news and reviews in molecular, cellular and developmental biology.

[7]  D. Sherratt,et al.  The two Escherichia coli chromosome arms locate to separate cell halves. , 2006, Genes & development.

[8]  J. E. Cabrera,et al.  Active Transcription of rRNA Operons Is a Driving Force for the Distribution of RNA Polymerase in Bacteria: Effect of Extrachromosomal Copies of rrnB on the In Vivo Localization of RNA Polymerase , 2006, Journal of bacteriology.

[9]  O. Sliusarenko,et al.  Spatial organization of the flow of genetic information in bacteria , 2010, Nature.

[10]  Javier Tamames,et al.  Evolution of gene order conservation in prokaryotes , 2001, Genome Biology.

[11]  Jennifer A. Mitchell,et al.  Preferential associations between co-regulated genes reveal a transcriptional interactome in erythroid cells , 2010, Nature Genetics.

[12]  Daniel Segrè,et al.  Chromosomal periodicity of evolutionarily conserved gene pairs , 2007, Proceedings of the National Academy of Sciences.

[13]  Michael K. Gilson,et al.  ASAP, a systematic annotation package for community analysis of genomes , 2003, Nucleic Acids Res..

[14]  Akira Ishihama,et al.  Two types of localization of the DNA‐binding proteins within the Escherichia coli nucleoid , 2000, Genes to cells : devoted to molecular & cellular mechanisms.

[15]  Ivan Junier,et al.  Periodic pattern detection in sparse boolean sequences , 2010, Algorithms for Molecular Biology.

[16]  L. Mirny,et al.  How gene order is influenced by the biophysics of transcription regulation , 2007, Proceedings of the National Academy of Sciences.

[17]  O. Espéli,et al.  Organization of the Escherichia coli chromosome into macrodomains and its possible functional implications. , 2006, Journal of structural biology.

[18]  J. Errington,et al.  Compartmentalization of transcription and translation in Bacillus subtilis , 2000, The EMBO journal.

[19]  David Botstein,et al.  GO: : TermFinder--open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes , 2004, Bioinform..

[20]  Arkady B Khodursky,et al.  Spatial patterns of transcriptional activity in the chromosome of Escherichia coli , 2004, Genome Biology.

[21]  B. Müller-Hill,et al.  Repression of lac promoter as a function of distance, phase and quality of an auxiliary lac operator. , 1996, Journal of molecular biology.

[22]  Peter R. Cook,et al.  Predicting three-dimensional genome structure from transcriptional activity , 2002, Nature Genetics.

[23]  A. Moya,et al.  Determination of the Core of a Minimal Bacterial Gene Set , 2004, Microbiology and Molecular Biology Reviews.

[24]  A. Travers,et al.  Coordination of genomic structure and transcription by the main bacterial nucleoid‐associated protein HU , 2010, EMBO reports.

[25]  J. Shaffer Multiple Hypothesis Testing , 1995 .

[26]  Peter R. Cook,et al.  Similar active genes cluster in specialized transcription factories , 2008, The Journal of cell biology.

[27]  Cameron S. Osborne,et al.  Active genes dynamically colocalize to shared sites of ongoing transcription , 2004, Nature Genetics.

[28]  François Képès,et al.  Periodic transcriptional organization of the E.coli genome. , 2004, Journal of molecular biology.

[29]  C. D. Hardy,et al.  Topological domain structure of the Escherichia coli chromosome. , 2004, Genes & development.

[30]  P. Fraser,et al.  Nuclear organization of the genome and the potential for gene regulation , 2007, Nature.

[31]  John Kuriyan,et al.  The origin of protein interactions and allostery in colocalization , 2007, Nature.

[32]  E. Koonin Orthologs, Paralogs, and Evolutionary Genomics 1 , 2005 .

[33]  D. Eisenberg,et al.  Assigning protein functions by comparative genome analysis: protein phylogenetic profiles. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[34]  Alessandra Carbone,et al.  Chromosomal periodicity and positional networks of genes in Escherichia coli , 2010 .

[35]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[36]  P. Bork,et al.  Measuring genome evolution. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[37]  E. Koonin Orthologs, paralogs, and evolutionary genomics. , 2005, Annual review of genetics.

[38]  H. Mori,et al.  Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection , 2006, Molecular systems biology.

[39]  Paul A. Wiggins,et al.  Strong intranucleoid interactions organize the Escherichia coli chromosome into a nucleoid filament , 2010, Proceedings of the National Academy of Sciences.

[40]  E. Gilson,et al.  A haploid-specific transcriptional response to irradiation in Saccharomyces cerevisiae , 2005, Nucleic acids research.

[41]  S. Leibler,et al.  DNA looping and physical constraints on transcription regulation. , 2003, Journal of molecular biology.

[42]  Antoine Danchin,et al.  Persistence drives gene clustering in bacterial genomes , 2008, BMC Genomics.

[43]  Terence Hwa,et al.  Combinatorial transcriptional control of the lactose operon of Escherichia coli , 2007, Proceedings of the National Academy of Sciences.

[44]  Saeed Saberi,et al.  Chromosome Driven Spatial Patterning of Proteins in Bacteria , 2010, PLoS Comput. Biol..

[45]  J. E. Cabrera,et al.  The distribution of RNA polymerase in Escherichia coli is dynamic and sensitive to environmental cues , 2003, Molecular microbiology.

[46]  E. Rocha The organization of the bacterial genome. , 2008, Annual review of genetics.

[47]  Richard A Stein,et al.  Organization of supercoil domains and their reorganization by transcription , 2005, Molecular microbiology.

[48]  References , 1971 .

[49]  Benno Müller-Hill,et al.  Repression oflacPromoter as a Function of Distance, Phase and Quality of an AuxiliarylacOperator , 1996 .

[50]  Michael Y. Galperin,et al.  The COG database: new developments in phylogenetic classification of proteins from complete genomes , 2001, Nucleic Acids Res..