Uneven distribution of GATC motifs in the Escherichia coli chromosome, its plasmids and its phages.

This work reconsiders the GATC motif distribution in a 1.6 Mb segment of the Escherichia coli genome, compared to its distribution in phages and plasmids. At first sight the distribution of GATC words looks random. But when a realistic model of the chromosome (made of average genes having the same codon usage as in the real chromasome), is used as a theoretical reference, strong biasesare observed. GATC pairs such as GATCNNGATC are under-represented while there is a strong positive selection for motifs separated by 10, 19, 70 and 1100 bp. The last class is the only one present in E. coli parasites. It can be ascribed to the triggering sequences of the long-patch mismatch repair system. The 6 bp class overlaps with the consensus of CAP (catabolite activator protein) and FNR (fumarate/nitrate regulator) binding sites, thus accounting for counter-selection. The other classes, which could be targets for a nucleic acid-binding protein, are almost always present inside protein coding sequences, and are members of clusters of GATC motifs. Analysis of the genes containing these motifs suggests that they correspond to a regulatory process monitoring the shift from anaerobic to aerobic growth conditions. In particular this regulation, closing down transcription of a large number of genes involved in intermediary metabolism would be well suited for the cold and oxygen shift from the mammal's gut to the standard environmental conditions. In this process the methylation status of GATC clusters would be very important for tuning transcription, and a DNA binding protein, probably a member of the cold-shock proteins family would be needed for alleviating the effects mediated by slackening of the pace of methylation during the shift.

[1]  Nancy Kleckner,et al.  SeqA: A negative modulator of replication initiation in E. coli , 1994, Cell.

[2]  J. Griffith,et al.  Bidirectional excision in methyl-directed mismatch repair. , 1993, The Journal of biological chemistry.

[3]  Antoine Danchin,et al.  METALGEN.DB: metabolism linked to the genome of Escherichia coli, a graphics-oriented database , 1993, Comput. Appl. Biosci..

[4]  A. Gunasekera,et al.  DNA sequence determinants for binding of the Escherichia coli catabolite gene activator protein. , 1992, The Journal of biological chemistry.

[5]  E. Boye,et al.  Quantitation of Dam methyltransferase in Escherichia coli , 1992, Journal of bacteriology.

[6]  A. Danchin,et al.  Evidence for horizontal gene transfer in Escherichia coli speciation. , 1991, Journal of molecular biology.

[7]  K. Makino,et al.  Purification and characterization of the Escherichia coli OxyR protein, the positive regulator for a hydrogen peroxide-inducible regulon. , 1991, Journal of biochemistry.

[8]  N. Sternberg,et al.  Cleavage of the bacteriophage P1 packaging site (pac) is regulated by adenine methylation. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[9]  N. Kleckner,et al.  E. coli oriC and the dnaA gene promoter are sequestered from dam methyltransferase following the passage of the chromosomal replication fork , 1990, Cell.

[10]  S. Karlin,et al.  Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[11]  M. Bölker,et al.  The Escherichia coli regulatory protein OxyR discriminates between methylated and unmethylated states of the phage Mu mom promoter. , 1989, The EMBO journal.

[12]  S. Cole,et al.  Molecular genetic analysis of FNR‐dependent promoters , 1989, Molecular microbiology.

[13]  S. Hattman,et al.  Molecular cloning, sequencing, and mapping of the bacteriophage T2 dam gene , 1988, Journal of bacteriology.

[14]  Charles H. Bennett Logical depth and physical complexity , 1988 .

[15]  J. Jiricny,et al.  d(GATC) sequences influence Escherichia coli mismatch repair in a distance-dependent manner from positions both upstream and downstream of the mismatch. , 1988, Nucleic acids research.

[16]  D. Russell,et al.  Hemimethylation prevents DNA replication in E. coli , 1987, Cell.

[17]  D. Söll,et al.  The effect of dam methylation on the expression of glnS in E. coli. , 1987, Biochimie.

[18]  H. Blöcker,et al.  The mom gene of bacteriophage Mu: the mechanism of methylation‐dependent expression. , 1986, The EMBO journal.

[19]  A. Hénaut,et al.  The origins of the strategy of codon use. , 1985, Biochimie.

[20]  H. P. Yockey,et al.  Information Theory And Molecular Biology , 1992 .

[21]  M. Marinus,et al.  The great GATC: DNA methylation in E. coli. , 1989, Trends in genetics : TIG.

[22]  P. Modrich,et al.  Repair of DNA base-pair mismatches in extracts of Escherichia coli. , 1984, Cold Spring Harbor symposia on quantitative biology.