The Contributions of Replication Orientation, Gene Direction, and Signal Sequences to Base-Composition Asymmetries in Bacterial Genomes

Abstract. Asymmetries in base composition between the leading and the lagging strands have been observed previously in many prokaryotic genomes. Since a majority of genes is encoded on the leading strand in these genomes, previous analyses have not been able to determine the relative contribution to the base composition skews of replication processes and transcriptional and/or translational forces. Using qualitative graphical presentations and quantitative statistical analyses (analysis of variance), we have found that a significant proportion of the GC and AT skews can be attributed to replication orientation, i.e., the sequence of a gene is influenced by whether it is encoded on the leading or lagging strand. This effect of replication orientation on skews is independent of, and can be opposite in sign to, the effects of transcriptional or translational processes, such as selection for codon usage, amino acid preferences, expression levels (inferred from codon adaptation index), or potential short signal sequences (e.g., chi sequences). Mutational differences between the leading and the lagging strands are the most likely explanation for a significant proportion of the base composition skew in these bacterial genomes. The finding that base composition skews due to replication orientation are independent of those due to selection for function of the encoded protein may complicate the interpretation of phylogenetic relationships, conserved positions in nucleotide or amino acid sequence alignments, and codon usage patterns.

[1]  A. Danchin,et al.  Universal replication biases in bacteria , 1999, Molecular microbiology.

[2]  Temple F. Smith,et al.  Patterns of Genome Organization in Bacteria , 1998, Science.

[3]  K. Novak The complete genome sequence… , 1998, Nature Medicine.

[4]  S. Salzberg,et al.  Skewed oligomers and origins of replication. , 1998, Gene.

[5]  H. Yoshikawa,et al.  Genes and their organization in the replication origin region of the bacterial chromosome , 1992, Molecular microbiology.

[6]  Bacterial DNA strand compositional asymmetry: response. , 1999, Trends in microbiology.

[7]  B. J. Hinnebusch,et al.  Physical mapping of an origin of bidirectional replication at the centre of the Borrelia burgdorferi linear chromosome , 1999, Molecular microbiology.

[8]  S. Karlin,et al.  Strand compositional asymmetry in bacterial and large viral genomes. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[9]  R. Huber,et al.  The complete genome of the hyperthermophilic bacterium Aquifex aeolicus , 1998, Nature.

[10]  J. Lobry Asymmetric substitution patterns in the two DNA strands of bacteria. , 1996, Molecular biology and evolution.

[11]  J. Lobry,et al.  Origin of Replication of Mycoplasma genitalium , 1996, Science.

[12]  C. Kurland,et al.  Codon preferences in free-living microorganisms. , 1990, Microbiological reviews.

[13]  R. Fleischmann,et al.  The complete genome sequence of the hyperthermophilic, sulphate-reducing archaeon Archaeoglobus fulgidus , 1997, Nature.

[14]  S. Cebrat,et al.  How does replication-associated mutational pressure influence amino acid composition of proteins? , 1999, Genome research.

[15]  N. W. Davis,et al.  The complete genome sequence of Escherichia coli K-12. , 1997, Science.

[16]  Y. Nakamura,et al.  Sequence analysis of the genome of the unicellular cyanobacterium Synechocystis sp. strain PCC6803. II. Sequence determination of the entire genome and assignment of potential protein-coding regions (supplement). , 1996, DNA research : an international journal for rapid publication of reports on genes and genomes.

[17]  A. Goffeau,et al.  The complete genome sequence of the Gram-positive bacterium Bacillus subtilis , 1997, Nature.

[18]  T. Sicheritz-Pontén,et al.  The genome sequence of Rickettsia prowazekii and the origin of mitochondria , 1998, Nature.

[19]  S. Karlin Bacterial DNA strand compositional asymmetry. , 1999, Trends in microbiology.

[20]  R. Fleischmann,et al.  The Minimal Gene Complement of Mycoplasma genitalium , 1995, Science.

[21]  A. Worcel,et al.  A DNA fragment containing the origin of replication of the Escherichia coli chromosome. , 1977, Proceedings of the National Academy of Sciences of the United States of America.

[22]  H. Ochman,et al.  Strand asymmetries in DNA evolution. , 1997, Trends in genetics : TIG.

[23]  K. H. Wolfe,et al.  Base Composition Skews, Replication Orientation, and Gene Orientation in 12 Prokaryote Genomes , 1998, Journal of Molecular Evolution.

[24]  D C Shields,et al.  Synonymous codon usage in Bacillus subtilis reflects both translational selection and mutational biases. , 1987, Nucleic acids research.

[25]  F. Robb,et al.  Complete sequence and gene organization of the genome of a hyper-thermophilic archaebacterium, Pyrococcus horikoshii OT3. , 1998, DNA research : an international journal for rapid publication of reports on genes and genomes.

[26]  J. Lobry,et al.  A simple vectorial representation of DNA sequences for the detection of replication origins in bacteria. , 1996, Biochimie.

[27]  Sayaka,et al.  Sequence analysis of the genome of the unicellular cyanobacterium Synechocystis sp. strain PCC6803. II. Sequence determination of the entire genome and assignment of potential protein-coding regions. , 1996, DNA research : an international journal for rapid publication of reports on genes and genomes.

[28]  A. Bhagwat,et al.  Transcription-induced mutations: increase in C to T mutations in the nontranscribed strand during transcription in Escherichia coli. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[29]  C. Dutta,et al.  Codon usage in highly expressed genes of Haemophillus influenzae and Mycobacterium tuberculosis: translational selection versus mutational bias. , 1998, Gene.

[30]  Yoshikawa Hiroshi,et al.  Replication origin of the Bacillus subtilis chromosome determined by hybridization of the first-replicating DNA with cloned fragments from the replication origin region of the chromosome , 1984 .

[31]  Mark Borodovsky,et al.  The complete genome sequence of the gastric pathogen Helicobacter pylori , 1997, Nature.

[32]  D. Labie,et al.  Molecular Evolution , 1991, Nature.

[33]  S. Salzberg,et al.  Complete genome sequence of Treponema pallidum, the syphilis spirochete. , 1998, Science.

[34]  H. Ochman,et al.  Asymmetries Generated by Transcription-Coupled Repair in Enterobacterial Genes , 1996, Science.

[35]  Ronald W. Davis,et al.  Comparative genomes of Chlamydia pneumoniae and C. trachomatis , 1999, Nature Genetics.

[36]  G. Church,et al.  Complete genome sequence of Methanobacterium thermoautotrophicum deltaH: functional analysis and comparative genomics , 1997, Journal of bacteriology.

[37]  R. W. Davis,et al.  Genome sequence of an obligate intracellular pathogen of humans: Chlamydia trachomatis. , 1998, Science.

[38]  R. Fleischmann,et al.  Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. , 1995, Science.

[39]  P. Sharp,et al.  The codon Adaptation Index--a measure of directional synonymous codon usage bias, and its potential applications. , 1987, Nucleic acids research.

[40]  L. Shapiro,et al.  Bacterial chromosome origins of replication. , 1993, Current opinion in genetics & development.

[41]  A Grigoriev,et al.  Analyzing genomes with cumulative skew diagrams. , 1998, Nucleic acids research.

[42]  H. Hilbert,et al.  Complete sequence analysis of the genome of the bacterium Mycoplasma pneumoniae. , 1996, Nucleic acids research.

[43]  Paweł Mackiewicz,et al.  Effect of replication on the third base of codons , 1999 .

[44]  Guy Perrière,et al.  Correspondence discriminant analysis: a multivariate method for comparing classes of protein and nucleic acid sequences , 1996, Comput. Appl. Biosci..

[45]  S. Salzberg,et al.  Genomic sequence of a Lyme disease spirochaete, Borrelia burgdorferi , 1997, Nature.