Proteome composition and codon usage in spirochaetes: species-specific and DNA strand-specific mutational biases.

The genomes of the spirochaetes Borrelia burgdorferi and Treponema pallidum show strong strand-specific skews in nucleotide composition, with the leading strand in replication being richer in G and T than the lagging strand in both species. This mutation bias results in codon usage and amino acid composition patterns that are significantly different between genes encoded on the two strands, in both species. There are also substantial differences between the species, with T.pallidum having a much higher G+C content than B. burgdorferi. These changes in amino acid and codon compositions represent neutral sequence change that has been caused by strong strand- and species-specific mutation pressures. Genes that have been relocated between the leading and lagging strands since B. burgdorferi and T.pallidum diverged from a common ancestor now show codon and amino acid compositions typical of their current locations. There is no evidence that translational selection operates on codon usage in highly expressed genes in these species, and the primary influence on codon usage is whether a gene is transcribed in the same direction as replication, or opposite to it. The dnaA gene in both species has codon usage patterns distinctive of a lagging strand gene, indicating that the origin of replication lies downstream of this gene, possibly within dnaN. Our findings strongly suggest that gene-finding algorithms that ignore variability within the genome may be flawed.

[1]  B. K. Davis Evolution of the genetic code. , 1999, Progress in biophysics and molecular biology.

[2]  Shōzō Ōsawa,et al.  Evolution of the genetic code , 1995 .

[3]  D. Labie,et al.  Molecular Evolution , 1991, Nature.

[4]  S. Eykyn Microbiology , 1950, The Lancet.

[5]  R. Clarke,et al.  Theory and Applications of Correspondence Analysis , 1985 .