Simple sequence repeats in prokaryotic genomes

Simple sequence repeats (SSRs) in DNA sequences are composed of tandem iterations of short oligonucleotides and may have functional and/or structural properties that distinguish them from general DNA sequences. They are variable in length because of slip-strand mutations and may also affect local structure of the DNA molecule or the encoded proteins. Long SSRs (LSSRs) are common in eukaryotes but rare in most prokaryotes. In pathogens, SSRs can enhance antigenic variance of the pathogen population in a strategy that counteracts the host immune response. We analyze representations of SSRs in >300 prokaryotic genomes and report significant differences among different prokaryotes as well as among different types of SSRs. LSSRs composed of short oligonucleotides (1–4 bp length, designated LSSR1–4) are often found in host-adapted pathogens with reduced genomes that are not known to readily survive in a natural environment outside the host. In contrast, LSSRs composed of longer oligonucleotides (5–11 bp length, designated LSSR5–11) are found mostly in nonpathogens and opportunistic pathogens with large genomes. Comparisons among SSRs of different lengths suggest that LSSR1–4 are likely maintained by selection. This is consistent with the established role of some LSSR1–4 in enhancing antigenic variance. By contrast, abundance of LSSR5–11 in some genomes may reflect the SSRs' general tendency to expand rather than their specific role in the organisms' physiology. Differences among genomes in terms of SSR representations and their possible interpretations are discussed.

[1]  J. Kypr,et al.  Factors Influencing Dna Expansion in the Course of Polymerase Chain Reaction , 2007, Nucleosides, nucleotides & nucleic acids.

[2]  Jan Mrázek,et al.  Pattern locator: a new tool for finding local sequence patterns in genomic DNA sequences , 2006, Bioinform..

[3]  J. Mrázek Analysis of distribution indicates diverse functions of simple sequence repeats in Mycoplasma genomes. , 2006, Molecular biology and evolution.

[4]  M. Bichara,et al.  Mechanisms of tandem repeat instability in bacteria. , 2006, Mutation research.

[5]  Y. Kashi,et al.  Simple sequence repeats as advantageous mutators in evolution. , 2006, Trends in genetics : TIG.

[6]  W. Nierman,et al.  Bacterial genome adaptation to niches: Divergence of the potential virulence genes in three Burkholderia species of different survival strategies , 2005, BMC Genomics.

[7]  Marc S. Cortese,et al.  Flexible nets , 2005, The FEBS journal.

[8]  E. Groisman,et al.  The origin and evolution of human pathogens , 2005, Molecular Microbiology.

[9]  O. White,et al.  Structural flexibility in the Burkholderia mallei genome. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Kim Rutherford,et al.  Genomic plasticity of the causative agent of melioidosis, Burkholderia pseudomallei. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[11]  E. Rocha An appraisal of the potential for illegitimate recombination in bacterial genomes and its consequences: from duplications to genome reduction. , 2003, Genome research.

[12]  B. Duim,et al.  Homonucleotide stretches in chromosomal DNA of Campylobacter jejuni display high frequency polymorphism as detected by direct PCR analysis. , 2002, FEMS microbiology letters.

[13]  Eduardo P C Rocha,et al.  Genomic repeats, genome plasticity and the dynamics of Mycoplasma evolution. , 2002, Nucleic acids research.

[14]  M. Perutz,et al.  Aggregation of proteins with expanded glutamine and alanine repeats of the glutamine-rich and asparagine-rich domains of Sup35 and of the amyloid β-peptide of amyloid plaques , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[15]  G. Shoham,et al.  Structural analysis of an Escherichia coli endonuclease VIII covalent reaction intermediate , 2002, The EMBO journal.

[16]  S. Karlin,et al.  Amino acid runs in eukaryotic proteomes and disease associations , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[17]  G. Lawson,et al.  Lawsonia intracellularis: getting inside the pathogenesis of proliferative enteropathy. , 2001, Veterinary microbiology.

[18]  D. Metzgar,et al.  The microsatellites of Escherichia coli: rapidly evolving repetitive DNAs in a non‐pathogenic prokaryote , 2001, Molecular microbiology.

[19]  J. Jurka,et al.  Microsatellites in different eukaryotic genomes: survey and analysis. , 2000, Genome research.

[20]  R. Shafer,et al.  Biological aspects of DNA/RNA quadruplexes , 2000, Biopolymers.

[21]  Y. Kashi,et al.  Simple sequence repeats in Escherichia coli: abundance, distribution, composition, and polymorphism. , 2000, Genome research.

[22]  J. Kypr,et al.  Nucleotide sequences flanking dinucleotide microsatellites in the human, mouse and Drosophila genomes. , 1999, Journal of biomolecular structure & dynamics.

[23]  P. Markham,et al.  Expression of the pMGA Genes of Mycoplasma gallisepticum Is Controlled by Variation in the GAA Trinucleotide Repeat Lengths within the 5′ Noncoding Regions , 1998, Infection and Immunity.

[24]  C. Wills,et al.  Abundant microsatellite polymorphism in Saccharomyces cerevisiae, and the different distributions of microsatellites in eight prokaryotes and S. cerevisiae, result from strong mutation pressures and a variety of selective forces. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[25]  S Karlin,et al.  Compositional biases of bacterial genomes and evolutionary implications , 1997, Journal of bacteriology.

[26]  C. Caskey,et al.  Trinucleotide repeat disorders in humans: discussions of mechanisms and medical issues , 1996, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[27]  S. Karlin,et al.  Frequent oligonucleotides and peptides of the Haemophilus influenzae genome. , 1996, Nucleic acids research.

[28]  J. Kraut,et al.  Crystal structures of human DNA polymerase beta complexed with DNA: implications for catalytic mechanism, processivity, and fidelity. , 1996, Biochemistry.

[29]  R. Fleischmann,et al.  DNA repeats identify novel virulence genes in Haemophilus influenzae. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[30]  M. Nowak,et al.  Adaptive evolution of highly mutable loci in pathogenic bacteria , 1994, Current Biology.

[31]  K. V. van Holde,et al.  Unusual DNA structures, chromatin and transcription. , 1994, BioEssays : news and reviews in molecular, cellular and developmental biology.

[32]  D Larhammar,et al.  Biological origins of long-range correlations and compositional variations in DNA. , 1993, Nucleic acids research.

[33]  S. Barns,et al.  Ileal symbiont intracellularis, an obligate intracellular bacterium of porcine intestines showing a relationship to Desulfovibrio species. , 1993, International journal of systematic bacteriology.

[34]  S Karlin,et al.  Assessments of DNA inhomogeneities in yeast chromosome III. , 1993, Nucleic acids research.

[35]  David R. Wolf,et al.  Base compositional structure of genomes. , 1992, Genomics.

[36]  F. Mooi,et al.  Fimbrial phase variation in Bordetella pertussis: a novel mechanism for transcriptional regulation. , 1990, The EMBO journal.

[37]  J. Dahlberg,et al.  Topology and formation of triple-stranded H-DNA. , 1989, Science.

[38]  T. Meyer,et al.  Opacity genes in Neisseria gonorrhoeae: Control of phase and antigenic variation , 1986, Cell.

[39]  A. Rich,et al.  The sequence (dC-dA)n X (dG-dT)n forms left-handed Z-DNA in negatively supercoiled plasmids. , 1983, Proceedings of the National Academy of Sciences of the United States of America.