Differential distribution of simple sequence repeats in eukaryotic genome sequences.

Complete chromosome/genome sequences available from humans, Drosophila melanogaster, Caenorhabditis elegans, Arabidopsis thaliana, and Saccharomyces cerevisiae were analyzed for the occurrence of mono-, di-, tri-, and tetranucleotide repeats. In all of the genomes studied, dinucleotide repeat stretches tended to be longer than other repeats. Additionally, tetranucleotide repeats in humans and trinucleotide repeats in Drosophila also seemed to be longer. Although the trends for different repeats are similar between different chromosomes within a genome, the density of repeats may vary between different chromosomes of the same species. The abundance or rarity of various di- and trinucleotide repeats in different genomes cannot be explained by nucleotide composition of a sequence or potential of repeated motifs to form alternative DNA structures. This suggests that in addition to nucleotide composition of repeat motifs, characteristic DNA replication/repair/recombination machinery might play an important role in the genesis of repeats. Moreover, analysis of complete genome coding DNA sequences of Drosophila, C. elegans, and yeast indicated that expansions of codon repeats corresponding to small hydrophilic amino acids are tolerated more, while strong selection pressures probably eliminate codon repeats encoding hydrophobic and basic amino acids. The locations and sequences of all of the repeat loci detected in genome sequences and coding DNA sequences are available at http://www.ncl-india.org/ssr and could be useful for further studies.

[1]  D. Tautz,et al.  Simple sequences are ubiquitous repetitive components of eukaryotic genomes. , 1984, Nucleic acids research.

[2]  D. Tautz,et al.  Cryptic simplicity in DNA is a major source of genetic variation , 1986, Nature.

[3]  A. Rich,et al.  (dC‐dA)n.(dG‐dT)n sequences have evolutionarily conserved chromosomal locations in Drosophila with implications for roles in chromosome structure and function. , 1987, The EMBO journal.

[4]  G. Gutman,et al.  Slipped-strand mispairing: a major mechanism for DNA sequence evolution. , 1987, Molecular biology and evolution.

[5]  J. Beckmann,et al.  Toward a Unified Approach to Genetic Mapping of Eukaryotes Based on Sequence Tagged Microsatellite Sites , 1990, Bio/Technology.

[6]  D. Schorderet,et al.  Analysis of CpG suppression in methylated and nonmethylated species. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[7]  M. Morgante,et al.  PCR-amplified microsatellites as markers in plant genetics. , 1993, The Plant journal : for cell and molecular biology.

[8]  H Green,et al.  Codon reiteration and the evolution of proteins. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[9]  A. Marquis Gacy,et al.  Trinucleotide repeats that expand in human disease form hairpin structures in vitro , 1995, Cell.

[10]  T. Kamp,et al.  Hairpin properties of single-stranded DNA containing a GC-rich triplet repeat: (CTG)15. , 1995, Nucleic acids research.

[11]  O. Panaud,et al.  Frequency of microsatellite sequences in rice (Oryza sativa L.). , 1995, Genome.

[12]  S. Warren,et al.  Trinucleotide repeat expansion and human disease. , 1995, Annual review of genetics.

[13]  B. Barrell,et al.  Life with 6000 Genes , 1996, Science.

[14]  B. Dujon,et al.  Distribution and variability of trinucleotide repeats in the genome of the yeast Saccharomyces cerevisiae. , 1996, Gene.

[15]  D N Stivers,et al.  Relative mutation rates at di-, tri-, and tetranucleotide microsatellite loci. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[16]  T. Petes,et al.  Microsatellite instability in yeast: dependence on the length of the microsatellite. , 1997, Genetics.

[17]  T. Petes,et al.  Stabilization of microsatellite sequences by variant repeats in the yeast Saccharomyces cerevisiae. , 1997, Genetics.

[18]  Andrew Smith Genome sequence of the nematode C-elegans: A platform for investigating biology , 1998 .

[19]  R. Sinden,et al.  Trinucleotide repeat DNA structures: dynamic mutations from dynamic DNA. , 1998, Current opinion in structural biology.

[20]  R. Durrett,et al.  Equilibrium distributions of microsatellite repeat length resulting from a balance between slippage events and point mutations. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[21]  K. Wetterstrand,et al.  The mutation rates of di-, tri- and tetranucleotide repeats in Drosophila melanogaster. , 1998, Molecular biology and evolution.

[22]  Melanie E. Goward,et al.  The DNA sequence of human chromosome 22 , 1999, Nature.

[23]  T. Petes,et al.  Triplet repeats form secondary structures that escape DNA repair in yeast. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[24]  John M. Hancock,et al.  Amino Acid Reiterations in Yeast Are Overrepresented in Particular Classes of Proteins and Show Evidence of a Slippage-Like Mutational Process , 1999, Journal of Molecular Evolution.

[25]  C. Schlötterer,et al.  Distribution of dinucleotide microsatellites in the Drosophila melanogaster genome. , 1999, Molecular biology and evolution.

[26]  William C. Nierman,et al.  Lin, X. et al. Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana. Nature 402, 761-768 , 1999 .

[27]  M. Cotton,et al.  Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana , 1999, Nature.

[28]  Max F. Perutz,et al.  Glutamine repeats and neurodegenerative diseases: molecular aspects. , 1999, Trends in biochemical sciences.

[29]  C. Schlötterer,et al.  Removal of microsatellite interruptions by DNA replication slippage: phylogenetic evidence from Drosophila. , 2000, Molecular biology and evolution.

[30]  Stephen M. Mount,et al.  The genome sequence of Drosophila melanogaster. , 2000, Science.

[31]  Mei Peng,et al.  The direction of microsatellite mutations is dependent upon allele length , 2000, Nature Genetics.

[32]  J. Jurka,et al.  Microsatellites in different eukaryotic genomes: survey and analysis. , 2000, Genome research.

[33]  R. Durrett,et al.  Distribution and abundance of microsatellites in the yeast genome can Be explained by a balance between slippage events and point mutations. , 2000, Molecular biology and evolution.

[34]  M. V. Katti,et al.  Amino acid repeat patterns in protein sequences: Their diversity and structural‐functional implications , 2000, Protein science : a publication of the Protein Society.

[35]  Hans Ellegren,et al.  Heterogeneous mutation processes in human microsatellite DNA sequences , 2000, Nature Genetics.

[36]  C. Schlötterer,et al.  Long microsatellite alleles in Drosophila melanogaster have a downward mutation bias and short persistence times, which cause their genome-wide underrepresentation. , 2000, Genetics.