Trinucleotide repeats are clustered in regulatory genes in Saccharomyces cerevisiae.

The genome of Saccharomyces cerevisiae contains numerous unstable microsatellite sequences. Mononucleotide and dinucleotide repeats are rarely found in ORFs and, when present in an ORF, are frequently located in an intron or at the C terminus of the protein, suggesting that their instability is deleterious to gene function. DNA trinucleotide repeats (TNRs) are found at a higher-than-expected frequency within ORFs, and the amino acids encoded by the TNRs represent a biased set. TNRs are rarely conserved between genes with related sequences, suggesting high instability or a recent origin. The genes in which TNRs are most frequently found are related to cellular regulation. The protein structural database is notably lacking in proteins containing amino acid tracts, suggesting that such tracts are not located in structured regions of a protein but rather between domains. This conclusion is consistent with the location of amino acid tracts in two protein families. The preferred location of TNRs within the ORFs of genes related to cellular regulation, together with their instability, suggests that TNRs could have an important role in speciation. Specifically, TNRs could serve as hot spots for recombination leading to domain swapping, or mutation of TNRs could allow rapid evolution of new domains of protein structure.
