Conservation of polyglutamine tract size between mice and humans depends on codon interruption.

Tandem repeats of amino acids are common in eukaryotic proteins (Green and Wang 1994; Karlin and Burge 1996; Albà, Santibáñez-Koref, and Hancock 1999). However, the functional significance of these repeats remains controversial. They have been suggested to play a role in transcriptional activation (Mitchell and Tjian 1989), and changes in glutamine repeat lengths can affect transcriptional activity of proteins (Gerber et al. 1994; Kazemi-Esfarjani, Trifiro, and Pinsky 1995). Glutamines may form polar zippers and connect transcription factors (Perutz et al. 1994). However, the evolutionary lability of these repeats in a variety of genes, including genes causing human neurodegenerative disorders, has been taken to suggest that they are not functionally important and that their size changes are driven by mutational forces (Green and Wang 1994; Rubinsztein et al. 1994, 1995). Comparative studies of repeat sizes between species have dealt with only a small sample of genes. It is therefore unclear whether the rapid change reported previously is representative of tandem amino acid repeats in general or whether conserved tandem amino acid repeats also exist. To address this question, we compared polyglutamine-encoding regions in a larger sample of genes from mice and humans. The GenBank subset of human and mouse sequences was searched for proteins with tracts of six or more glutamines using BLASTP (Altschul et al. 1990). Redundancy in the data was eliminated by running FASTA (Pearson and Lipman 1988). Sequences with 95% identity were considered redundant, and only one representative was used in subsequent analysis. Discrepancies in the lengths of polyglutamine tracts in nearly identical sequences were resolved by taking the sequence with the longest tract. The sample contained 68 different human and 27 different mouse proteins, yielding 96 polyglutamine tracts for humans and 52 for mice. BLASTP was then used to identify homologous sequences from the other species, and sequence similarity was confirmed using PILEUP (Genetics Computer Group 1997). In total, there were 44 different polyglutamine tracts in human proteins homologous to a mouse database entry, and 49 tracts in mouse proteins homologous to a human database entry; 21 were identified in both species.

[1]  John M. Hancock,et al.  Amino Acid Reiterations in Yeast Are Overrepresented in Particular Classes of Proteins and Show Evidence of a Slippage-Like Mutational Process , 1999, Journal of Molecular Evolution.

[2]  T. Petes,et al.  Stabilization of microsatellite sequences by variant repeats in the yeast Saccharomyces cerevisiae. , 1997, Genetics.

[3]  S Karlin,et al.  Trinucleotide repeats and long homopeptides in genes and proteins associated with nervous system disease and development. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[4]  H. Ellegren,et al.  Microsatellite ‘evolution’: directionality or bias? , 1995, Nature Genetics.

[5]  M. Perutz,et al.  Glutamine Repeats as Polar Zippers: Their Role in Inherited Neurodegenerative Disease , 1995, Molecular medicine.

[6]  M A Ferguson-Smith,et al.  Sequence variation and size ranges of CAG repeats in the Machado-Joseph disease, spinocerebellar ataxia type 1 and androgen receptor genes. , 1995, Human molecular genetics.

[7]  L Pinsky,et al.  Evidence for a repressive function of the long polyglutamine tract in the human androgen receptor: possible pathogenetic relevance for the (CAG)n-expanded neuronopathies. , 1995, Human molecular genetics.

[8]  Ronald Bontrop,et al.  Mutational bias provides a model for the evolution of Huntington's disease and predicts a general increase in disease prevalence , 1994, Nature Genetics.

[9]  H Green,et al.  Codon reiteration and the evolution of proteins. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[10]  S. Rusconi,et al.  Transcriptional activation modulated by homopolymeric glutamine and proline stretches. , 1994, Science.

[11]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[12]  R. Tjian,et al.  Transcriptional regulation in mammalian cells by sequence-specific DNA binding proteins. , 1989, Science.

[13]  D. Lipman,et al.  Improved tools for biological sequence comparison. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[14]  G. Gutman,et al.  Slipped-strand mispairing: a major mechanism for DNA sequence evolution. , 1987, Molecular biology and evolution.