Analysis of intrachromosomal duplications in yeast Saccharomyces cerevisiae: a possible model for their origin.

The complete genome of the yeast Saccharomyces cerevisiae was investigated for intrachromosomal duplications at the level of nucleotide sequences. The analysis was performed by looking for long approximate repeats (from 30 to 3,885 bp) present on each of the chromosomes. We show that direct and inverted repeats exhibit very different characteristics: the two copies of direct repeats are more similar and longer than those of inverted repeats. Furthermore, contrary to the inverted repeats, a large majority of direct repeats appear to be closely spaced. The distance (delta) between the two copies is generally smaller than 1 kb. Further analysis of these "close direct repeats" shows a negative correlation between delta and the percentage of identity between the two copies, and a positive correlation between delta and repeat length. Moreover, contrary to the other categories of repeats, close direct repeats are mostly located within coding sequences (CDSs). We propose two hypotheses in order to interpret these observations: first, the deletion/conversion rate is negatively correlated with delta; second, there exists an active duplication mechanism which continuously creates close direct repeats, the other intrachromosomal repeats being the result, by chromosomal rearrangements of these "primary repeats."

[1]  G Muthukumar,et al.  Seripauperins of Saccharomyces cerevisiae: a new multigene family encoding serine-poor relatives of serine-rich proteins. , 1994, Gene.

[2]  E J Louis,et al.  The subtelomeric Y' repeat family in Saccharomyces cerevisiae: an experimental system for repeated sequence evolution. , 1990, Genetics.

[3]  Temple F. Smith,et al.  Comparison of the complete protein sets of worm and yeast: orthology and divergence. , 1998, Science.

[4]  The effect of the length of direct repeats and the presence of palindromes on deletion between directly repeated DNA sequences in bacteriophage T7. , 1991, Nucleic acids research.

[5]  S. Oliver,et al.  Erratum: Overview of the yeast genome , 1997, Nature.

[6]  R. Britten Precise sequence complementarity between yeast chromosome ends and two classes of just-subtelomeric sequences. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[7]  E. Coissac,et al.  A comparative study of duplications in bacteria and eukaryotes: the importance of telomeres. , 1997, Molecular biology and evolution.

[8]  D. Eisenberg,et al.  A census of protein repeats. , 1999, Journal of molecular biology.

[9]  E. Dervyn,et al.  Frequency of deletion formation decreases exponentially with distance between short direct repeats , 1994, Molecular microbiology.

[10]  K. H. Wolfe,et al.  Molecular evidence for an ancient duplication of the entire yeast genome , 1997, Nature.

[11]  T. D. Schneider,et al.  Information content of binding sites on nucleotide sequences. , 1986, Journal of molecular biology.

[12]  E. Louis,et al.  Chromosome ends: all the same under their caps. , 1997, Current opinion in genetics & development.

[13]  S. Jinks-Robertson,et al.  Substrate length requirements for efficient mitotic recombination in Saccharomyces cerevisiae. , 1993, Molecular and cellular biology.

[14]  Arnold L. Rosenberg,et al.  Rapid identification of repeated patterns in strings, trees and arrays , 1972, STOC.

[15]  B. Barrell,et al.  Life with 6000 Genes , 1996, Science.

[16]  Serge A. Hazout,et al.  A strategy for finding regions of similarity in complete genome sequences , 1998, Bioinform..

[17]  Kenneth H. Wolfe,et al.  Gene Duplication and Gene Conversion in the Caenorhabditis elegans Genome , 1999, Journal of Molecular Evolution.

[18]  P. Slonimski,et al.  Two yeast chromosomes are related by a fossil duplication of their centromeric regions. , 1993, Comptes rendus de l'Academie des sciences. Serie III, Sciences de la vie.

[19]  S. Lovett,et al.  Recombination between repeats in Escherichia coli by a recA-independent, proximity-sensitive mechanism , 1994, Molecular and General Genetics MGG.

[20]  A Danchin,et al.  Analysis of long repeats in bacterial genomes reveals alternative evolutionary mechanisms in Bacillus subtilis and other competent prokaryotes. , 1999, Molecular biology and evolution.

[21]  S Karlin,et al.  An efficient algorithm for identifying matches with errors in multiple long molecular sequences. , 1991, Journal of molecular biology.

[22]  H. Klein Genetic control of intrachromosomal recombination , 1995, BioEssays : news and reviews in molecular, cellular and developmental biology.

[23]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[24]  R. Fleischmann,et al.  Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. , 1995, Science.