Abundance, Distribution, and Mutation Rates of Homopolymeric Nucleotide Runs in the Genome of Caenorhabditis elegans

Homopolymeric nucleotide runs, also called mononucleotide microsatellites, are a ubiquitous, dominant, and mutagenic feature of eukaryotic genomes. A clear understanding of the forces that shape patterns of homopolymer evolution, however, is lacking. We provide a focused investigation of the abundance, chromosomal distribution, and mutation spectra of the four strand-specific homopolymer types (A, T, G, C) ≥8 bp in the genome of Caenorhabditis elegans. A and T homopolymers vastly outnumber G and C homopolymers, and the run-length distributions of A and T homopolymers differ significantly from those of G and C homopolymers. A scanning window analysis of homopolymer chromosomal distribution reveals distinct clusters of homopolymer density in the autosome arms, which are regions of high recombination in C. elegans. Dramatic biases are detected among closely spaced homopolymers; for instance, we observe 994 A homopolymers immediately followed by a T homopolymer (5′ to 3′) but only 8 T homopolymers directly followed by an A homopolymer. Empirical homopolymer mutation assays in a set of C. elegans mutation-accumulation lines reveal an ∼20-fold higher mutation rate for G and C homopolymers than for A and T homopolymers. Nuclear A and T homopolymers are also found to mutate ∼100-fold more slowly than mitochondrial A and T homopolymers. This integrative approach yields a total nuclear genome-wide homopolymer mutation rate estimate of ∼1.6 mutations per genome per generation.
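
The homopolymer census described above (runs of ≥8 bp, tallied per base on one strand, plus counts of runs that directly abut one another) can be illustrated with a short sketch. This is not the authors' pipeline; it is a minimal Python example under stated assumptions: the sequence is a plain string for a single strand read 5′ to 3′, "immediately followed" is taken to mean zero intervening bases, and the function names `find_homopolymers`, `count_by_base`, and `count_adjacent_pairs` are hypothetical.

```python
import re
from collections import Counter

MIN_RUN = 8  # length threshold from the abstract (runs of >= 8 bp)


def find_homopolymers(seq, min_run=MIN_RUN):
    """Return (start, end, base) for each maximal single-base run >= min_run.

    Coordinates are 0-based, end-exclusive, on the strand as given.
    """
    # ([ACGT]) captures one base; \1{n,} requires at least n further copies.
    pattern = re.compile(r"([ACGT])\1{%d,}" % (min_run - 1))
    return [(m.start(), m.end(), m.group(1)) for m in pattern.finditer(seq.upper())]


def count_by_base(runs):
    """Tally runs per base type (A, T, G, C)."""
    return Counter(base for _, _, base in runs)


def count_adjacent_pairs(runs):
    """Count ordered pairs of runs that abut with no bases in between.

    Assumption: 'immediately followed' means the first run's end equals
    the second run's start; the source may use a different criterion.
    """
    pairs = Counter()
    for (_, end1, base1), (start2, _, base2) in zip(runs, runs[1:]):
        if end1 == start2:
            pairs[(base1, base2)] += 1
    return pairs


if __name__ == "__main__":
    # Toy sequence, not real C. elegans data.
    toy = "CG" + "A" * 9 + "T" * 8 + "CA" + "G" * 10 + "AC"
    runs = find_homopolymers(toy)
    print(runs)
    print(count_by_base(runs))
    print(count_adjacent_pairs(runs))
```

Because the scan is strand-specific, an A run on the given strand is a T run on its complement; running the same functions on the reverse complement would cover the other strand.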
