Single-nucleotide mutation rate increases close to insertions/deletions in eukaryotes

Mutation hotspots are commonly observed in genomic sequences and at certain human disease loci, but general mechanisms for their formation remain elusive. Here we investigate the distribution of single-nucleotide changes around insertions/deletions (indels) in six independent genome comparisons, covering primates, rodents, fruit fly, rice and yeast. In each of these comparisons, nucleotide divergence (D) is substantially elevated in the sequence surrounding indels and decreases monotonically to near-background levels over several hundred bases. D is significantly correlated with both the size and the abundance of nearby indels. In comparisons of closely related species, derived nucleotide substitutions surrounding indels occur in significantly greater numbers in the lineage containing the indel than in the one containing the ancestral (non-indel) allele; the same holds within species for single-nucleotide mutations surrounding polymorphic indels. We propose that heterozygosity for an indel is mutagenic to surrounding sequences, and we use yeast genome-wide polymorphism data to estimate the increase in mutation rate. The consistency of these patterns within and between species suggests that indel-associated substitution is a general mutational mechanism.
