The large‐scale distribution of somatic mutations in cancer genomes

Recently, the genome sequences from several cancers have been published, along with the genome from a noncancer tissue from the same individual, allowing the identification of new somatic mutations in the cancer. We show that there is significant variation in the density of mutations at the 1‐Mb scale within three cancer genomes and that the density of mutations is correlated between them. We also demonstrate that the density of mutations is correlated to that in the germline, as measured by the divergence between humans and chimpanzees and humans and macaques. We show that the density of mutations is correlated to the guanine and cytosine (GC) conent, replication time, distance to telomere and centromere, gene density, and nucleosome occupancy in the cancer genomes. However, overall, all factors explain less than 40% of the variance in mutation density and each factor explains very little of the variance. We find that genes associated with cancer occupy regions of the genome with significantly lower mutation rates than the average. Finally, we show that the density of mutations varies at a 10‐Mb and a chromosomal scale, but that the variation at these scales is weak. Hum Mutat 33:136–143, 2012. © 2011 Wiley Periodicals, Inc.

[1]  Mouse Genome Sequencing Consortium Initial sequencing and comparative analysis of the mouse genome , 2002, Nature.

[2]  A. Reymond,et al.  Conserved non-genic sequences — an unexpected feature of mammalian genomes , 2005, Nature Reviews Genetics.

[3]  D. Busam,et al.  An Integrated Genomic Analysis of Human Glioblastoma Multiforme , 2008, Science.

[4]  M. Lynch Rate, molecular spectrum, and consequences of human mutation , 2010, Proceedings of the National Academy of Sciences.

[5]  Trevor J Pugh,et al.  Initial genome sequencing and analysis of multiple myeloma , 2011, Nature.

[6]  Tom Royce,et al.  A comprehensive catalogue of somatic mutations from a human cancer genome , 2010, Nature.

[7]  Dirk Schübeler,et al.  Global Reorganization of Replication Domains During Embryonic Stem Cell Differentiation , 2008, PLoS biology.

[8]  Kateryna Makova,et al.  Male-driven evolution. , 2002, Current opinion in genetics & development.

[9]  Manuel Serrano,et al.  The common biology of cancer and ageing , 2007, Nature.

[10]  A. Sparks,et al.  The Genomic Landscapes of Human Breast and Colorectal Cancers , 2007, Science.

[11]  G. Marais,et al.  Biased gene conversion: implications for genome and sex evolution. , 2003, Trends in genetics : TIG.

[12]  Alexander R. Griffing,et al.  Direct measure of the de novo mutation rate in autism and schizophrenia cohorts. , 2010, American journal of human genetics.

[13]  A. Sparks,et al.  The mutation spectrum revealed by paired genome sequences from a lung cancer patient , 2010, Nature.

[14]  J. Haldane,et al.  The mutation rate of the gene for haemophilia, and its segregation ratios in males and females. , 1947, Annals of eugenics.

[15]  J. Haber,et al.  Multiple Pathways of Recombination Induced by Double-Strand Breaks in Saccharomyces cerevisiae , 1999, Microbiology and Molecular Biology Reviews.

[16]  W. Miller,et al.  Human-macaque comparisons illuminate variation in neutral substitution rates , 2008, Genome Biology.

[17]  Carsten Schwarz,et al.  Genomewide comparison of DNA sequences between humans and chimpanzees. , 2002, American journal of human genetics.

[18]  Colin N. Dewey,et al.  Initial sequencing and comparative analysis of the mouse genome. , 2002 .

[19]  D. Gudbjartsson,et al.  A high-resolution recombination map of the human genome , 2002, Nature Genetics.

[20]  D. Halligan,et al.  Distributions of selectively constrained sites and deleterious mutation rates in the hominid and murid genomes. , 2010, Molecular biology and evolution.

[21]  Jeffrey H. Chuang,et al.  Weak preservation of local neutral substitution rates across mammalian genomes , 2009, BMC Evolutionary Biology.

[22]  Juliane C. Dohm,et al.  Substantial biases in ultra-short read data sets from high-throughput DNA sequencing , 2008, Nucleic acids research.

[23]  E. Birney,et al.  Patterns of somatic mutation in human cancer genomes , 2007, Nature.

[24]  M. DePristo,et al.  Variation in genome-wide mutation rates within and between human families , 2011, Nature Genetics.

[25]  Laurent Farinelli,et al.  Impact of replication timing on non-CpG and CpG substitution rates in mammalian genomes. , 2010, Genome research.

[26]  T. Hubbard,et al.  A census of human cancer genes , 2004, Nature Reviews Cancer.

[27]  L. Hurst,et al.  Local similarity in evolutionary rates extends over whole chromosomes in human-rodent and mouse-rat comparisons: implications for understanding the mechanistic basis of the male mutation bias. , 2001, Molecular biology and evolution.

[28]  Joshua F. McMichael,et al.  DNMT3A mutations in acute myeloid leukemia. , 2010, The New England journal of medicine.

[29]  Timothy B. Stockwell,et al.  Evaluation of next generation sequencing platforms for population targeted sequencing studies , 2009, Genome Biology.

[30]  K. Kuma,et al.  Male-driven molecular evolution: a model and nucleotide sequence analysis. , 1987, Cold Spring Harbor symposia on quantitative biology.

[31]  J. Licht,et al.  DNMT3A mutations in acute myeloid leukemia , 2011, Nature Genetics.

[32]  J. Vijg Somatic mutations and aging: a re-evaluation. , 2000, Mutation research.

[33]  P. Sharp,et al.  Chromosomal location effects on gene sequence evolution in mammals , 1999, Current Biology.

[34]  G. Parmigiani,et al.  Core Signaling Pathways in Human Pancreatic Cancers Revealed by Global Genomic Analyses , 2008, Science.

[35]  Hongkai Ji,et al.  Why do human diversity levels vary at a megabase scale? , 2005, Genome research.

[36]  Ken Chen,et al.  Recurring mutations found by sequencing an acute myeloid leukemia genome. , 2009, The New England journal of medicine.

[37]  Hans Ellegren,et al.  Characteristics, causes and evolutionary consequences of male-biased mutation , 2007, Proceedings of the Royal Society B: Biological Sciences.

[38]  William Stafford Noble,et al.  Widely distributed noncoding purifying selection in the human genome , 2007, Proceedings of the National Academy of Sciences.

[39]  J. Stamatoyannopoulos,et al.  Human mutation rate associated with DNA replication timing , 2009, Nature Genetics.

[40]  Gurpreet W. Tang,et al.  Systematic sequencing of renal carcinoma reveals inactivation of histone modifying genes , 2009, Nature.

[41]  E. Birney,et al.  A small cell lung cancer genome reports complex tobacco exposure signatures , 2009, Nature.

[42]  Bernadett Papp,et al.  Genome-wide dynamics of replication timing revealed by in vitro models of mouse embryogenesis. , 2010, Genome research.

[43]  Daniel J. Gaffney,et al.  The scale of mutational variation in the murid genome. , 2005, Genome research.

[44]  Amy E. Hawkins,et al.  DNA sequencing of a cytogenetically normal acute myeloid leukemia genome , 2008, Nature.

[45]  David Haussler,et al.  Covariation in frequencies of substitution, deletion, transposition, and recombination during eutherian evolution. , 2003, Genome research.

[46]  Jean L. Chang,et al.  Initial sequence of the chimpanzee genome and comparison with the human genome , 2005, Nature.

[47]  A. Eyre-Walker Studies of synonymous codon evolution in mammals , 1992 .

[48]  Michael O Dorschner,et al.  Sequencing newly replicated DNA reveals widespread plasticity in human replication timing , 2009, Proceedings of the National Academy of Sciences.

[49]  Wen-Hsiung Li,et al.  Mutation rates differ among regions of the mammalian genome , 1989, Nature.

[50]  A. Kondrashov Contamination of the genome by very slightly deleterious mutations: why have we not died 100 times over? , 1995, Journal of theoretical biology.