Differential DNA repair underlies mutation hotspots at active promoters in cancer genomes

Promoters are DNA sequences that have an essential role in controlling gene expression. While recent whole cancer genome analyses have identified numerous hotspots of somatic point mutations within promoters, many have not yet been shown to perturb gene expression or drive cancer development. As such, positive selection alone may not adequately explain the frequency of promoter point mutations in cancer genomes. Here we show that increased mutation density at gene promoters can be linked to promoter activity and differential nucleotide excision repair (NER). By analysing 1,161 human cancer genomes across 14 cancer types, we find evidence for increased local density of somatic point mutations within the centres of DNase I-hypersensitive sites (DHSs) in gene promoters. Mutated DHSs were strongly associated with transcription initiation activity: active promoters, but not enhancers of equal DNase I hypersensitivity, were most mutated relative to their flanking regions. Notably, analysis of genome-wide maps of NER shows that NER is impaired within the DHS centre of active gene promoters, while XPC-deficient skin cancers do not show increased promoter mutation density, pinpointing differential NER as the underlying cause of these mutation hotspots. Consistent with this finding, we observe that melanomas with an ultraviolet-induced DNA damage mutation signature show the greatest enrichment of promoter mutations, whereas cancers that are not highly dependent on NER, such as colon cancer, show no sign of such enrichment. Taken together, our analysis has uncovered a previously unknown mechanism linking transcription initiation and NER as a major contributor to somatic point mutation hotspots at active gene promoters in cancer genomes.
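
The comparison of mutation density at DHS centres against flanking regions can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names, the 100 bp centre half-width and the 900 bp flank width are hypothetical choices made only for illustration, and positions are assumed to lie on a single chromosome.

```python
from bisect import bisect_left


def count_in_window(sorted_positions, start, end):
    """Count mutations with start <= position < end (positions must be sorted)."""
    return bisect_left(sorted_positions, end) - bisect_left(sorted_positions, start)


def centre_vs_flank_ratio(dhs_centres, mutation_positions,
                          centre_half_width=100, flank_width=900):
    """Ratio of mutation density in DHS centres to density in both flanks.

    Window sizes are illustrative assumptions, not values from the paper.
    Returns None when the ratio is undefined (no sites or no flank mutations).
    """
    positions = sorted(mutation_positions)
    centre_count = 0
    flank_count = 0
    for centre in dhs_centres:
        c_start = centre - centre_half_width
        c_end = centre + centre_half_width
        # Mutations inside the central window of this site.
        centre_count += count_in_window(positions, c_start, c_end)
        # Mutations in the upstream and downstream flanks.
        flank_count += count_in_window(positions, c_start - flank_width, c_start)
        flank_count += count_in_window(positions, c_end, c_end + flank_width)

    n_sites = len(dhs_centres)
    centre_bp = n_sites * 2 * centre_half_width
    flank_bp = n_sites * 2 * flank_width
    if n_sites == 0 or flank_count == 0:
        return None
    # Densities are mutations per base pair, pooled over all sites.
    return (centre_count / centre_bp) / (flank_count / flank_bp)


if __name__ == "__main__":
    # Toy data: one site centred at position 1000 and five mutations.
    ratio = centre_vs_flank_ratio([1000], [500, 950, 1000, 1050, 1500])
    print(ratio)
```

A ratio above 1 indicates more mutations per base pair in the centre than in the flanks. Pooling counts across sites before dividing keeps the estimate stable when individual sites carry few mutations.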
