Guanine Holes Are Prominent Targets for Mutation in Cancer and Inherited Disease

Single base substitutions constitute the most frequent type of human gene mutation and are a leading cause of cancer and inherited disease. These alterations occur non-randomly in DNA and are strongly influenced by the local nucleotide sequence context. However, the molecular mechanisms underlying such sequence context-dependent mutagenesis are not fully understood. Using bioinformatics, computational and molecular modeling analyses, we have determined the frequencies of mutation at G•C base pairs in the context of all 64 5′-NGNN-3′ motifs in which the mutated guanine occupies the second position. Twenty-four datasets were employed, comprising >530,000 somatic single base substitutions from 21 cancer genomes, >77,000 germline single base substitutions causing or associated with human inherited disease, and 16.7 million benign germline single-nucleotide variants. In several cancer types, the number of mutated motifs correlated with both the free energies of base stacking and the energies required to abstract an electron from the target guanines (ionization potentials). Similar correlations were also evident for the pathological missense and nonsense germline mutations, but only when the target guanines were located on the non-transcribed DNA strand. Likewise, pathogenic splicing mutations predominantly affected positions in which a purine was located on the non-transcribed DNA strand. Novel candidate driver mutations and tissue-specific mutational patterns were also identified in the cancer datasets. We conclude that electron transfer reactions within the DNA molecule contribute to sequence context-dependent mutagenesis, involving both somatic driver and passenger mutations in cancer as well as germline alterations causing or associated with inherited disease.
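
The motif tally described above can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names, the 0-based position convention and the toy sequence are assumptions made for illustration only. The sketch enumerates the 64 5′-NGNN-3′ motifs and, for a substitution at a G•C base pair, reports the motif on whichever strand carries the guanine (a mutated C on the given strand is read from the reverse complement).

```python
from collections import Counter
from itertools import product

BASES = "ACGT"
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# All 64 motifs of the form 5'-NGNN-3' (guanine fixed at the second position).
MOTIFS = ["".join((n1, "G", n3, n4)) for n1, n3, n4 in product(BASES, repeat=3)]


def reverse_complement(seq):
    """Return the reverse complement of an A/C/G/T string."""
    return seq.translate(COMPLEMENT)[::-1]


def ngnn_context(sequence, pos):
    """Return the 5'-NGNN-3' motif around a mutated G•C pair, or None.

    `pos` is a 0-based index into `sequence` (an assumed convention).
    None is returned for A/T sites, sites too close to the sequence ends,
    or windows containing characters other than A, C, G, T.
    """
    seq = sequence.upper()
    base = seq[pos]
    if base == "G":
        # One base 5' of the guanine, two bases 3' of it.
        if pos - 1 < 0 or pos + 3 > len(seq):
            return None
        motif = seq[pos - 1:pos + 3]
    elif base == "C":
        # The guanine sits on the opposite strand, so read that strand 5'->3'.
        if pos - 2 < 0 or pos + 2 > len(seq):
            return None
        motif = reverse_complement(seq[pos - 2:pos + 2])
    else:
        return None
    return motif if set(motif) <= set(BASES) else None


def count_mutated_motifs(sequence, positions):
    """Tally mutated sites per NGNN motif; motifs never hit keep a zero count."""
    counts = Counter({motif: 0 for motif in MOTIFS})
    for pos in positions:
        motif = ngnn_context(sequence, pos)
        if motif is not None:
            counts[motif] += 1
    return counts


if __name__ == "__main__":
    toy_sequence = "ATGCAGGTC"   # hypothetical reference fragment
    toy_positions = [2, 3, 5, 6]  # hypothetical mutated sites
    tallies = count_mutated_motifs(toy_sequence, toy_positions)
    print({motif: n for motif, n in tallies.items() if n})
```

Per-motif counts of this kind are what would then be compared with motif-level quantities such as stacking free energies or guanine ionization potentials; those values are not reproduced here.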
