MutationalPatterns: comprehensive genome-wide analysis of mutational processes

Background: Base substitution catalogues represent historical records of the mutational processes that have been active in a cell. Such processes can be distinguished by various characteristics, such as mutation type, sequence context, transcriptional and replicative strand bias, genomic distribution, and association with (epi)genomic features.

Results: We have created MutationalPatterns, an R/Bioconductor package that allows researchers to characterize a broad range of patterns in base substitution catalogues in order to dissect the underlying molecular mechanisms. Furthermore, it offers an efficient method to quantify the contribution of known mutational signatures within single samples. This analysis can be used to determine whether certain DNA repair mechanisms are perturbed and to further characterize the processes underlying known mutational signatures.

Conclusions: MutationalPatterns allows for easy characterization and visualization of mutational patterns. These analyses will support fundamental research into mutational mechanisms and may ultimately improve cancer diagnosis and treatment strategies. MutationalPatterns is freely available at http://bioconductor.org/packages/MutationalPatterns.
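
The abstract mentions quantifying the contribution of known mutational signatures within a single sample. One common way to frame this is as a non-negative least-squares problem: find non-negative weights so that a weighted sum of signature profiles approximates the sample's mutation counts. The following is a minimal sketch of that idea in plain Python, not the package's actual implementation. The function name `fit_signatures`, the six-channel toy profiles, and the projected gradient solver are all assumptions made for illustration.

```python
def fit_signatures(signatures, counts, iterations=2000):
    """Estimate non-negative signature contributions for one sample.

    signatures: list of k profiles, each a list of n channel probabilities.
    counts:     list of n observed mutation counts for the sample.
    Returns a list of k non-negative contributions.

    Illustrative solver: projected gradient descent on
    0.5 * ||S x - m||^2 subject to x >= 0 (hypothetical helper,
    not the MutationalPatterns API).
    """
    k, n = len(signatures), len(counts)
    # Gram matrix G = S^T S and right-hand side b = S^T m.
    gram = [[sum(signatures[i][c] * signatures[j][c] for c in range(n))
             for j in range(k)] for i in range(k)]
    rhs = [sum(signatures[i][c] * counts[c] for c in range(n))
           for i in range(k)]
    # The trace of G bounds its largest eigenvalue, so 1/trace is a safe step.
    step = 1.0 / sum(gram[i][i] for i in range(k))
    x = [0.0] * k
    for _ in range(iterations):
        grad = [sum(gram[i][j] * x[j] for j in range(k)) - rhs[i]
                for i in range(k)]
        # Gradient step followed by projection onto the non-negative orthant.
        x = [max(0.0, x[i] - step * grad[i]) for i in range(k)]
    return x


# Toy profiles over six mutation types (made-up numbers, each sums to 1).
sig_a = [0.10, 0.10, 0.60, 0.10, 0.05, 0.05]
sig_b = [0.40, 0.30, 0.05, 0.05, 0.10, 0.10]
# Sample constructed as 100 mutations from sig_a plus 50 from sig_b.
sample = [100 * a + 50 * b for a, b in zip(sig_a, sig_b)]
contributions = fit_signatures([sig_a, sig_b], sample)
```

On this constructed sample the estimated contributions come out close to 100 and 50. In practice a dedicated non-negative least-squares routine would be preferable to this simple iterative scheme.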
