The landscape of somatic mutations in protein coding genes in apparently benign human tissues carries signatures of relaxed purifying selection

Mutations acquired during development and aging lead to inter- and intra-tissue genetic variations. Evidence linking such mutations to complex traits and diseases is rising. We detected somatic mutations in protein-coding regions in 140 benign tissue samples representing nine tissue-types (bladder, breast, liver, lung, prostate, stomach, thyroid, head and neck) and paired blood from 70 donors. A total of 80% of the samples had 2–39 mutations detectable at tissue-level resolution. Factors such as age and smoking were associated with increased burden of detectable mutations, and tissues carried signatures of distinct mutagenic processes such as oxidative DNA damage and transcription-coupled repair. Using mutational signatures, we predicted that majority of the mutations in blood originated in hematopoietic stem and early progenitor cells. Missense to silent mutations ratio and the persistence of potentially damaging mutations in expressed genes carried signatures of relaxed purifying selection. Our findings have relevance for etiology, diagnosis and treatment of diseases including cancer.

[1]  Esti Yeger-Lotem,et al.  Cancer Evolution Is Associated with Pervasive Positive Selection on Globally Expressed Genes , 2014, PLoS genetics.

[2]  David T. W. Jones,et al.  Signatures of mutational processes in human cancer , 2013, Nature.

[3]  Sebastian Bonhoeffer,et al.  Dynamic variation in cycling of hematopoietic stem cells in steady state and inflammation , 2011, The Journal of experimental medicine.

[4]  M. Gerstein,et al.  Somatic copy-number mosaicism in human skin revealed by induced pluripotent stem cells , 2012, Nature.

[5]  Jing Hu,et al.  SIFT web server: predicting effects of amino acid substitutions on proteins , 2012, Nucleic Acids Res..

[6]  M. Hirst,et al.  Analysis of the clonal growth and differentiation dynamics of primitive barcoded human cord blood cells in NSG mice. , 2013, Blood.

[7]  M. DePristo,et al.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. , 2010, Genome research.

[8]  L. Bystrykh,et al.  Heterogeneity of young and aged murine hematopoietic stem cells revealed by quantitative clonal analysis using cellular barcoding. , 2013, Blood.

[9]  S. Horvath DNA methylation age of human tissues and cell types , 2013, Genome Biology.

[10]  Tom Lenaerts,et al.  Dynamics of Mutant Cells in Hierarchical Organized Tissues , 2011, PLoS Comput. Biol..

[11]  C. Walsh,et al.  Somatic Mutation, Genomic Variation, and Neurological Disease , 2013, Science.

[12]  S. Batalov,et al.  A gene atlas of the mouse and human protein-encoding transcriptomes. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[13]  I. Fidler,et al.  The pathogenesis of cancer metastasis: the 'seed and soil' hypothesis revisited , 2003, Nature Reviews Cancer.

[14]  Giovanni Parmigiani,et al.  Half or more of the somatic mutations in cancers of self-renewing tissues originate prior to tumor initiation , 2013, Proceedings of the National Academy of Sciences.

[15]  Subhajyoti De,et al.  Patterns of somatically acquired amplifications and deletions in apparently normal tissues of ovarian cancer patients. , 2014, Cell reports.

[16]  Tim Holland-Letz,et al.  Fundamental properties of unperturbed haematopoiesis from stem cells in vivo , 2015, Nature.

[17]  Rick Durrett,et al.  Population Genetics of Polymorphism and Divergence Under Fluctuating Selection , 2008, Genetics.

[18]  M. Stratton,et al.  High burden and pervasive positive selection of somatic mutations in normal human skin , 2015, Science.

[19]  Michael P. Snyder,et al.  Extensive genetic variation in somatic human tissues , 2012, Proceedings of the National Academy of Sciences.

[20]  P. Guttorp,et al.  The replication rate of human hematopoietic stem cells in vivo. , 2011, Blood.

[21]  E. Matunis,et al.  Stem cell competition: finding balance in the niche. , 2013, Trends in cell biology.

[22]  I. Lemischka,et al.  Clonal and systemic analysis of long-term hematopoiesis in the mouse. , 1990, Genes & development.

[23]  Asif U. Tamuri,et al.  Genome sequencing of normal cells reveals developmental lineages and mutational processes , 2014, Nature.

[24]  Shigehiko Kanaya,et al.  Codon Usage and tRNA Genes in Eukaryotes: Correlation of Codon Usage Diversity with Translation Efficiency and with CG-Dinucleotide Usage as Assessed by Multivariate Analysis , 2001, Journal of Molecular Evolution.

[25]  Peter Guttorp,et al.  Evidence that the number of hematopoietic stem cells per animal is conserved in mammals. , 2002, Blood.

[26]  Joshua M. Stuart,et al.  The Cancer Genome Atlas Pan-Cancer analysis project , 2013, Nature Genetics.

[27]  H. Rubin,et al.  Fields and field cancerization: The preneoplastic origins of cancer , 2011, BioEssays : news and reviews in molecular, cellular and developmental biology.

[28]  Marcel J T Reinders,et al.  Somatic mutations found in the healthy blood compartment of a 115-yr-old woman demonstrate oligoclonal hematopoiesis , 2014, Genome research.

[29]  Mingming Jia,et al.  COSMIC: mining complete cancer genomes in the Catalogue of Somatic Mutations in Cancer , 2010, Nucleic Acids Res..

[30]  Christopher A. Miller,et al.  VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing. , 2012, Genome research.

[31]  Laura C. Greaves,et al.  Comparison of Mitochondrial Mutation Spectra in Ageing Human Colonic Epithelium and Disease: Absence of Evidence for Purifying Selection in Somatic Mitochondrial DNA Point Mutations , 2012, PLoS genetics.

[32]  Tomas Lindahl,et al.  Human DNA repair genes, 2005. , 2005, Mutation research.

[33]  Vikas Bansal,et al.  A statistical method for the detection of variants from next-generation resequencing of DNA pools , 2010, Bioinform..

[34]  Ralf Herwig,et al.  The ConsensusPathDB interaction database: 2013 update , 2012, Nucleic Acids Res..

[35]  M. McCarthy,et al.  Age-related clonal hematopoiesis associated with adverse outcomes. , 2014, The New England journal of medicine.

[36]  R. Millikan,et al.  p53 mutations in benign breast tissue. , 1995, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[37]  H. Chernoff A Measure of Asymptotic Efficiency for Tests of a Hypothesis Based on the sum of Observations , 1952 .

[38]  Hyung Sik Kim,et al.  Age-related changes in oxidative DNA damage and benzo(a)pyrene diolepoxide-I (BPDE-I)-DNA adduct levels in human stomach. , 2005, Journal of toxicology and environmental health. Part A.

[39]  F. Collins,et al.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits , 2009, Proceedings of the National Academy of Sciences.

[40]  Jan Vijg,et al.  Somatic mutations, genome mosaicism, cancer and aging. , 2014, Current opinion in genetics & development.

[41]  William Wheeler,et al.  Detectable clonal mosaicism and its relationship to aging and cancer , 2012, Nature Genetics.

[42]  P. Guttorp,et al.  The kinetics of clonal dominance in myeloproliferative disorders. , 2005, Blood.

[43]  Joshua F. McMichael,et al.  The Origin and Evolution of Mutations in Acute Myeloid Leukemia , 2012, Cell.

[44]  Subhajyoti De,et al.  Somatic mosaicism in healthy human tissues. , 2011, Trends in genetics : TIG.

[45]  William C Hines,et al.  Why don't we get more cancer? A proposed role of the microenvironment in restraining cancer progression , 2011, Nature Medicine.

[46]  J. DeGregori,et al.  Challenging the axiom: does the occurrence of oncogenic mutations truly limit cancer development with age? , 2013, Oncogene.

[47]  Steven J. M. Jones,et al.  Comprehensive molecular profiling of lung adenocarcinoma , 2014, Nature.

[48]  Annapurna Poduri,et al.  Single-Cell, Genome-wide Sequencing Identifies Clonal Somatic Copy-Number Variation in the Human Brain , 2014, Cell reports.

[49]  Vicky L Brandt,et al.  Safeguards for Cell Cooperation in Mouse Embryogenesis Shown by Genome-Wide Cheater Screen , 2013, Science.

[50]  David R. Hunter,et al.  mixtools: An R Package for Analyzing Mixture Models , 2009 .