Supervised mutational signatures for obesity and other tissue-specific etiological factors in cancer

Determining the etiologic basis of the mutations that are responsible for cancer is one of the fundamental challenges in modern cancer research. Different mutational processes induce different types of DNA mutations, providing ‘mutational signatures’ that have led to key insights into cancer etiology. The most widely used signatures for assessing genomic data are based on unsupervised patterns that are then retrospectively correlated with certain features of cancer. We show here that supervised machine-learning techniques can identify signatures, called SuperSigs, that are more predictive than those currently available. Surprisingly, we found that aging yields different SuperSigs in different tissues, and the same is true for environmental exposures. We were able to discover SuperSigs associated with obesity, the most important lifestyle factor contributing to cancer in Western populations.

[1]  Forrest W. Crawford,et al.  Using sigLASSO to optimize cancer mutation signatures jointly with sampling likelihood , 2020, Nature Communications.

[2]  C. Tomasetti Mutated clones are the new normal , 2019, Science.

[3]  W. Willett,et al.  Cancer prevention: Molecular and epidemiologic consensus , 2018, Science.

[4]  R. Durrett,et al.  Role of stem-cell divisions in cancer risk , 2017, Nature.

[5]  B. Vogelstein,et al.  Stem cell divisions, somatic mutations, cancer etiology, and cancer prevention , 2017, Science.

[6]  Hans Clevers,et al.  Tissue-specific mutation accumulation in human adult stem cells during life , 2016, Nature.

[7]  Isidro Cortes-Ciriano,et al.  The Impact of Environmental and Endogenous Damage on Somatic Mutation Load in Human Skin Fibroblasts , 2016, PLoS genetics.

[8]  E. Giovannucci,et al.  Preventable Incidence and Mortality of Carcinoma Associated With Lifestyle Factors Among White Adults in the United States. , 2016, JAMA oncology.

[9]  J. Manson,et al.  Determinants and Consequences of Obesity. , 2016, American journal of public health.

[10]  M. Stratton,et al.  Mutational signatures associated with tobacco smoking in human cancer , 2016, Science.

[11]  G. Parmigiani,et al.  Familial Risk and Heritability of Cancer Among Twins in Nordic Countries. , 2016, JAMA.

[12]  M. Stratton,et al.  Clock-like mutational processes in human somatic cells , 2015, Nature Genetics.

[13]  D. Sahoo,et al.  CD14-expressing cancer cells establish the inflammatory and proliferative tumor microenvironment in bladder cancer , 2015, Proceedings of the National Academy of Sciences.

[14]  A. Morris,et al.  Cancer etiology. Variation in cancer risk among tissues can be explained by the number of stem cell divisions , 2015, BDJ.

[15]  Paz Polak,et al.  Cell-of-origin chromatin organization shapes the mutational landscape of cancer , 2015, Nature.

[16]  B. Vogelstein,et al.  Variation in cancer risk among tissues can be explained by the number of stem cell divisions , 2015, Science.

[17]  David T. W. Jones,et al.  Signatures of mutational processes in human cancer , 2013, Nature.

[18]  K. Kinzler,et al.  Mutational Signature of Aristolochic Acid Exposure as Revealed by Whole-Exome Sequencing , 2013, Science Translational Medicine.

[19]  P. A. Futreal,et al.  Genome-Wide Mutational Signatures of Aristolochic Acid and Its Application as a Screening Tool , 2013, Science Translational Medicine.

[20]  M. Stratton,et al.  Deciphering Signatures of Mutational Processes Operative in Human Cancer , 2013, Cell reports.

[21]  Giovanni Parmigiani,et al.  Half or more of the somatic mutations in cancers of self-renewing tissues originate prior to tumor initiation , 2013, Proceedings of the National Academy of Sciences.

[22]  Li Ding,et al.  Genomic Landscape of Non-Small Cell Lung Cancer in Smokers and Never-Smokers , 2012, Cell.

[23]  Angela N. Brooks,et al.  Mapping the Hallmarks of Lung Adenocarcinoma with Massively Parallel Sequencing , 2012, Cell.

[24]  G. Cumming,et al.  Smoking and the Lung , 2011 .

[25]  Kenneth Offit,et al.  Genome-wide association studies of cancer. , 2010, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[26]  Gina Lee,et al.  Smoking and lung cancer: the role of inflammation. , 2008, Proceedings of the American Thoracic Society.

[27]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[28]  P. Hainaut,et al.  Patterns of p53 G-->T transversions in lung cancers reflect the primary mutagenic signature of DNA-damage by tobacco smoke. , 2001, Carcinogenesis.

[29]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[30]  W. Peng,et al.  Accelerated deamination of cytosine residues in UV-induced cyclobutane pyrimidine dimers leads to CC-->TT transitions. , 1996, Biochemistry.

[31]  J. Essigmann,et al.  Mutational properties of the primary aflatoxin B1-DNA adduct. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[32]  G A Colditz,et al.  Physical Activity, Obesity, and Risk for Colon Cancer and Adenoma in Men , 1995, Annals of Internal Medicine.

[33]  G. Pfeifer Mutagenesis at methylated CpG sequences. , 2006, Current topics in microbiology and immunology.