QuantumClone: clonal assessment of functional mutations in cancer based on a genotype-aware method for clonal reconstruction

Abstract

Motivation: In cancer, clonal evolution is assessed using information from single nucleotide variants and copy number alterations. Nonetheless, existing methods often fail to combine information from both sources accurately enough to reconstruct the clonal populations in a given tumor sample, or in a set of tumor samples from the same patient. Moreover, previously published methods detect clones from a single set of variants. As a result, compromises have to be made between stringent variant filtering [which reduces dispersion in variant allele frequency estimates (VAFs)] and using all biologically relevant variants.

Results: We present a framework for defining cancer clones using the most reliable variants with high depth of coverage and then assigning functional mutations to the detected clones. The key element of our framework is QuantumClone, a method that clusters variants into clones based on VAFs, the genotypes of the corresponding regions and information about tumor purity. We validated QuantumClone and our framework on simulated data. We then applied our framework to whole genome sequencing data for 19 neuroblastoma trios, each including constitutional, diagnosis and relapse samples. We confirmed an enrichment of damaging variants in pathways such as MAPK (mitogen-activated protein kinases), neuritogenesis, epithelial-mesenchymal transition, cell survival and DNA repair. Most pathways had more damaging variants in the expanding clones than in the shrinking ones, which can be explained by the difference in the total number of variants between these two populations. The functional mutation rate differed between ancestral clones and clones that shrank or expanded upon treatment, suggesting changes in clone selection mechanisms at different time points of tumor evolution.

Availability and implementation: Source code and binaries of the QuantumClone R package are freely available for download at https://CRAN.R-project.org/package=QuantumClone.

Supplementary information: Supplementary data are available at Bioinformatics online.
