SciClone: Inferring Clonal Architecture and Tracking the Spatial and Temporal Patterns of Tumor Evolution

The sensitivity of massively-parallel sequencing has confirmed that most cancers are oligoclonal, with subpopulations of neoplastic cells harboring distinct mutations. A fine resolution view of this clonal architecture provides insight into tumor heterogeneity, evolution, and treatment response, all of which may have clinical implications. Single tumor analysis already contributes to understanding these phenomena. However, cryptic subclones are frequently revealed by additional patient samples (e.g., collected at relapse or following treatment), indicating that accurately characterizing a tumor requires analyzing multiple samples from the same patient. To address this need, we present SciClone, a computational method that identifies the number and genetic composition of subclones by analyzing the variant allele frequencies of somatic mutations. We use it to detect subclones in acute myeloid leukemia and breast cancer samples that, though present at disease onset, are not evident from a single primary tumor sample. By doing so, we can track tumor evolution and identify the spatial origins of cells resisting therapy.

[1]  J. Salk Clonal evolution in cancer , 2010 .

[2]  Claude Preudhomme,et al.  Several types of mutations of the Abl gene can be found in chronic myeloid leukemia patients resistant to STI571, and they can pre-exist to the onset of treatment. , 2002, Blood.

[3]  Aleix Prat Aparicio Comprehensive molecular portraits of human breast tumours , 2012 .

[4]  Samuel Aparicio,et al.  Opening Pandora's Box--the new biology of driver mutations and clonal evolution in cancer as revealed by next generation sequencing. , 2012, Current opinion in genetics & development.

[5]  J. Troge,et al.  Tumour evolution inferred by single-cell sequencing , 2011, Nature.

[6]  Hagai Attias,et al.  Inferring Parameters and Structure of Latent Variable Models by Variational Bayes , 1999, UAI.

[7]  D. Shalloway,et al.  Efficient uncertainty minimization for fuzzy spectral clustering. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  A. Bashashati,et al.  Integrative analysis of genome-wide loss of heterozygosity and monoallelic expression at nucleotide resolution reveals disrupted pathways in triple-negative breast cancer , 2012, Genome research.

[9]  A. Bouchard-Côté,et al.  PyClone: statistical inference of clonal population structure in cancer , 2014, Nature Methods.

[10]  E. B. Andersen,et al.  Information Science and Statistics , 1986 .

[11]  S. Hochreiter,et al.  cn.MOPS: mixture of Poissons for discovering copy number variations in next-generation sequencing data with a low false discovery rate , 2012, Nucleic acids research.

[12]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[13]  L. Pusztai,et al.  Cancer heterogeneity: implications for targeted therapeutics , 2013, British Journal of Cancer.

[14]  Ken Chen,et al.  Recurring mutations found by sequencing an acute myeloid leukemia genome. , 2009, The New England journal of medicine.

[15]  David Shalloway,et al.  Runx1 and p21 synergistically limit the extent of hair follicle stem cell quiescence in vivo , 2013, Proceedings of the National Academy of Sciences.

[16]  Joshua F. McMichael,et al.  DGIdb - Mining the druggable genome , 2013, Nature Methods.

[17]  Christopher M. Bishop,et al.  Robust Bayesian Mixture Modelling , 2005, ESANN.

[18]  J. Carpten,et al.  Clonal competition with alternating dominance in multiple myeloma. , 2012, Blood.

[19]  A. Børresen-Dale,et al.  The Life History of 21 Breast Cancers , 2012, Cell.

[20]  Arne Leijon,et al.  Bayesian Estimation of Beta Mixture Models with Variational Inference , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Joshua F. McMichael,et al.  Clonal evolution in relapsed acute myeloid leukemia revealed by whole genome sequencing , 2011, Nature.

[22]  Huanming Yang,et al.  Single-Cell Exome Sequencing Reveals Single-Nucleotide Mutation Characteristics of a Kidney Tumor , 2012, Cell.

[23]  Todd Richmond,et al.  Detection of Clinically Relevant Copy Number Variants with Whole‐Exome Sequencing , 2013, Human mutation.

[24]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[25]  Hagai Attias,et al.  A Variational Bayesian Framework for Graphical Models , 1999 .

[26]  Joshua F. McMichael,et al.  Genome Remodeling in a Basal-like Breast Cancer Metastasis and Xenograft , 2010, Nature.

[27]  C. Perou,et al.  Allele-specific copy number analysis of tumors , 2010, Proceedings of the National Academy of Sciences.

[28]  Xiaohong Li,et al.  A Comprehensive Survey of Clonal Diversity Measures in Barrett's Esophagus as Biomarkers of Progression to Esophageal Adenocarcinoma , 2010, Cancer Prevention Research.

[29]  G. Parmigiani,et al.  Heterogeneity of genomic evolution and mutational profiles in multiple myeloma , 2014, Nature Communications.

[30]  L. N. Kanal,et al.  Uncertainty in Artificial Intelligence 5 , 1990 .

[31]  P. A. Futreal,et al.  Intratumor heterogeneity and branched evolution revealed by multiregion sequencing. , 2012, The New England journal of medicine.

[32]  Irmtraud M. Meyer,et al.  The clonal and mutational evolution spectrum of primary triple-negative breast cancers , 2012, Nature.

[33]  A. McKenna,et al.  Absolute quantification of somatic DNA alterations in human cancer , 2012, Nature Biotechnology.

[34]  Omar Abdel-Wahab,et al.  The common feature of leukemia-associated IDH1 and IDH2 mutations is a neomorphic enzyme activity converting alpha-ketoglutarate to 2-hydroxyglutarate. , 2010, Cancer cell.

[35]  J. Troge,et al.  Inferring tumor progression from genomic heterogeneity. , 2010, Genome research.

[36]  Luca Toschi,et al.  Preexistence and clonal selection of MET amplification in EGFR mutant NSCLC. , 2010, Cancer cell.

[37]  A. McKenna,et al.  Evolution and Impact of Subclonal Mutations in Chronic Lymphocytic Leukemia , 2012, Cell.

[38]  Benjamin J. Raphael,et al.  THetA: inferring intra-tumor heterogeneity from high-throughput DNA sequencing data , 2013, Genome Biology.

[39]  Yan Guo,et al.  Comparative Study of Exome Copy Number Variation Estimation Tools Using Array Comparative Genomic Hybridization as Control , 2013, BioMed research international.

[40]  Huanming Yang,et al.  Single-Cell Exome Sequencing and Monoclonal Evolution of a JAK2-Negative Myeloproliferative Neoplasm , 2012, Cell.

[41]  David Shalloway,et al.  Macrostate data clustering. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[42]  Manuela Zucknick,et al.  IDH1 and IDH2 mutations are frequent genetic alterations in acute myeloid leukemia and confer adverse prognosis in cytogenetically normal acute myeloid leukemia with NPM1 mutation without FLT3 internal tandem duplication. , 2010, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[43]  John K Kruschke,et al.  Bayesian data analysis. , 2010, Wiley interdisciplinary reviews. Cognitive science.

[44]  Christopher A. Miller,et al.  VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing. , 2012, Genome research.

[45]  Steven J. M. Jones,et al.  Comprehensive molecular portraits of human breast tumors , 2012, Nature.

[46]  Joshua F. McMichael,et al.  The Origin and Evolution of Mutations in Acute Myeloid Leukemia , 2012, Cell.

[47]  Matthew J. Beal Variational algorithms for approximate Bayesian inference , 2003 .

[48]  M. Nowak,et al.  Distant Metastasis Occurs Late during the Genetic Evolution of Pancreatic Cancer , 2010, Nature.

[49]  Steven J. M. Jones,et al.  Integrated genomic characterization of endometrial carcinoma , 2013, Nature.

[50]  M. Stratton,et al.  Subclonal phylogenetic structures in cancer revealed by ultra-deep sequencing , 2008, Proceedings of the National Academy of Sciences.

[51]  J. Carpten,et al.  Whole-genome sequencing of multiple myeloma from diagnosis to plasma cell leukemia reveals genomic initiating events, evolution, and clonal tides. , 2012, Blood.

[52]  Fang Wang,et al.  Targeted Inhibition of Mutant IDH2 in Leukemia Cells Induces Cellular Differentiation , 2013, Science.

[53]  Sung-Liang Yu,et al.  Pretreatment epidermal growth factor receptor (EGFR) T790M mutation predicts shorter EGFR tyrosine kinase inhibitor response duration in patients with non-small-cell lung cancer. , 2012, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[54]  Charles Swanton,et al.  Intratumor Heterogeneity: Seeing the Wood for the Trees , 2012, Science Translational Medicine.

[55]  K. Anderson,et al.  Genetic variegation of clonal architecture and propagating cells in leukaemia , 2011, Nature.

[56]  Christopher A. Miller,et al.  Clonal Architecture of Secondary Acute Myeloid Leukemia Defined by Single-Cell Sequencing , 2014, PLoS genetics.

[57]  N. Munshi,et al.  Minor clone provides a reservoir for relapse in multiple myeloma , 2013, Leukemia.

[58]  Shankar Vembu,et al.  Inferring clonal evolution of tumors from single nucleotide somatic mutations , 2012, BMC Bioinformatics.

[59]  E Mardis,et al.  Clonal diversity of recurrently mutated genes in myelodysplastic syndromes , 2013, Leukemia.

[60]  M. Gerlinger,et al.  How Darwinian models inform therapeutic failure initiated by clonal heterogeneity in cancer medicine , 2010, British Journal of Cancer.

[61]  Chris Sander,et al.  Emerging landscape of oncogenic signatures across human cancers , 2013, Nature Genetics.

[62]  Nizar Bouguila,et al.  Variational Learning for Finite Dirichlet Mixture Models and Applications , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[63]  P. Nowell The clonal evolution of tumor cell populations. , 1976, Science.