FastClone is a probabilistic tool for deconvoluting tumor heterogeneity in bulk-sequencing samples

Dissecting tumor heterogeneity is a key to understanding the complex mechanisms underlying drug resistance in cancers. The rich literature of pioneering studies on tumor heterogeneity analysis spurred a recent community-wide benchmark study that compares diverse modeling algorithms. Here we present FastClone, a top-performing algorithm in accuracy in this benchmark. FastClone improves over existing methods by allowing the deconvolution of subclones that have independent copy number variation events within the same chromosome regions. We characterize the behavior of FastClone in identifying subclones using stage III colon cancer primary tumor samples as well as simulated data. It achieves approximately 100-fold acceleration in computation for both simulated and patient data. The efficacy of FastClone will allow its application to large-scale data and clinical data, and facilitate personalized medicine in cancers.

[1]  Jan Schröder,et al.  Socrates: identification of genomic rearrangements in tumour genomes by re-aligning soft clipped reads , 2014, Bioinform..

[2]  Hao Chen,et al.  Integrative pipeline for profiling DNA copy number and inferring tumor phylogeny , 2017, bioRxiv.

[3]  Nancy R. Zhang,et al.  Assessing intratumor heterogeneity and tracking longitudinal and spatial clonal evolutionary history by next-generation sequencing , 2016, Proceedings of the National Academy of Sciences.

[4]  C. Swanton,et al.  Implications of intratumour heterogeneity for treatment stratification , 2014, The Journal of pathology.

[5]  David C Wedge,et al.  Principles of Reconstructing the Subclonal Architecture of Cancers. , 2017, Cold Spring Harbor perspectives in medicine.

[6]  P. Nowell The clonal evolution of tumor cell populations. , 1976, Science.

[7]  Yixuan Wang,et al.  A graph-based algorithm for estimating clonal haplotypes of tumor sample from sequencing data , 2019, BMC Medical Genomics.

[8]  Ken Chen,et al.  Combining accurate tumor genome simulation with crowdsourcing to benchmark somatic structural variant detection , 2018, bioRxiv.

[9]  Niko Beerenwinkel,et al.  BitPhylogeny: a probabilistic framework for reconstructing intra-tumor phylogenies , 2015, Genome Biology.

[10]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[11]  Charles Swanton,et al.  Translational Implications of Tumor Heterogeneity , 2015, Clinical Cancer Research.

[12]  A. Bouchard-Côté,et al.  PyClone: statistical inference of clonal population structure in cancer , 2014, Nature Methods.

[13]  Shankar Vembu,et al.  Inferring clonal evolution of tumors from single nucleotide somatic mutations , 2012, BMC Bioinformatics.

[14]  Jack Kuipers,et al.  Integrative inference of subclonal tumour evolution from single-cell and bulk sequencing data , 2019, Nature Communications.

[15]  Chen Wang,et al.  CancerTracer: a curated database for intrapatient tumor heterogeneity , 2019, Nucleic Acids Res..

[16]  D. W. Scott,et al.  Multivariate Density Estimation, Theory, Practice and Visualization , 1992 .

[17]  Nathan M. Wilson,et al.  A community effort to create standards for evaluating tumor subclonal reconstruction , 2019, Nature Biotechnology.

[18]  Ali Moeini,et al.  BAMSE: Bayesian model selection for tumor phylogeny inference among multiple samples , 2019, BMC Bioinformatics.

[19]  A. Jackson,et al.  The mutation rate and cancer. , 1998, Genetics.

[20]  A. McKenna,et al.  Widespread genetic heterogeneity in multiple myeloma: implications for targeted therapy. , 2014, Cancer cell.

[21]  Daniel Q. Naiman,et al.  SubClonal Hierarchy Inference from Somatic Mutations: Automatic Reconstruction of Cancer Evolutionary Trees from Multi-region Next Generation Sequencing , 2015, bioRxiv.

[22]  A. Bardelli,et al.  Somatic alterations as the basis for resistance to targeted therapies , 2014, The Journal of pathology.

[23]  James D. Brenton,et al.  Phylogenetic Quantification of Intra-tumour Heterogeneity , 2013, PLoS Comput. Biol..

[24]  Leif E. Peterson,et al.  Progression inference for somatic mutations in cancer , 2017, Heliyon.

[25]  P. Johnston,et al.  Cancer drug resistance: an evolving paradigm , 2013, Nature Reviews Cancer.

[26]  G. Parmigiani,et al.  Heterogeneity of genomic evolution and mutational profiles in multiple myeloma , 2014, Nature Communications.

[27]  Giovanni Martinelli,et al.  Optimized pipeline of MuTect and GATK tools to improve the detection of somatic single nucleotide polymorphisms in whole-exome sequencing data , 2016, BMC Bioinformatics.

[28]  H. Wheeler,et al.  Molecular Heterogeneity in Glioblastoma: Potential Clinical Implications , 2015, Front. Oncol..

[29]  Anders Isaksson,et al.  Patchwork: allele-specific copy number analysis of whole-genome sequenced tumor tissue , 2013, Genome Biology.

[30]  Russell Schwartz,et al.  Deconvolution and phylogeny inference of structural variations in tumor genomic samples , 2018, bioRxiv.

[31]  Shankar Vembu,et al.  PhyloWGS: Reconstructing subclonal composition and evolution from whole-genome sequencing of tumors , 2015, Genome Biology.

[32]  Benjamin J. Raphael,et al.  Inferring the Mutational History of a Tumor Using Multi-state Perfect Phylogeny Mixtures. , 2016, Cell systems.

[33]  Joshua M. Stuart,et al.  Combining tumor genome simulation with crowdsourcing to benchmark somatic single-nucleotide-variant detection , 2015, Nature Methods.

[34]  Gabor T. Marth,et al.  SubcloneSeeker: a computational framework for reconstructing tumor clone structure for cancer variant interpretation and prioritization , 2014, Genome Biology.

[35]  Charles Swanton,et al.  Tumour heterogeneity and the evolution of polyclonal drug resistance , 2014, Molecular oncology.

[36]  Y. Kluger,et al.  TrAp: a tree approach for fingerprinting subclonal tumor composition , 2013, Nucleic acids research.

[37]  Obi L. Griffith,et al.  SciClone: Inferring Clonal Architecture and Tracking the Spatial and Temporal Patterns of Tumor Evolution , 2014, PLoS Comput. Biol..

[38]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[39]  A. Shaw,et al.  Tumour heterogeneity and resistance to cancer therapies , 2018, Nature Reviews Clinical Oncology.

[40]  K. Anderson,et al.  Genetic variegation of clonal architecture and propagating cells in leukaemia , 2011, Nature.

[41]  L. Looijenga,et al.  TargetClone: A multi-sample approach for reconstructing subclonal evolution of tumors , 2018, PloS one.

[42]  Nilgun Donmez,et al.  Clonality inference in multiple tumor samples using phylogeny , 2015, Bioinform..

[43]  Christopher J. R. Illingworth,et al.  High-Definition Reconstruction of Clonal Composition in Cancer , 2014, Cell reports.

[44]  E. Fearon,et al.  Lymph Node Metastases in Colon Cancer Are Polyclonal , 2017, Clinical Cancer Research.