Predicting clone genotypes from tumor bulk sequencing of multiple samples

Motivation Analyses of data generated from bulk sequencing of tumors have revealed extensive genomic heterogeneity within patients. Many computational methods have been developed to enable the inference of genotypes of tumor cell populations (clones) from bulk sequencing data. However, the relative and absolute accuracy of available computational methods in estimating clone counts and clone genotypes is not yet known. Results We have assessed the performance of nine methods, including eight previously-published and one new method (CloneFinder), by analyzing computer simulated datasets. CloneFinder, LICHeE, CITUP and cloneHD inferred clone genotypes with low error (<5% per clone) for a majority of datasets in which the tumor samples contained evolutionarily-related clones. Computational methods did not perform well for datasets in which tumor samples contained mixtures of clones from different clonal lineages. Generally, the number of clones was underestimated by cloneHD and overestimated by PhyloWGS, and BayClone2, Canopy and Clomial required prior information regarding the number of clones. AncesTree and Canopy did not produce results for a large number of datasets. Overall, the deconvolution of clone genotypes from single nucleotide variant (SNV) frequency differences among tumor samples remains challenging, so there is a need to develop more accurate computational methods and robust software for clone genotype inference. Availability and implementation CloneFinder is implemented in Python and is available from https://github.com/gstecher/CloneFinderAPI. Supplementary information Supplementary data are available at Bioinformatics online.

[1]  Nancy R. Zhang,et al.  Assessing intratumor heterogeneity and tracking longitudinal and spatial clonal evolutionary history by next-generation sequencing , 2016, Proceedings of the National Academy of Sciences.

[2]  Krishnendu Chatterjee,et al.  Reconstructing metastatic seeding patterns of human cancers , 2017, Nature Communications.

[3]  Ali Bashashati,et al.  Divergent modes of clonal spread and intraperitoneal mixing in high-grade serous ovarian cancer , 2016, Nature Genetics.

[4]  C. Curtis,et al.  A Big Bang model of human colorectal tumor growth , 2015, Nature Genetics.

[5]  Benjamin J. Raphael,et al.  Integrated genomic characterization of oesophageal carcinoma , 2017, Nature.

[6]  Nicholas Navin,et al.  Tumor evolution: Linear, branching, neutral or punctuated? , 2017, Biochimica et biophysica acta. Reviews on cancer.

[7]  Iman Hajirasouliha,et al.  Fast and scalable inference of multi-sample cancer lineages , 2014, Genome Biology.

[8]  Anne-Marie Mes-Masson,et al.  Engineered in-vitro cell line mixtures and robust evaluation of computational methods for clonal decomposition and longitudinal dynamics in cancer , 2017, Scientific Reports.

[9]  Benjamin J. Raphael,et al.  Reconstruction of clonal trees and tumor composition from multi-sample sequencing data , 2015, Bioinform..

[10]  S. Tavaré,et al.  Whole-genome sequencing provides new insights into the clonal architecture of Barrett’s esophagus and esophageal adenocarcinoma , 2015, Nature Genetics.

[11]  Hai-Jian Sun,et al.  Recent advances and current issues in single-cell sequencing of tumors. , 2015, Cancer letters.

[12]  Shankar Vembu,et al.  Inferring clonal evolution of tumors from single nucleotide somatic mutations , 2012, BMC Bioinformatics.

[13]  William Stafford Noble,et al.  Inferring Clonal Composition from Multiple Sections of a Breast Cancer , 2014, PLoS Comput. Biol..

[14]  R. Arceci Intratumor Heterogeneity and Branched Evolution Revealed by Multiregion Sequencing , 2012 .

[15]  Jenny Taylor,et al.  Monitoring chronic lymphocytic leukemia progression by whole genome sequencing reveals heterogeneous clonal evolution patterns. , 2012, Blood.

[16]  Yuan Ji,et al.  BayClone: Bayesian Nonparametric Inference of Tumor Subclones Using NGS Data , 2014, Pacific Symposium on Biocomputing.

[17]  Fabio Vandin Computational Methods for Characterizing Cancer Mutational Heterogeneity , 2017, Front. Genet..

[18]  F. Markowetz,et al.  How Subclonal Modeling Is Changing the Metastatic Paradigm , 2016, Clinical Cancer Research.

[19]  K. Schittkowski,et al.  NONLINEAR PROGRAMMING , 2022 .

[20]  Woo Suk Hong,et al.  Inferring the Origin of Metastases from Cancer Phylogenies. , 2015, Cancer research.

[21]  F. Markowetz,et al.  Cancer Evolution: Mathematical Models and Computational Inference , 2014, Systematic biology.

[22]  S. Singh,et al.  Autophagy in Cancer Stem Cells: A Potential Link Between Chemoresistance, Recurrence, and Metastasis , 2015, BioResearch open access.

[23]  P. A. Futreal,et al.  Genomic architecture and evolution of clear cell renal cell carcinomas defined by multiregion sequencing , 2014, Nature Genetics.

[24]  Nilgun Donmez,et al.  Clonality inference in multiple tumor samples using phylogeny , 2015, Bioinform..

[25]  M. Nykter,et al.  The Evolutionary History of Lethal Metastatic Prostate Cancer , 2015, Nature.

[26]  Shankar Vembu,et al.  PhyloWGS: Reconstructing subclonal composition and evolution from whole-genome sequencing of tumors , 2015, Genome Biology.

[27]  M. Nei,et al.  A new method of inference of ancestral nucleotide and amino acid sequences. , 1995, Genetics.

[28]  Satoru Miyano,et al.  Integrated Multiregional Analysis Proposing a New Model of Colorectal Cancer Evolution , 2016, PLoS genetics.

[29]  D. Posada,et al.  Multiregional Tumor Trees Are Not Phylogenies , 2017, Trends in cancer.

[30]  N. McGranahan,et al.  Inferring mutational timing and reconstructing tumour evolutionary histories. , 2015, Biochimica et biophysica acta.

[31]  Christina Curtis,et al.  A population genetics perspective on the determinants of intra-tumor heterogeneity. , 2017, Biochimica et biophysica acta. Reviews on cancer.

[32]  Christopher J. R. Illingworth,et al.  High-Definition Reconstruction of Clonal Composition in Cancer , 2014, Cell reports.

[33]  A. McKenna,et al.  Evolution and Impact of Subclonal Mutations in Chronic Lymphocytic Leukemia , 2012, Cell.

[34]  Z. Szallasi,et al.  Spatial and temporal diversity in genomic instability processes defines lung cancer evolution , 2014, Science.

[35]  Serena Nik-Zainal,et al.  Mechanisms underlying mutational signatures in human cancers , 2014, Nature Reviews Genetics.

[36]  W. Koh,et al.  Single-cell genome sequencing: current state of the science , 2016, Nature Reviews Genetics.

[37]  M. Nei,et al.  Molecular Evolution and Phylogenetics , 2000 .

[38]  Dimitri P. Bertsekas,et al.  Nonlinear Programming , 1997 .

[39]  H. Heng,et al.  Evolutionary mechanism unifies the hallmarks of cancer , 2015, International journal of cancer.

[40]  A. Børresen-Dale,et al.  The Life History of 21 Breast Cancers , 2012, Cell.

[41]  A. McKenna,et al.  Paired Exome Analysis of Barrett’s Esophagus and Adenocarcinoma , 2015, Nature Genetics.

[42]  N. Navin,et al.  The first five years of single-cell cancer genomics and beyond , 2015, Genome research.