Longitudinal cancer evolution from single cells

The rise of longitudinal single-cell sequencing experiments on patient-derived cell cultures, xenografts and organoids is opening new opportunities to track cancer evolution in single tumors and to investigate intra-tumor heterogeneity. This is particularly relevant when assessing the efficacy of therapies over time on the clonal composition of a tumor and in the identification of resistant subclones. We here introduce LACE (Longitudinal Analysis of Cancer Evolution), the first algorithmic framework that processes single-cell somatic mutation profiles from cancer samples collected at different time points and in distinct experimental settings, to produce longitudinal models of cancer evolution. Our approach solves a Boolean matrix factorization problem with phylogenetic constraints, by maximizing a weighted likelihood function computed on multiple time points, and we show with simulations that it outperforms state-of-the-art methods for both bulk and single-cell sequencing data. Remarkably, as the results are robust with respect to high levels of data-specific errors, LACE can be employed to process single-cell mutational profiles as generated by calling variants from the increasingly available scRNA-seq data, thus obviating the need of relying on rarer and more expensive genome sequencing experiments. This also allows to investigate the relation between genomic clonal evolution and phenotype at the single-cell level. To illustrate the capabilities of LACE, we show its application to a longitudinal scRNA-seq dataset of patient-derived xenografts of BRAFV600E/K mutant melanomas, in which we characterize the impact of concurrent BRAF/MEK-inhibition on clonal evolution, also by showing that distinct genetic clones reveal different sensitivity to the therapy. Furthermore, the analysis of a longitudinal dataset of breast cancer PDXs from targeted scDNA-sequencing experiments delivers a high-resolution characterization of intra-tumor heterogeneity, also allowing the detection of a late de novo subclone.

[1]  Fabian J Theis,et al.  SCANPY: large-scale single-cell gene expression data analysis , 2018, Genome Biology.

[2]  Renier J. Brentjens,et al.  Driving CAR T-cells forward , 2016, Nature Reviews Clinical Oncology.

[3]  M. Carroll,et al.  BCR-ABL kinase is dead; long live the CML stem cell. , 2011, The Journal of clinical investigation.

[4]  Matthew A Myers,et al.  CALDER: Inferring Phylogenetic Trees from Longitudinal Tumor Samples. , 2019, Cell systems.

[5]  M. Nosrati,et al.  A multi-marker assay to distinguish malignant melanomas from benign nevi , 2009, Proceedings of the National Academy of Sciences.

[6]  Alexandre Bouchard-Côté,et al.  ddClone: joint statistical inference of clonal populations from single cell and bulk tumour sequencing data , 2017, Genome Biology.

[7]  Aleksandra A. Kolodziejczyk,et al.  Characterizing noise structure in single-cell RNA-seq distinguishes genuine from technical stochastic allelic expression , 2015, Nature Communications.

[8]  Laura M. Heiser,et al.  Tumor-Derived Cell Lines as Molecular Models of Cancer Pharmacogenomics , 2015, Molecular Cancer Research.

[9]  Giulio Caravagna,et al.  Learning mutational graphs of individual tumour evolution from single-cell and multi-region sequencing data , 2017, BMC Bioinformatics.

[10]  Feifang Hu,et al.  The Relevance Weighted Likelihood With Applications , 2001 .

[11]  Giancarlo Mauri,et al.  Algorithmic methods to infer the evolutionary trajectories in cancer progression , 2015, Proceedings of the National Academy of Sciences.

[12]  A. Bouchard-Côté,et al.  PyClone: statistical inference of clonal population structure in cancer , 2014, Nature Methods.

[13]  Maxim Teslenko,et al.  MrBayes 3.2: Efficient Bayesian Phylogenetic Inference and Model Choice Across a Large Model Space , 2012, Systematic biology.

[14]  M. Kimura The number of heterozygous nucleotide sites maintained in a finite population due to steady flux of mutations. , 1969, Genetics.

[15]  Luay Nakhleh,et al.  SiCloneFit: Bayesian inference of population structure, genotype, and phylogeny of tumor clones from single-cell genome sequencing data , 2019, Genome Research.

[16]  R. Hagedoorn,et al.  Adoptive Immunotherapy Using PRAME-Specific T Cells in Medulloblastoma. , 2018, Cancer research.

[17]  A. Bardelli,et al.  Radiologic and Genomic Evolution of Individual Metastases during HER2 Blockade in Colorectal Cancer. , 2018, Cancer cell.

[18]  Benjamin J. Raphael,et al.  Reconstruction of clonal trees and tumor composition from multi-sample sequencing data , 2015, Bioinform..

[19]  Mohammed El-Kebir,et al.  SPhyR: tumor phylogeny estimation from single-cell sequencing data under loss and error , 2018, Bioinform..

[20]  Ana Maria Tomé,et al.  From Binary NMF to Variational Bayes NMF: A Probabilistic Approach , 2016 .

[21]  George Thomas,et al.  Ribosome biogenesis in cancer: new players and therapeutic avenues , 2017, Nature Reviews Cancer.

[22]  Fenglin Liu,et al.  Systematic comparative analysis of single-nucleotide variant detection methods from single-cell RNA sequencing data , 2019, Genome Biology.

[23]  R. Irizarry,et al.  Missing data and technical variability in single‐cell RNA‐sequencing experiments , 2018, Biostatistics.

[24]  Hans Clevers,et al.  Modeling Development and Disease with Organoids , 2016, Cell.

[25]  M. DePristo,et al.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. , 2010, Genome research.

[26]  Kok Siong Ang,et al.  A benchmark of batch-effect correction methods for single-cell RNA sequencing data , 2020, Genome Biology.

[27]  N. McGranahan,et al.  The causes and consequences of genetic heterogeneity in cancer evolution , 2013, Nature.

[28]  Jong Kyoung Kim,et al.  Corrigendum: Characterizing noise structure in single-cell RNA-seq distinguishes genuine from technical stochastic allelic expression , 2015, Nature Communications.

[29]  Alexey M. Kozlov,et al.  Eleven grand challenges in single-cell data science , 2020, Genome Biology.

[30]  Giulio Caravagna,et al.  Detecting repeated cancer evolution from multi-region tumor sequencing data , 2018, Nature Methods.

[31]  C. Tyler-Smith,et al.  Ancient DNA and the rewriting of human history: be sparing with Occam’s razor , 2016, Genome Biology.

[32]  Benjamin J. Raphael,et al.  Inferring the Mutational History of a Tumor Using Multi-state Perfect Phylogeny Mixtures. , 2016, Cell systems.

[33]  Joseph M. Scandura,et al.  Genotyping of Transcriptomes links somatic mutations and cell identity , 2019, Nature.

[34]  Shawn M. Gillespie,et al.  Single-cell RNA-seq highlights intratumoral heterogeneity in primary glioblastoma , 2014, Science.

[35]  D. Posada,et al.  Multiregional Tumor Trees Are Not Phylogenies , 2017, Trends in cancer.

[36]  K. Kinzler,et al.  Cancer Genome Landscapes , 2013, Science.

[37]  Aiqing Li,et al.  Identification of COL1A1 and COL1A2 as candidate prognostic factors in gastric cancer , 2016, World Journal of Surgical Oncology.

[38]  Paola Bonizzoni,et al.  Does relaxing the infinite sites assumption give better tumor phylogenies? An ILP-based comparative approach , 2017, bioRxiv.

[39]  Fabian J Theis,et al.  Current best practices in single‐cell RNA‐seq analysis: a tutorial , 2019, Molecular systems biology.

[40]  E. Shapiro,et al.  Single-cell sequencing-based technologies will revolutionize whole-organism science , 2013, Nature Reviews Genetics.

[41]  C. Ponting,et al.  G&T-seq: parallel sequencing of single-cell genomes and transcriptomes , 2015, Nature Methods.

[42]  J. Troge,et al.  Tumour evolution inferred by single-cell sequencing , 2011, Nature.

[43]  Florian Markowetz,et al.  OncoNEM: inferring tumor evolution from single-cell sequencing data , 2016, Genome Biology.

[44]  Benjamin Werner,et al.  Longitudinal Liquid Biopsy and Mathematical Modeling of Clonal Evolution Forecast Time to Treatment Failure in the PROSPECT-C Phase II Colorectal Cancer Clinical Trial. , 2018, Cancer discovery.

[45]  R. Tibshirani,et al.  Local Likelihood Estimation , 1987 .

[46]  Fabian J. Theis,et al.  Diffusion maps for high-dimensional single-cell analysis of differentiation data , 2015, Bioinform..

[47]  M. DePristo,et al.  A framework for variation discovery and genotyping using next-generation DNA sequencing data , 2011, Nature Genetics.

[48]  Jack Kuipers,et al.  Integrative inference of subclonal tumour evolution from single-cell and bulk sequencing data , 2019, Nature Communications.

[49]  Dan Gusfield,et al.  Efficient algorithms for inferring evolutionary trees , 1991, Networks.

[50]  Ken Chen,et al.  Genotyping tumor clones from single-cell data , 2016, Nature Methods.

[51]  Martin J. Aryee,et al.  Lineage Tracing in Humans Enabled by Mitochondrial Mutations and Single-Cell Genomics , 2019, Cell.

[52]  Chun Cheng,et al.  Chaperonin containing TCP1, subunit 8 (CCT8) is upregulated in hepatocellular carcinoma and promotes HCC proliferation , 2014, APMIS : acta pathologica, microbiologica, et immunologica Scandinavica.

[53]  H. Hakonarson,et al.  ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data , 2010, Nucleic acids research.

[54]  V. P. Collins,et al.  Intratumor heterogeneity in human glioblastoma reflects cancer evolutionary dynamics , 2013, Proceedings of the National Academy of Sciences.

[55]  N. Ahmad,et al.  Large-Scale Label-Free Comparative Proteomics Analysis of Polo-Like Kinase 1 Inhibition via the Small-Molecule Inhibitor BI 6727 (Volasertib) in BRAFV600E Mutant Melanoma Cells , 2014, Journal of proteome research.

[56]  Hossein Farahani,et al.  E-scape: interactive visualization of single-cell phylogenetics and cancer evolution , 2017, Nature Methods.

[57]  K. Flaherty,et al.  Toward Minimal Residual Disease-Directed Therapy in Melanoma , 2018, Cell.

[58]  P. Nowell The clonal evolution of tumor cell populations. , 1976, Science.

[59]  S. Ariyan,et al.  Genome-wide screen of promoter methylation identifies novel markers in melanoma. , 2009, Genome research.

[60]  Sohrab P. Shah,et al.  Dynamics of genomic clones in breast cancer patient xenografts at single-cell resolution , 2014, Nature.

[61]  Ken Chen,et al.  SiFit: inferring tumor trees from single-cell sequencing data under finite-sites models , 2017, Genome Biology.

[62]  Benjamin J. Raphael,et al.  Inferring Parsimonious Migration Histories for Metastatic Cancers , 2018, Nature Genetics.

[63]  Mengjie Chen,et al.  SNV identification from single-cell RNA sequencing data. , 2019, Human molecular genetics.

[64]  R. Gillies,et al.  Evolutionary dynamics of carcinogenesis and why targeted therapy does not work , 2012, Nature Reviews Cancer.

[65]  Nathaniel D. Omans,et al.  Genotyping of Transcriptomes links somatic mutations and cell identity , 2019, Nature.

[66]  J. Vijg,et al.  Single-cell whole-genome sequencing reveals the functional landscape of somatic mutations in B lymphocytes across the human lifespan , 2019, Proceedings of the National Academy of Sciences.

[67]  P. A. Futreal,et al.  High-throughput single-cell DNA sequencing of acute myeloid leukemia tumors with droplet microfluidics , 2018, Genome research.

[68]  M. Copland,et al.  CD93 is expressed on chronic myeloid leukemia stem cells and identifies a quiescent population which persists after tyrosine kinase inhibitor therapy , 2020, Leukemia.

[69]  Hui Sun Leong,et al.  Longitudinal single-cell RNA sequencing of patient-derived primary cells reveals drug-induced infidelity in stem cell hierarchy , 2018, Nature Communications.

[70]  K. Holt,et al.  Performance of neural network basecalling tools for Oxford Nanopore sequencing , 2019, Genome Biology.