Parsimonious Clone Tree Reconciliation in Cancer

Every tumor is composed of heterogeneous clones, each corresponding to a distinct subpopulation of cells that accumulated different types of somatic mutations, ranging from single-nucleotide variants (SNVs) to copy-number aberrations (CNAs). As the analysis of this intra-tumor heterogeneity has important clinical applications, several computational methods have been introduced to identify clones from DNA sequencing data. However, due to technological and methodological limitations, current analyses are restricted to identifying tumor clones only based on either SNVs or CNAs, preventing a comprehensive characterization of a tumor’s clonal composition. To overcome these challenges, we formulate the identification of clones in terms of both SNVs and CNAs as a reconciliation problem while accounting for uncertainty in the input SNV and CNA proportions. We thus characterize the computational complexity of this problem and we introduce a mixed integer linear programming formulation to solve it exactly. On simulated data, we show that tumor clones can be identified reliably, especially when further taking into account the ancestral relationships that can be inferred from the input SNVs and CNAs. On 49 tumor samples from 10 prostate cancer patients, our reconciliation approach provides a higher resolution view of tumor evolution than previous studies.

[1]  Layla Oesper,et al.  A Consensus Approach to Infer Tumor Evolutionary Histories , 2018, BCB.

[2]  Chris Sander,et al.  Emerging landscape of oncogenic signatures across human cancers , 2013, Nature Genetics.

[3]  P. Nowell The clonal evolution of tumor cell populations. , 1976, Science.

[4]  Benjamin J. Raphael,et al.  Phylogenetic Copy-Number Factorization of Multiple Tumor Samples , 2018, J. Comput. Biol..

[5]  Benjamin J Raphael,et al.  SCARLET: Single-cell tumor phylogeny inference with copy-number constrained mutation losses. , 2020, Cell systems.

[6]  Yong Wang,et al.  Single-cell DNA sequencing reveals a late-dissemination model in metastatic colorectal cancer , 2017, Genome research.

[7]  N. McGranahan,et al.  The causes and consequences of genetic heterogeneity in cancer evolution , 2013, Nature.

[8]  Benjamin J. Raphael,et al.  Tumor phylogeny inference using tree-constrained importance sampling , 2017, Bioinform..

[9]  Ravindra K. Ahuja,et al.  Network Flows: Theory, Algorithms, and Applications , 1993 .

[10]  Nicolai J. Birkbak,et al.  Tracking the Evolution of Non‐Small‐Cell Lung Cancer , 2017, The New England journal of medicine.

[11]  C. Maley,et al.  Accurate Reconstruction of the Temporal Order of Mutations in Neoplastic Progression , 2011, Cancer Prevention Research.

[12]  Ron Shamir,et al.  Copy-Number Evolution Problems: Complexity and Algorithms , 2016, WABI.

[13]  Y. Kluger,et al.  TrAp: a tree approach for fingerprinting subclonal tumor composition , 2013, Nucleic acids research.

[14]  Mike A. Steel,et al.  Refining Phylogenetic Trees Given Additional Data: An Algorithm Based on Parsimony , 2009, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[15]  Benjamin J. Raphael,et al.  Reconstruction of clonal trees and tumor composition from multi-sample sequencing data , 2015, Bioinform..

[16]  Benjamin J. Raphael,et al.  Accurate quantification of copy-number aberrations and whole-genome duplications in multi-sample tumor sequencing data , 2018, Nature Communications.

[17]  Jun Guo,et al.  Inferring the Temporal Order of Cancer Gene Mutations in Individual Tumor Samples , 2014, PloS one.

[18]  S. Redner,et al.  Organization of growing random networks. , 2000, Physical review. E, Statistical, nonlinear, and soft matter physics.

[19]  S. C. Sahinalp,et al.  ReMixT: clone-specific genomic structure estimation in cancer , 2017, Genome Biology.

[20]  Iman Hajirasouliha,et al.  Fast and scalable inference of multi-sample cancer lineages , 2014, Genome Biology.

[21]  Ron Shamir,et al.  Complexity and algorithms for copy-number evolution problems , 2017, Algorithms for Molecular Biology.

[22]  James D. Brenton,et al.  Phylogenetic Quantification of Intra-tumour Heterogeneity , 2013, PLoS Comput. Biol..

[23]  Martin Ester,et al.  Uncovering the subtype-specific temporal order of cancer pathway dysregulation , 2019, bioRxiv.

[24]  Benjamin J. Raphael,et al.  The Copy-Number Tree Mixture Deconvolution Problem and Applications to Multi-sample Bulk Sequencing Tumor Data , 2017, RECOMB.

[25]  W. Koh,et al.  Single-cell genome sequencing: current state of the science , 2016, Nature Reviews Genetics.

[26]  Nancy R. Zhang,et al.  Assessing intratumor heterogeneity and tracking longitudinal and spatial clonal evolutionary history by next-generation sequencing , 2016, Proceedings of the National Academy of Sciences.

[27]  Benjamin J. Raphael,et al.  Inferring the Mutational History of a Tumor Using Multi-state Perfect Phylogeny Mixtures. , 2016, Cell systems.

[28]  Q. Morris,et al.  A practical guide to cancer subclonal reconstruction from DNA sequencing , 2021, Nature Methods.

[29]  M. Nykter,et al.  The Evolutionary History of Lethal Metastatic Prostate Cancer , 2015, Nature.

[30]  David S. Johnson,et al.  Complexity Results for Multiprocessor Scheduling under Resource Constraints , 1975, SIAM J. Comput..

[31]  Shankar Vembu,et al.  PhyloWGS: Reconstructing subclonal composition and evolution from whole-genome sequencing of tumors , 2015, Genome Biology.

[32]  Nicolai J. Birkbak,et al.  Pervasive chromosomal instability and karyotype order in tumour evolution , 2020, Nature.

[33]  Gun Ho Jang,et al.  A renewed model of pancreatic cancer evolution based on genomic rearrangement patterns , 2016, Nature.

[34]  Christopher J. R. Illingworth,et al.  High-Definition Reconstruction of Clonal Composition in Cancer , 2014, Cell reports.

[35]  Gunnar Rätsch,et al.  Reconstructing tumor evolutionary histories and clone trees in polynomial-time with SubMARine , 2020, bioRxiv.

[36]  David Fernández-Baca,et al.  The Perfect Phylogeny Problem , 2001 .

[37]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[38]  The Icgctcga Pan-Cancer Analysis of Whole Genomes Consortium Pan-cancer analysis of whole genomes , 2020 .

[39]  Benjamin J. Raphael,et al.  THetA: inferring intra-tumor heterogeneity from high-throughput DNA sequencing data , 2013, Genome Biology.

[40]  N. McGranahan,et al.  Biological and therapeutic impact of intratumor heterogeneity in cancer evolution. , 2015, Cancer cell.

[41]  Darwin Meets Graph Theory on a Strange Planet: Counting Full \(n\)-ary Trees with Labeled Leafs , 2015 .

[42]  A. Kolomeisky,et al.  Temporal order of mutations influences cancer initiation dynamics , 2021, bioRxiv.

[43]  Nilgun Donmez,et al.  Clonality inference in multiple tumor samples using phylogeny , 2015, Bioinform..

[44]  Mohammed El-Kebir,et al.  On the Non-uniqueness of Solutions to the Perfect Phylogeny Mixture Problem , 2018, RECOMB-CG.