Robust and accurate deconvolution of tumor populations uncovers evolutionary mechanisms of breast cancer metastasis

Abstract

Motivation: Cancer develops and progresses through a clonal evolutionary process. Understanding progression to metastasis is of particular clinical importance, but is not easily analyzed by recent methods because it generally requires studying samples gathered years apart, for which modern single-cell sequencing is rarely an option. Revealing the clonal evolution mechanisms in the metastatic transition thus still depends on unmixing tumor subpopulations from bulk genomic data.

Methods: We develop a novel toolkit called robust and accurate deconvolution (RAD) to deconvolve biologically meaningful tumor populations from multiple transcriptomic samples spanning the two progression states. RAD uses gene module compression to mitigate the considerable noise in RNA data, and a hybrid optimizer to achieve a robust and accurate solution. Finally, we apply a phylogenetic algorithm to infer how associated cell populations adapt across the metastatic transition via changes in expression programs and cell-type composition.

Results: We validated the superior robustness and accuracy of RAD over alternative algorithms on a real dataset, and validated the effectiveness of gene module compression on both simulated and real bulk RNA data. We further applied the methods to a breast cancer metastasis dataset, and discovered common early events that promote tumor progression and migration to different metastatic sites, such as dysregulation of the ECM-receptor, focal adhesion and PI3K-Akt pathways.

Availability and implementation: The source code of the RAD package, along with models, experiments and technical details such as parameters, is available at https://github.com/CMUSchwartzLab/RAD.

Supplementary information: Supplementary data are available at Bioinformatics online.
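
To make the unmixing problem described under Methods concrete, the following minimal sketch factorizes a bulk expression matrix B (genes x samples) into non-negative population profiles C (genes x k) and mixture weights F (k x samples), so that B is approximately C F. It uses plain non-negative matrix factorization with multiplicative updates as a stand-in; it is not the RAD algorithm and includes neither gene module compression nor the hybrid optimizer. The function name `nmf_deconvolve`, its parameters, and the synthetic data are all hypothetical and chosen only for illustration.

```python
import numpy as np


def nmf_deconvolve(B, k, n_iter=500, seed=0, eps=1e-10):
    """Illustrative NMF unmixing of B (genes x samples) into C @ F.

    Hypothetical helper, not part of the RAD package. Uses standard
    multiplicative updates for the squared Frobenius reconstruction error.
    """
    rng = np.random.default_rng(seed)
    n_genes, n_samples = B.shape
    # Random strictly positive initialization keeps all updates non-negative.
    C = rng.random((n_genes, k)) + eps
    F = rng.random((k, n_samples)) + eps
    for _ in range(n_iter):
        # Update mixture weights, then population expression profiles.
        F *= (C.T @ B) / (C.T @ C @ F + eps)
        C *= (B @ F.T) / (C @ F @ F.T + eps)
    # Rescale each sample's weights to sum to 1 so they read as fractions.
    fractions = F / F.sum(axis=0, keepdims=True)
    return C, F, fractions


# Synthetic example (assumed data): 3 populations, 200 genes, 20 bulk samples.
rng = np.random.default_rng(1)
C_true = rng.lognormal(size=(200, 3))            # population profiles
F_true = rng.dirichlet(np.ones(3), size=20).T    # fractions, shape (3, 20)
B = C_true @ F_true                              # noiseless bulk mixtures

C_hat, F_hat, frac = nmf_deconvolve(B, k=3)
rel_err = np.linalg.norm(B - C_hat @ F_hat) / np.linalg.norm(B)
print("relative reconstruction error:", rel_err)
```

Because NMF solutions are defined only up to permutation and scaling of the components, the recovered populations would need to be matched to known ones before any comparison, and real bulk RNA data would call for the noise handling that the abstract attributes to RAD.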
