Computational modelling in single-cell cancer genomics: methods and future directions

Single-cell technologies have revolutionized biomedical research by enabling scalable measurement of the genome, transcriptome, proteome, ​and epigenome of multiple systems at single-cell resolution. Now widely applied to cancer models, these assays offer new insights into tumour heterogeneity, which underlies cancer initiation, progression, and relapse. However, the large quantities of high-dimensional, noisy data produced by single-cell assays can complicate data analysis, obscuring biological signals with technical artefacts. In this review article, we outline the major challenges in analyzing single-cell cancer genomics data and survey the current computational tools available to tackle these. We further outline unsolved problems that we consider major opportunities for future methods development to help interpret the vast quantities of data being generated.

[1]  Benjamin J. Raphael,et al.  Inferring Parsimonious Migration Histories for Metastatic Cancers , 2018, Nature Genetics.

[2]  Atul J. Butte,et al.  Reference-based analysis of lung single-cell sequencing reveals a transitional profibrotic macrophage , 2018, Nature Immunology.

[3]  Wyeth W. Wasserman,et al.  Interfaces of Malignant and Immunologic Clonal Dynamics in Ovarian Cancer , 2018, Cell.

[4]  G. Sanguinetti,et al.  scNMT-seq enables joint profiling of chromatin accessibility DNA methylation and transcription in single cells , 2018, Nature Communications.

[5]  Sören Müller,et al.  CONICS integrates scRNA-seq with DNA sequencing to map gene expression to tumor sub-clones , 2018, Bioinform..

[6]  Sean C. Bendall,et al.  Single-cell developmental classification of B cell precursor acute lymphoblastic leukemia at diagnosis reveals predictors of relapse , 2018, Nature Medicine.

[7]  Anne E Carpenter,et al.  CellProfiler 3.0: Next-generation image processing for biology , 2018, PLoS biology.

[8]  Fred A. Hamprecht,et al.  ilastik: interactive machine learning for (bio)image analysis , 2019, Nature Methods.

[9]  Itai Yanai,et al.  Integrating microarray-based spatial transcriptomics and single-cell RNA-seq reveals tissue architecture in pancreatic ductal adenocarcinomas , 2020, Nature Biotechnology.

[10]  Ken Chen,et al.  Monovar: single nucleotide variant detection in single cells , 2016, Nature Methods.

[11]  Sean C. Bendall,et al.  Data-Driven Phenotypic Dissection of AML Reveals Progenitor-like Cells that Correlate with Prognosis , 2015, Cell.

[12]  Kieran R. Campbell,et al.  clonealign: statistical integration of independent single-cell RNA and DNA sequencing data from human cancers , 2019, Genome Biology.

[13]  B. Becher,et al.  CyTOF workflow: differential discovery in high-throughput high-dimensional cytometry datasets , 2017, F1000Research.

[14]  Grace X. Y. Zheng,et al.  Massively parallel digital transcriptional profiling of single cells , 2016, Nature Communications.

[15]  Jack Kuipers,et al.  Single-cell mutation identification via phylogenetic inference , 2018, Nature Communications.

[16]  Johan Hartman,et al.  Chemoresistance Evolution in Triple-Negative Breast Cancer Delineated by Single-Cell Sequencing , 2018, Cell.

[17]  G. Nolan,et al.  Mass Cytometry: Single Cells, Many Features , 2016, Cell.

[18]  Sean C. Bendall,et al.  Extracting a Cellular Hierarchy from High-dimensional Cytometry Data with SPADE , 2011, Nature Biotechnology.

[19]  Siddharth S. Dey,et al.  Single-cell 5hmC sequencing reveals chromosome-wide cell-to-cell variability and enables lineage reconstruction , 2016, Nature Biotechnology.

[20]  Kieran R. Campbell,et al.  Probabilistic cell-type assignment of single-cell RNA-seq for tumor microenvironment profiling , 2019, Nature Methods.

[21]  Nilgun Donmez,et al.  Clonality inference in multiple tumor samples using phylogeny , 2015, Bioinform..

[22]  Joshua M. Stuart,et al.  The Cancer Genome Atlas Pan-Cancer analysis project , 2013, Nature Genetics.

[23]  Irmtraud M. Meyer,et al.  The clonal and mutational evolution spectrum of primary triple-negative breast cancers , 2012, Nature.

[24]  M. Robinson,et al.  A systematic performance evaluation of clustering methods for single-cell RNA-seq data. , 2018, F1000Research.

[25]  Peter W. Harrison,et al.  Cardelino: computational integration of somatic clonal substructure and single-cell transcriptomes , 2020, Nature Methods.

[26]  J. Maaskola,et al.  Charting Tissue Expression Anatomy by Spatial Transcriptome Deconvolution , 2018 .

[27]  Miguel A. Andrade-Navarro,et al.  Assessment of computational methods for the analysis of single-cell ATAC-seq data , 2019, Genome Biology.

[28]  Anthony Liew,et al.  Identification of grade and origin specific cell populations in serous epithelial ovarian cancer by single cell RNA-seq , 2018, PloS one.

[29]  Holger Moch,et al.  The single-cell pathology landscape of breast cancer , 2020, Nature.

[30]  Mary E. Edgerton,et al.  Multiclonal Invasion in Breast Tumors Identified by Topographic Single Cell Sequencing , 2018, Cell.

[31]  Takaya Nagasaki,et al.  Cancer-Associated Fibroblasts: Their Characteristics and Their Roles in Tumor Growth , 2015, Cancers.

[32]  Feng Li,et al.  CellMarker: a manually curated resource of cell markers in human and mouse , 2018, Nucleic Acids Res..

[33]  Kieran R. Campbell,et al.  Dissociation of solid tumor tissues with cold active protease for single-cell RNA-seq minimizes conserved collagenase-associated stress responses , 2019, Genome Biology.

[34]  Kim-Anh Lê Cao,et al.  DIABLO: an integrative approach for identifying key molecular drivers from multi-omics assays , 2019, Bioinform..

[35]  Sean C. Bendall,et al.  Multiplexed ion beam imaging of human breast tumors , 2014, Nature Medicine.

[36]  Benjamin J. Raphael,et al.  Characterizing the allele- and haplotype-specific copy number landscape of cancer genomes at single-cell resolution with CHISEL , 2019 .

[37]  J. Salk Clonal evolution in cancer , 2010 .

[38]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[39]  L. J. K. Wee,et al.  Reference component analysis of single-cell transcriptomes elucidates cellular heterogeneity in human colorectal tumors , 2017, Nature Genetics.

[40]  M. Schaub,et al.  SC3 - consensus clustering of single-cell RNA-Seq data , 2016, Nature Methods.

[41]  Haruka Ozaki,et al.  Millefy: visualizing cell-to-cell heterogeneity in read coverage of single-cell RNA sequencing datasets , 2019, BMC Genomics.

[42]  Daniel Osorio,et al.  Systematic determination of the mitochondrial proportion in human and mice tissues for single-cell RNA sequencing data quality control , 2020, bioRxiv.

[43]  E John Wherry,et al.  T cell exhaustion , 2011 .

[44]  Maria Anna Rapsomaniki,et al.  A Single-Cell Atlas of the Tumor and Immune Ecosystem of Human Breast Cancer , 2019, Cell.

[45]  Richard Bonneau,et al.  High-definition spatial transcriptomics for in situ tissue profiling , 2019, Nature Methods.

[46]  Alexandre Bouchard-Côté,et al.  Clonal genotype and population structure inference from single-cell tumor sequencing , 2016, Nature Methods.

[47]  C. Vallot,et al.  High-throughput single-cell ChIP-seq identifies heterogeneity of chromatin states in breast cancer , 2019, Nature Genetics.

[48]  F. Mapa,et al.  CellSIUS provides sensitive and specific detection of rare cell populations from complex single-cell RNA-seq data , 2019, Genome Biology.

[49]  Richard A. Moore,et al.  Clonal Decomposition and DNA Replication States Defined by Scaled Single-Cell Genome Sequencing , 2019, Cell.

[50]  Benjamin J. Raphael,et al.  Single-cell tumor phylogeny inference with copy-number constrained mutation losses , 2019, bioRxiv.

[51]  I. Nikaido,et al.  Millefy: visualizing cell-to-cell heterogeneity in read coverage of single-cell RNA sequencing datasets , 2020, BMC Genomics.

[52]  Florian Markowetz,et al.  OncoNEM: inferring tumor evolution from single-cell sequencing data , 2016, Genome Biology.

[53]  Euan A. Ashley,et al.  Deep Learning Automates the Quantitative Analysis of Individual Cells in Live-Cell Imaging Experiments , 2016, PLoS Comput. Biol..

[54]  Akdes Serin Harmanci,et al.  CaSpER identifies and visualizes CNV events by integrative analysis of single-cell or bulk RNA-sequencing data , 2020, Nature Communications.

[55]  Jian Wang,et al.  Discovery of biclonal origin and a novel oncogene SLC12A5 in colon cancer by single-cell sequencing , 2014, Cell Research.

[56]  A. Bashashati,et al.  Integrative analysis of genome-wide loss of heterozygosity and monoallelic expression at nucleotide resolution reveals disrupted pathways in triple-negative breast cancer , 2012, Genome research.

[57]  Yuchao Jiang,et al.  SCOPE: a normalization and copy number estimation method for single-cell DNA sequencing , 2019, bioRxiv.

[58]  Hossein Farahani,et al.  E-scape: interactive visualization of single-cell phylogenetics and cancer evolution , 2017, Nature Methods.

[59]  Jayadeva,et al.  Discovery of rare cells from voluminous single cell expression data , 2018, Nature Communications.

[60]  Mirjana Efremova,et al.  CellPhoneDB: inferring cell–cell communication from combined expression of multi-subunit ligand–receptor complexes , 2020, Nature Protocols.

[61]  Luke Zappia,et al.  Exploring the single-cell RNA-seq analysis landscape with the scRNA-tools database , 2017, bioRxiv.

[62]  Aleksandra A. Kolodziejczyk,et al.  Classification of low quality cells from single-cell RNA-seq data , 2016, Genome Biology.

[63]  Melissa E. Ko,et al.  Deep profiling of apoptotic pathways with mass cytometry identifies a synergistic drug combination for killing myeloma cells , 2020, Cell Death & Differentiation.

[64]  C. Ponting,et al.  G&T-seq: parallel sequencing of single-cell genomes and transcriptomes , 2015, Nature Methods.

[65]  Cole Trapnell,et al.  Supervised classification enables rapid annotation of cell atlases , 2019, Nature Methods.

[66]  Mark D. Robinson,et al.  Compensation of Signal Spillover in Suspension and Imaging Mass Cytometry , 2018, Cell systems.

[67]  Peter J Park,et al.  Linking transcriptional and genetic tumor heterogeneity through allele analysis of single-cell RNA-seq data , 2018, Genome research.

[68]  Ken Chen,et al.  SiFit: inferring tumor trees from single-cell sequencing data under finite-sites models , 2017, Genome Biology.

[69]  Samuel Aparicio,et al.  Scalable whole-genome single-cell library preparation without preamplification , 2017, Nature Methods.

[70]  J. Vijg,et al.  SCCNV: A Software Tool for Identifying Copy Number Variation From Single-Cell Whole-Genome Sequencing , 2019, bioRxiv.

[71]  E. Pierson,et al.  ZIFA: Dimensionality reduction for zero-inflated single-cell gene expression analysis , 2015, Genome Biology.

[72]  J. Buhmann,et al.  Highly multiplexed imaging of tumor tissues with subcellular resolution by mass cytometry , 2014, Nature Methods.

[73]  Steven M. Lipkin,et al.  Single-Cell Transcriptomics Reveals Heterogeneity and Drug Response of Human Colorectal Cancer Organoids , 2018, 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[74]  Matthew A Myers,et al.  CALDER: Inferring Phylogenetic Trees from Longitudinal Tumor Samples. , 2019, Cell systems.

[75]  Sarah A. Teichmann,et al.  Computational approaches for interpreting scRNA‐seq data , 2017, FEBS letters.

[76]  Angela E. Leek,et al.  Allele-Specific HLA Loss and Immune Escape in Lung Cancer Evolution , 2017, Cell.

[77]  S. Dudoit,et al.  A general and flexible method for signal extraction from single-cell RNA-seq data , 2018, Nature Communications.

[78]  A. Hellani,et al.  Multiple displacement amplification on single cell and possible PGD applications. , 2004, Molecular human reproduction.

[79]  M. Hemberg,et al.  scmap: projection of single-cell RNA-seq data across data sets , 2018, Nature Methods.

[80]  O. Stegle,et al.  Single-Cell Genome-Wide Bisulfite Sequencing for Assessing Epigenetic Heterogeneity , 2014, Nature Methods.

[81]  Wei Guo,et al.  SCINA: A Semi-Supervised Subtyping Algorithm of Single Cells and Bulk Samples , 2019, Genes.

[82]  Mariella G. Filbin,et al.  Decoupling genetics, lineages, and microenvironment in IDH-mutant gliomas by single-cell RNA-seq , 2017, Science.

[83]  F. Markowetz,et al.  The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups , 2012, Nature.

[84]  Isabella N. Grabski,et al.  Probabilistic gene expression signatures identify cell-types from single cell RNA-seq data , 2020, bioRxiv.

[85]  Aaron T. L. Lun,et al.  Scater: pre-processing, quality control, normalization and visualization of single-cell RNA-seq data in R , 2017, Bioinform..

[86]  J. Marioni,et al.  Multi‐Omics Factor Analysis—a framework for unsupervised integration of multi‐omics data sets , 2018, Molecular systems biology.

[87]  F. Mapa,et al.  CellSIUS provides sensitive and specific detection of rare cell populations from complex single-cell RNA-seq data , 2019, Genome Biology.