Cell lineage and communication network inference via optimization for single-cell transcriptomics

Abstract

Single-cell transcriptomics has become a major approach for delineating cell subpopulations and the transitions between them. While various computational tools using different mathematical methods have been developed to infer clusters, marker genes, and cell lineage, none yet integrates these tasks within a single mathematical framework so that they are performed coherently. Such coherence is critical for the inference of cell–cell communication, a major remaining challenge. Here, we present similarity matrix-based optimization for single-cell data analysis (SoptSC), in which unsupervised clustering, pseudotemporal ordering, lineage inference, and marker gene identification are all inferred from a structured cell-to-cell similarity matrix. SoptSC then predicts cell–cell communication networks, enabling reconstruction of complex cell lineages that include feedback or feedforward interactions. Application of SoptSC to early embryonic development, epidermal regeneration, and hematopoiesis demonstrates robust identification of subpopulations, lineage relationships, and pseudotime, as well as prediction of pathway-specific cell communication patterns that regulate development and differentiation.
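To make the idea of deriving several outputs from one cell-to-cell similarity matrix concrete, here is a minimal Python sketch. It is not SoptSC's algorithm, which this abstract does not specify. As stand-ins it assumes a Gaussian kernel for the similarity matrix, connected components of a thresholded similarity graph for clustering, and shortest-path distance from a chosen root cell for pseudotime. All function names, parameters (`sigma`, `threshold`, `root`), and the toy data are hypothetical.

```python
import heapq
import math


def similarity_matrix(cells, sigma=1.0):
    """Gaussian-kernel similarity between every pair of expression vectors.

    Assumption: a plain Gaussian kernel stands in for the structured
    similarity matrix that SoptSC learns by optimization.
    """
    n = len(cells)
    sim = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            d2 = sum((a - b) ** 2 for a, b in zip(cells[i], cells[j]))
            sim[i][j] = math.exp(-d2 / (2.0 * sigma ** 2))
    return sim


def cluster(sim, threshold=0.5):
    """Label connected components of the graph with edges where sim >= threshold."""
    n = len(sim)
    labels = [-1] * n
    current = 0
    for start in range(n):
        if labels[start] != -1:
            continue
        labels[start] = current
        stack = [start]
        while stack:
            i = stack.pop()
            for j in range(n):
                if labels[j] == -1 and sim[i][j] >= threshold:
                    labels[j] = current
                    stack.append(j)
        current += 1
    return labels


def pseudotime(sim, root):
    """Dijkstra distance from the root cell with edge cost -log(similarity)."""
    n = len(sim)
    dist = [math.inf] * n
    dist[root] = 0.0
    heap = [(0.0, root)]
    while heap:
        d, i = heapq.heappop(heap)
        if d > dist[i]:
            continue  # stale heap entry
        for j in range(n):
            if j == i or sim[i][j] <= 0.0:
                continue
            nd = d - math.log(sim[i][j])
            if nd < dist[j]:
                dist[j] = nd
                heapq.heappush(heap, (nd, j))
    return dist


# Toy data (hypothetical): six cells, two genes, forming two groups on a line.
cells = [[0.0, 0.0], [0.5, 0.0], [1.0, 0.0], [5.0, 0.0], [5.5, 0.0], [6.0, 0.0]]
sim = similarity_matrix(cells, sigma=1.0)
labels = cluster(sim, threshold=0.5)
times = pseudotime(sim, root=0)
order = sorted(range(len(cells)), key=lambda i: times[i])
print(labels, order)
```

Both the cluster labels and the ordering are read off the same matrix `sim`, so the two outputs cannot disagree about which cells are close to one another. That shared dependence is the kind of coherence the abstract argues for, shown here with much simpler machinery.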
