A multiresolution framework to characterize single-cell state landscapes

Dissecting the cellular heterogeneity embedded in single-cell transcriptomic data is challenging. Although many methods and approaches exist, identifying cell states and their underlying topology is still a major challenge. Here, we introduce the concept of multiresolution cell-state decomposition as a practical approach to simultaneously capture both fine- and coarse-grain patterns of variability. We implement this concept in ACTIONet, a comprehensive framework that combines archetypal analysis and manifold learning to provide a ready-to-use analytical approach for multiresolution single-cell state characterization. ACTIONet provides a robust, reproducible, and highly interpretable single-cell analysis platform that couples dominant pattern discovery with a corresponding structural representation of the cell state landscape. Using multiple synthetic and real data sets, we demonstrate ACTIONet’s superior performance relative to existing alternatives. We use ACTIONet to integrate and annotate cells across three human cortex data sets. Through integrative comparative analysis, we define a consensus vocabulary and a consistent set of gene signatures discriminating against the transcriptomic cell types and subtypes of the human prefrontal cortex.

[1]  Leland McInnes,et al.  UMAP: Uniform Manifold Approximation and Projection , 2018, J. Open Source Softw..

[2]  Carlo Colantuoni,et al.  Decomposing cell identity for transfer learning across cellular measurements, platforms, tissues, and species , 2018, bioRxiv.

[3]  C. Sarkar,et al.  Towards a Quantitative Understanding of Cell Identity. , 2018, Trends in cell biology.

[4]  Koji Ando,et al.  A molecular atlas of cell types and zonation in the brain vasculature , 2018, Nature.

[5]  Howard Y. Chang,et al.  Single-cell multiomic analysis identifies regulatory programs in mixed-phenotype acute leukemia , 2019, Nature Biotechnology.

[6]  Evan Z. Macosko,et al.  Single-Cell Multi-omic Integration Compares and Contrasts Features of Brain Cell Identity , 2019, Cell.

[7]  M. Ceccarelli,et al.  RNA-Seq Signatures Normalized by mRNA Abundance Allow Absolute Deconvolution of Human Immune Cell Types , 2019, Cell reports.

[8]  A. Regev,et al.  Single-cell reconstruction of developmental trajectories during zebrafish embryogenesis , 2018, Science.

[9]  M. C. U. Araújo,et al.  The successive projections algorithm for variable selection in spectroscopic multicomponent analysis , 2001 .

[10]  P. Verstreken,et al.  A Single-Cell Transcriptome Atlas of the Aging Drosophila Brain , 2018, Cell.

[11]  Christian Bauckhage,et al.  Making Archetypal Analysis Practical , 2009, DAGM-Symposium.

[12]  Paola Arlotta,et al.  Generating neuronal diversity in the mammalian cerebral cortex. , 2015, Annual review of cell and developmental biology.

[13]  Kfir Y. Levy,et al.  k*-Nearest Neighbors: From Global to Local , 2017, NIPS.

[14]  Jianhua Lin,et al.  Divergence measures based on the Shannon entropy , 1991, IEEE Trans. Inf. Theory.

[15]  S. Orkin,et al.  Mapping the Mouse Cell Atlas by Microwell-Seq , 2018, Cell.

[16]  Alexander V. Favorov,et al.  Enter the Matrix: Factorization Uncovers Knowledge from Omics , 2018, Trends in genetics : TIG.

[17]  Shawn M. Gillespie,et al.  Single-Cell Transcriptomic Analysis of Primary and Metastatic Tumor Ecosystems in Head and Neck Cancer , 2017, Cell.

[18]  Maximilian Haeussler,et al.  Single-cell genomics identifies cell type–specific molecular changes in autism , 2019, Science.

[19]  Stephan J Sanders,et al.  Integrative functional genomic analysis of human brain development and neuropsychiatric risks , 2018, Science.

[20]  Andrew J. Hill,et al.  The single cell transcriptional landscape of mammalian organogenesis , 2019, Nature.

[21]  Peter N. Yianilos,et al.  Data structures and algorithms for nearest neighbor search in general metric spaces , 1993, SODA '93.

[22]  Caleb Weinreb,et al.  SPRING: a kinetic interface for visualizing high dimensional single-cell expression data , 2017, bioRxiv.

[23]  A. Vespignani,et al.  The architecture of complex weighted networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Cole Trapnell,et al.  Defining cell types and states with single-cell genomics , 2015, Genome research.

[25]  Caleb Weinreb,et al.  Fundamental limits on dynamic inference from single-cell snapshots , 2017, Proceedings of the National Academy of Sciences.

[26]  Koji Tsuda,et al.  CellTree: an R/bioconductor package to infer the hierarchical structure of cell populations from single-cell RNA-seq data , 2016, BMC Bioinformatics.

[27]  Tsz Lock Vien Cheung,et al.  Uniform Color Spaces , 2012, Handbook of Visual Display Technology.

[28]  Alex J. Cornish,et al.  SANTA: Quantifying the Functional Content of Molecular Networks , 2014, PLoS Comput. Biol..

[29]  M. Gerstein,et al.  A Single-Cell Transcriptomic Atlas of Human Neocortical Development during Mid-gestation , 2019, Neuron.

[30]  Pardis C. Sabeti,et al.  Identifying Gene Expression Programs of Cell-type Identity and Cellular Activity with Single-Cell RNA-Seq , 2018 .

[31]  Samuel Demharter,et al.  Joint analysis of heterogeneous single-cell RNA-seq dataset collections , 2019, Nature Methods.

[32]  Boleslaw K. Szymanski,et al.  Supplemental Methods For: Identifying Robust Communities and Multi-community Nodes by Combining Top-down and Bottom-up Approaches to Clustering , 2022 .

[33]  Victoria Stodden,et al.  When Does Non-Negative Matrix Factorization Give a Correct Decomposition into Parts? , 2003, NIPS.

[34]  Shahin Mohammadi,et al.  A geometric approach to characterize the functional identity of single cells , 2018, Nature Communications.

[35]  Pablo Tamayo,et al.  Visualizing and interpreting single-cell gene expression datasets with Similarity Weighted Nonnegative Embedding , 2018, bioRxiv.

[36]  Wenjian Yu,et al.  Fast Randomized PCA for Sparse Data , 2018, ACML.

[37]  James T. Kwok,et al.  Making Large-Scale Nyström Approximation Possible , 2010, ICML.

[38]  Lars Kai Hansen,et al.  Archetypal analysis for machine learning , 2010, 2010 IEEE International Workshop on Machine Learning for Signal Processing.

[39]  Pierre Hansen,et al.  NP-hardness of Euclidean sum-of-squares clustering , 2008, Machine Learning.

[40]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[41]  Richard Reynolds,et al.  Neuronal vulnerability and multilineage diversity in multiple sclerosis , 2019, Nature.

[42]  Parlitz,et al.  Fast nearest-neighbor searching for nonlinear signal processing , 2000, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[43]  Lai Guan Ng,et al.  Dimensionality reduction for visualizing single-cell data using UMAP , 2018, Nature Biotechnology.

[44]  Pierre Comon,et al.  Independent component analysis, A new concept? , 1994, Signal Process..

[45]  Manolis Kellis,et al.  Single-cell transcriptomic analysis of Alzheimer’s disease , 2019, Nature.

[46]  Pedro W. Lamberti,et al.  Monoparametric family of metrics derived from classical Jensen–Shannon divergence , 2017, 1709.10153.

[47]  Fan Zhang,et al.  Fast, sensitive, and accurate integration of single cell data with Harmony , 2018, bioRxiv.

[48]  Fabian J Theis,et al.  Cell type atlas and lineage tree of a whole complex animal by single-cell transcriptomics , 2018, Science.

[49]  Vincent A. Traag,et al.  From Louvain to Leiden: guaranteeing well-connected communities , 2018, Scientific Reports.

[50]  Evan Z. Macosko,et al.  Molecular Diversity and Specializations among the Cells of the Adult Mouse Brain , 2018, Cell.

[51]  C. Ji An Archetypal Analysis on , 2005 .

[52]  Fabian J Theis,et al.  PAGA: graph abstraction reconciles clustering with trajectory inference through a topology preserving map of single cells , 2019, Genome Biology.

[53]  Brian S. Clark,et al.  Single-Cell RNA-Seq Analysis of Retinal Development Identifies NFI Factors as Regulating Mitotic Exit and Late-Born Cell Specification , 2019, Neuron.

[54]  Nicolas Gillis,et al.  Fast and Robust Recursive Algorithmsfor Separable Nonnegative Matrix Factorization , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[55]  Yury A. Malkov,et al.  Efficient and Robust Approximate Nearest Neighbor Search Using Hierarchical Navigable Small World Graphs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[56]  Jean Ponce,et al.  Sparse Modeling for Image and Vision Processing , 2014, Found. Trends Comput. Graph. Vis..

[57]  Orit Rozenblatt-Rosen,et al.  Systematic comparative analysis of single cell RNA-sequencing methods , 2019, bioRxiv.

[58]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[59]  Berthold Göttgens,et al.  A single-cell molecular map of mouse gastrulation and early organogenesis , 2019, Nature.

[60]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[61]  Leland McInnes,et al.  UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction , 2018, ArXiv.

[62]  Allon M. Klein,et al.  Lineage tracing meets single-cell omics: opportunities and challenges , 2020, Nature Reviews Genetics.

[63]  Fabian J Theis,et al.  Current best practices in single‐cell RNA‐seq analysis: a tutorial , 2019, Molecular systems biology.