Multi-resolution single-cell state characterization via joint archetypal/network analysis

Dissecting the cellular heterogeneity embedded in single-cell transcriptomic data is challenging. Although a large number of methods and approaches exist, robustly identifying underlying cell states and their associations is still a major challenge; given the nonexclusive and dynamic influence of multiple unknown sources of variability, the existence of state continuum at the time-scale of observation, and the inevitable snapshot nature of experiments. As a way to address some of these challenges, here we introduce ACTIONet, a comprehensive framework that combines archetypal analysis and network theory to provide a ready-to-use analytical approach for multiresolution single-cell state characterization. ACTIONet uses multilevel matrix decomposition and network reconstruction to simultaneously learn cell state patterns, quantify single-cell states, and reconstruct a reproducible structural representation of the transcriptional state space that is geometrically mapped to a color space. A color-enhanced quantitative view of cell states enables novel visualization, prediction, and annotation approaches. Using data from multiple tissues, organisms, and developmental conditions, we illustrate how ACTIONet facilitates the reconstruction and exploration of single-cell state landscapes.

[1]  Carlo Colantuoni,et al.  Decomposing cell identity for transfer learning across cellular measurements, platforms, tissues, and species , 2018, bioRxiv.

[2]  C. Sarkar,et al.  Towards a Quantitative Understanding of Cell Identity. , 2018, Trends in cell biology.

[3]  Koji Ando,et al.  A molecular atlas of cell types and zonation in the brain vasculature , 2018, Nature.

[4]  Boleslaw K. Szymanski,et al.  Supplemental Methods For: Identifying Robust Communities and Multi-community Nodes by Combining Top-down and Bottom-up Approaches to Clustering , 2022 .

[5]  M. C. U. Araújo,et al.  The successive projections algorithm for variable selection in spectroscopic multicomponent analysis , 2001 .

[6]  A. Regev,et al.  Single-cell reconstruction of developmental trajectories during zebrafish embryogenesis , 2018, Science.

[7]  Alexander V. Favorov,et al.  Enter the Matrix: Factorization Uncovers Knowledge from Omics , 2018, Trends in genetics : TIG.

[8]  Wenjian Yu,et al.  Fast Randomized PCA for Sparse Data , 2018, ACML.

[9]  P. Verstreken,et al.  A Single-Cell Transcriptome Atlas of the Aging Drosophila Brain , 2018, Cell.

[10]  Andrew J. Hill,et al.  The single cell transcriptional landscape of mammalian organogenesis , 2019, Nature.

[11]  Peter N. Yianilos,et al.  Data structures and algorithms for nearest neighbor search in general metric spaces , 1993, SODA '93.

[12]  M. Ceccarelli,et al.  RNA-Seq Signatures Normalized by mRNA Abundance Allow Absolute Deconvolution of Human Immune Cell Types , 2019, Cell reports.

[13]  Jean Ponce,et al.  Sparse Modeling for Image and Vision Processing , 2014, Found. Trends Comput. Graph. Vis..

[14]  M. Gerstein,et al.  A Single-Cell Transcriptomic Atlas of Human Neocortical Development during Mid-gestation , 2019, Neuron.

[15]  Kfir Y. Levy,et al.  k*-Nearest Neighbors: From Global to Local , 2017, NIPS.

[16]  Caleb Weinreb,et al.  SPRING: a kinetic interface for visualizing high dimensional single-cell expression data , 2017, bioRxiv.

[17]  Cole Trapnell,et al.  Defining cell types and states with single-cell genomics , 2015, Genome research.

[18]  Caleb Weinreb,et al.  Fundamental limits on dynamic inference from single-cell snapshots , 2017, Proceedings of the National Academy of Sciences.

[19]  C. Ji An Archetypal Analysis on , 2005 .

[20]  Sheng Liu,et al.  Decomposing Cell Identity for Transfer Learning across Cellular Measurements, Platforms, Tissues, and Species. , 2019, Cell systems.

[21]  A. Vespignani,et al.  The architecture of complex weighted networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Evan Z. Macosko,et al.  Single-Cell Multi-omic Integration Compares and Contrasts Features of Brain Cell Identity , 2019, Cell.

[23]  Vien Cheung Uniform Color Spaces , 2012 .

[24]  Parlitz,et al.  Fast nearest-neighbor searching for nonlinear signal processing , 2000, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[25]  Paola Arlotta,et al.  Generating neuronal diversity in the mammalian cerebral cortex. , 2015, Annual review of cell and developmental biology.

[26]  Jianhua Lin,et al.  Divergence measures based on the Shannon entropy , 1991, IEEE Trans. Inf. Theory.

[27]  S. Orkin,et al.  Mapping the Mouse Cell Atlas by Microwell-Seq , 2018, Cell.

[28]  Lai Guan Ng,et al.  Dimensionality reduction for visualizing single-cell data using UMAP , 2018, Nature Biotechnology.

[29]  Pardis C Sabeti,et al.  Identifying gene expression programs of cell-type identity and cellular activity with single-cell RNA-Seq , 2018, bioRxiv.

[30]  Orit Rozenblatt-Rosen,et al.  Systematic comparative analysis of single cell RNA-sequencing methods , 2019, bioRxiv.

[31]  Fabian J Theis,et al.  PAGA: graph abstraction reconciles clustering with trajectory inference through a topology preserving map of single cells , 2019, Genome Biology.

[32]  Pablo Tamayo,et al.  Visualizing and interpreting single-cell gene expression datasets with Similarity Weighted Nonnegative Embedding , 2018, bioRxiv.

[33]  Brian S. Clark,et al.  Single-Cell RNA-Seq Analysis of Retinal Development Identifies NFI Factors as Regulating Mitotic Exit and Late-Born Cell Specification , 2019, Neuron.

[34]  Nicolas Gillis,et al.  Fast and Robust Recursive Algorithmsfor Separable Nonnegative Matrix Factorization , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Stephan J Sanders,et al.  Integrative functional genomic analysis of human brain development and neuropsychiatric risks , 2018, Science.

[36]  Pedro W. Lamberti,et al.  Monoparametric family of metrics derived from classical Jensen–Shannon divergence , 2017, 1709.10153.

[37]  Alex J. Cornish,et al.  SANTA: Quantifying the Functional Content of Molecular Networks , 2014, PLoS Comput. Biol..

[38]  Samuel Demharter,et al.  Joint analysis of heterogeneous single-cell RNA-seq dataset collections , 2019, Nature Methods.

[39]  Fabian J Theis,et al.  Cell type atlas and lineage tree of a whole complex animal by single-cell transcriptomics , 2018, Science.

[40]  Vincent A. Traag,et al.  From Louvain to Leiden: guaranteeing well-connected communities , 2018, Scientific Reports.

[41]  Evan Z. Macosko,et al.  Molecular Diversity and Specializations among the Cells of the Adult Mouse Brain , 2018, Cell.

[42]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[43]  Berthold Göttgens,et al.  A single-cell molecular map of mouse gastrulation and early organogenesis , 2019, Nature.

[44]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[45]  Leland McInnes,et al.  UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction , 2018, ArXiv.

[46]  Manolis Kellis,et al.  Single-cell transcriptomic analysis of Alzheimer’s disease , 2019, Nature.

[47]  Shahin Mohammadi,et al.  A geometric approach to characterize the functional identity of single cells , 2018, Nature Communications.

[48]  Luyi Tian,et al.  Benchmarking single cell RNA-sequencing analysis pipelines using mixture control experiments , 2019, Nature Methods.

[49]  King-Sun Fu,et al.  IEEE Transactions on Pattern Analysis and Machine Intelligence Publication Information , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.