Coexpression uncovers a unified single-cell transcriptomic landscape

Researchers stand to gain insight into complex biological systems by assembling multiple single-cell RNA-sequencing (scRNA-seq) studies to reveal a panoramic view of overarching biological structure. Unfortunately, many existing scRNA-seq analyses are limited by sensitivity to study-specific noise patterns, by lack of scalability to large datasets, or by integrative transformations that obscure biological relevance. We therefore introduce a novel algorithmic framework that analyzes groups of cells in coexpression space across multiple resolutions, rather than individual cells in gene expression space, to enable multi-study analysis with enhanced biological interpretation. We show that our approach reveals the biological structure spanning multiple, large-scale studies even in the presence of batch effects while facilitating biological interpretation via network and latent factor analysis. Our coexpression-based analysis enables an unprecedented view into two complex and dynamic processes - neuronal development and hematopoiesis - by leveraging a total of seven studies containing 1,460,527 cells from laboratories spanning three continents, yielding systems-level insight unattainable by any individual experiment. Our work demonstrates a path toward probing highly complex biological systems from emerging consortium-scale single-cell transcriptomics.

[1]  Mohammad Lotfollahi,et al.  Generative modeling and latent space arithmetics predict single-cell perturbation response across cell types, studies and species , 2018, bioRxiv.

[2]  Yvan Saeys,et al.  A comparison of single-cell trajectory inference methods , 2019, Nature Biotechnology.

[3]  Michael I. Jordan,et al.  Probabilistic harmonization and annotation of single‐cell transcriptomics data with deep generative models , 2019, bioRxiv.

[4]  E. Hol,et al.  Gene Expression Profiling of Multiple Sclerosis Pathology Identifies Early Patterns of Demyelination Surrounding Chronic Active Lesions , 2017, Front. Immunol..

[5]  Pak Chung Sham,et al.  Evaluation of tools for highly variable gene discovery from single-cell RNA-seq data , 2019, Briefings Bioinform..

[6]  Grace X. Y. Zheng,et al.  Massively parallel digital transcriptional profiling of single cells , 2016, Nature Communications.

[7]  Qi Huang,et al.  The role of microglia in multiple sclerosis , 2017, Neuropsychiatric disease and treatment.

[8]  Xiaolu Yang,et al.  The lncRNA Neat1 promotes activation of inflammasomes in macrophages , 2019, Nature Communications.

[9]  Travis E. Oliphant,et al.  Python for Scientific Computing , 2007, Computing in Science & Engineering.

[10]  S. Orkin,et al.  Mapping the Mouse Cell Atlas by Microwell-Seq , 2018, Cell.

[11]  Laleh Haghverdi,et al.  Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors , 2018, Nature Biotechnology.

[12]  R. Satija,et al.  Integrative single-cell analysis , 2019, Nature Reviews Genetics.

[13]  Fabian J Theis,et al.  Diffusion pseudotime robustly reconstructs lineage branching , 2016, Nature Methods.

[14]  Bonnie Berger,et al.  Efficient integration of heterogeneous single-cell transcriptomes using Scanorama , 2019, Nature Biotechnology.

[15]  Nickolay T. Trendafilov,et al.  Stepwise estimation of common principal components , 2010, Comput. Stat. Data Anal..

[16]  Mark Gerstein,et al.  The Importance of Bottlenecks in Protein Networks: Correlation with Gene Essentiality and Expression Dynamics , 2007, PLoS Comput. Biol..

[17]  I. Amit,et al.  Disease-Associated Microglia: A Universal Immune Sensor of Neurodegeneration , 2018, Cell.

[18]  Lars E. Borm,et al.  Molecular Architecture of the Mouse Nervous System , 2018, Cell.

[19]  Martin J. Aryee,et al.  Integrated Single-Cell Analysis Maps the Continuous Regulatory Landscape of Human Hematopoietic Differentiation , 2018, Cell.

[20]  Andrew J. Hill,et al.  The single cell transcriptional landscape of mammalian organogenesis , 2019, Nature.

[21]  A. Tanay,et al.  MetaCell: analysis of single-cell RNA-seq data using K-nn graph partitions , 2019, Genome Biology.

[22]  M. Leonard,et al.  The human factor: the critical importance of effective teamwork and communication in providing safe care , 2004, Quality and Safety in Health Care.

[23]  J. Kessler,et al.  The dynamic role of bone morphogenetic proteins in neural stem cell fate and maturation , 2012, Developmental neurobiology.

[24]  R. Khokha,et al.  TIMP-3 recruits quiescent hematopoietic stem cells into active cell cycle and expands multipotent progenitor pool. , 2010, Blood.

[25]  A. Regev,et al.  Efficient Generation of Transcriptomic Profiles by Random Composite Measurements , 2017, Cell.

[26]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[27]  A. Matevossian,et al.  The Paf oncogene is essential for hematopoietic stem cell function and development , 2011, Journal of Experimental Medicine.

[28]  Evan Z. Macosko,et al.  Single-Cell Multi-omic Integration Compares and Contrasts Features of Brain Cell Identity , 2019, Cell.

[29]  Bruce Aronow,et al.  The Molecular Signature of Megakaryocyte-Erythroid Progenitors Reveals a Role for the Cell Cycle in Fate Specification , 2018, Cell reports.

[30]  Anoop Cherian,et al.  Riemannian Dictionary Learning and Sparse Coding for Positive Definite Matrices , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[31]  B. Berger,et al.  SCHEMA: A general framework for integrating heterogeneous single-cell modalities , 2019, bioRxiv.

[32]  Jesse Gillis,et al.  Co-expression in Single-Cell Analysis: Saving Grace or Original Sin? , 2018, Trends in genetics : TIG.

[33]  Sara Ballouz,et al.  Exploiting single-cell expression to characterize co-expression replicability , 2016, Genome Biology.

[34]  Jean-Charles Delvenne,et al.  Random Walks, Markov Processes and the Multiscale Modular Organization of Complex Networks , 2014, IEEE Transactions on Network Science and Engineering.

[35]  In-Jung Kim,et al.  Identification of Intrinsic Axon Growth Modulators for Intact CNS Neurons after Injury , 2017, Cell reports.

[36]  M. Sznaier,et al.  Theory and an Application , 2003 .

[37]  A. Nandi,et al.  Global gene expression analysis of human erythroid progenitors. , 2011, Blood.

[38]  Paul Hoffman,et al.  Integrating single-cell transcriptomic data across different conditions, technologies, and species , 2018, Nature Biotechnology.

[39]  Fabian J Theis,et al.  SCANPY: large-scale single-cell gene expression data analysis , 2018, Genome Biology.

[40]  Paul J. Hoffman,et al.  Comprehensive Integration of Single-Cell Data , 2018, Cell.

[41]  Erik G. Larsson,et al.  Lecture Notes: Floating-Point Numbers , 2010 .

[42]  Israel Steinfeld,et al.  BMC Bioinformatics BioMed Central , 2008 .

[43]  D. Kalvakolanu,et al.  Microglia and necroptosis: The culprits of neuronal cell death in multiple sclerosis. , 2015, Cytokine.

[44]  Fabian J. Theis,et al.  MCA: Multiresolution Correlation Analysis, a graphical tool for subpopulation identification in single-cell gene expression data , 2014, BMC Bioinformatics.

[45]  Bonnie Berger,et al.  Title : Metric learning enables synthesis of heterogeneous single-cell modalities , 2020 .

[46]  A. Tall,et al.  ApoE regulates hematopoietic stem cell proliferation, monocytosis, and monocyte accumulation in atherosclerotic lesions in mice. , 2011, The Journal of clinical investigation.

[47]  C. Brosnan,et al.  Loss of the receptor tyrosine kinase Axl leads to enhanced inflammation in the CNS and delayed removal of myelin debris during Experimental Autoimmune Encephalomyelitis , 2011, Journal of Neuroinflammation.

[48]  C. Deppmann,et al.  Coronin-1 and Calcium Signaling Governs Sympathetic Final Target Innervation , 2015, The Journal of Neuroscience.

[49]  Mohammad Lotfollahi,et al.  scGen predicts single-cell perturbation responses , 2019, Nature Methods.

[50]  L. Foster,et al.  Evaluating measures of association for single-cell transcriptomics , 2019, Nature Methods.

[51]  Markus Glatzel,et al.  The TREM2-APOE Pathway Drives the Transcriptional Phenotype of Dysfunctional Microglia in Neurodegenerative Diseases. , 2017, Immunity.

[52]  Sara Ballouz,et al.  Characterizing the replicability of cell types defined by single cell RNA-sequencing data using MetaNeighbor , 2018, Nature Communications.

[53]  G. Kroemer,et al.  Interrelation between polyploidization and megakaryocyte differentiation: a gene profiling approach. , 2007, Blood.

[54]  Christos Davatzikos,et al.  Unsupervised Learning of Functional Network Dynamics in Resting State fMRI , 2013, IPMI.

[55]  Leeyup Chung,et al.  A Brief Introduction to the Transduction of Neural Activity into Fos Signal , 2015, Development & reproduction.

[56]  Cole Trapnell,et al.  The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells , 2014, Nature Biotechnology.

[57]  Bing Liu,et al.  Single-Cell RNA Sequencing Resolves Spatiotemporal Development of Pre-thymic Lymphoid Progenitors and Thymus Organogenesis in Human Embryos. , 2019, Immunity.

[58]  Christoph Hafemeister,et al.  Developmental diversification of cortical inhibitory interneurons , 2017, Nature.

[59]  Shin-Ichi Nishikawa,et al.  The T-box transcription factor Eomes/Tbr2 regulates neurogenesis in the cortical subventricular zone. , 2008, Genes & development.

[60]  Masahiko Watanabe,et al.  Presynaptically Released Cbln1 Induces Dynamic Axonal Structural Changes by Interacting with GluD2 during Cerebellar Synapse Formation , 2012, Neuron.

[61]  Bonnie Berger,et al.  Genome-Scale Networks Link Neurodegenerative Disease Genes to α-Synuclein through Specific Molecular Pathways. , 2017, Cell systems.

[62]  Fabian J Theis,et al.  A cellular census of human lungs identifies novel cell states in health and in asthma , 2019, Nature Medicine.

[63]  Kamil Slowikowski,et al.  Fast, sensitive, and accurate integration of single cell data with Harmony , 2019, Nature Methods.

[64]  Yasuyuki Saito,et al.  Microglial SIRPα regulates the emergence of CD11c+ microglia and demyelination damage in white matter , 2018, bioRxiv.

[65]  Massimo Santoro,et al.  Expression Profile of Long Non-Coding RNAs in Serum of Patients with Multiple Sclerosis , 2016, Journal of Molecular Neuroscience.

[66]  Samuel Demharter,et al.  Joint analysis of heterogeneous single-cell RNA-seq dataset collections , 2019, Nature Methods.

[67]  Aric Hagberg,et al.  Exploring Network Structure, Dynamics, and Function using NetworkX , 2008, Proceedings of the Python in Science Conference.

[68]  Shuai Gao,et al.  Hematopoietic Hierarchy - An Updated Roadmap. , 2018, Trends in cell biology.

[69]  João Pedro de Magalhães,et al.  Gene co-expression analysis for functional classification and gene–disease predictions , 2017, Briefings Bioinform..

[70]  R. Tibshirani,et al.  Sparse inverse covariance estimation with the graphical lasso. , 2008, Biostatistics.

[71]  A. Fuente,et al.  From ‘differential expression’ to ‘differential networking’ – identification of dysfunctional regulatory networks in diseases , 2010 .

[72]  Wei Wang,et al.  Feedback regulation of apical progenitor fate by immature neurons through Wnt7–Celsr3–Fzd3 signalling , 2016, Nature Communications.

[73]  Evan Z. Macosko,et al.  Molecular Diversity and Specializations among the Cells of the Adult Mouse Brain , 2018, Cell.

[74]  Allan R. Jones,et al.  A High-Resolution Spatiotemporal Atlas of Gene Expression of the Developing Mouse Brain , 2014, Neuron.

[75]  K. Kawakami,et al.  Transcriptional regulation of a myeloid-lineage specific gene lysozyme C during zebrafish myelopoiesis , 2009, Mechanisms of Development.

[76]  Bonnie Berger,et al.  Geometric Sketching Compactly Summarizes the Single-Cell Transcriptomic Landscape. , 2019, Cell systems.

[77]  Bonnie Berger,et al.  Compact Integration of Multi-Network Topology for Functional Analysis of Genes. , 2016, Cell systems.

[78]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[79]  Sangsoo Kim,et al.  Gene expression Differential coexpression analysis using microarray data and its application to human cancer , 2005 .

[80]  Evan Z. Macosko,et al.  Single‐Cell RNA Sequencing of Microglia throughout the Mouse Lifespan and in the Injured Brain Reveals Complex Cell‐State Changes , 2019, Immunity.

[81]  Fabian J Theis,et al.  PAGA: graph abstraction reconciles clustering with trajectory inference through a topology preserving map of single cells , 2019, Genome Biology.

[82]  M. Jacomy,et al.  ForceAtlas2, a Continuous Graph Layout Algorithm for Handy Network Visualization Designed for the Gephi Software , 2014, PloS one.

[83]  Fabian J Theis,et al.  Current best practices in single‐cell RNA‐seq analysis: a tutorial , 2019, Molecular systems biology.

[84]  Ulrik Brandes,et al.  On variants of shortest-path betweenness centrality and their generic computation , 2008, Soc. Networks.

[85]  Ulrich Sure,et al.  Edinburgh Research Explorer Spatial and temporal heterogeneity of mouse and human microglia at single-cell resolution , 2022 .

[86]  Matko Bosnjak,et al.  REVIGO Summarizes and Visualizes Long Lists of Gene Ontology Terms , 2011, PloS one.