Characterizing transcriptional heterogeneity through pathway and gene set overdispersion analysis

Single-cell transcriptome measurements are being applied at rapidly increasing scales to study cellular repertoires underpinning functions of complex tissues and organs, including mammalian brains. The transcriptional state of each cell, however, reflects a variety of biological factors, including persistent cell-type specific regulatory configurations, transient processes such as cell cycle, local metabolic demands, and extracellular signals. Depending on the biological setting, all such aspects of transcriptional heterogeneity can be of potential interest, but detecting complex heterogeneity structure from inherently uncertain single-cell data presents analytical challenges. We developed PAGODA to resolve multiple, potentially overlapping aspects of transcriptional heterogeneity by identifying known pathways or novel gene sets that show significant excess of coordinated variability among the measured cells. We demonstrate that PAGODA effectively recovers the subpopulations and their corresponding functional characteristics in a variety of single-cell samples, and use it to characterize transcriptional diversity of neuronal progenitors in the developing mouse cortex.

[1]  Ramón y Cajal,et al.  Histologie du système nerveux de l'homme & des vertébrés , 1909 .

[2]  M. Kendall Statistical Methods for Research Workers , 1937, Nature.

[3]  K. Yoshikawa,et al.  Structure and Expression of the Mouse Necdin Gene , 1996, The Journal of Biological Chemistry.

[4]  A. Blaschke,et al.  Widespread programmed cell death in proliferative and postmitotic regions of the fetal cerebral cortex. , 1996, Development.

[5]  J. Weiner,et al.  Ventricular zone gene-1 (vzg-1) encodes a lysophosphatidic acid receptor expressed in neurogenic regions of the developing cerebral cortex , 1996, The Journal of cell biology.

[6]  Keisuke Kuida,et al.  Decreased apoptosis in the brain and premature lethality in CPP32-deficient mice , 1996, Nature.

[7]  Leyuan Shi,et al.  Interneuron migration from basal forebrain to neocortex: dependence on Dlx genes. , 1997, Science.

[8]  M. Pompeiano,et al.  Decreased apoptosis in proliferative and postmitotic regions of the caspase 3‐deficient embryonic central nervous system , 2000, The Journal of comparative neurology.

[9]  I. Johnstone On the distribution of the largest eigenvalue in principal components analysis , 2001 .

[10]  S. Rehen,et al.  Chromosomal variation in neurons of the developing and adult mammalian nervous system , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[11]  C. Englund,et al.  Pax6, Tbr2, and Tbr1 Are Expressed Sequentially by Radial Glia, Intermediate Progenitor Cells, and Postmitotic Neurons in Developing Neocortex , 2005, The Journal of Neuroscience.

[12]  Y. Yung,et al.  Human Genome & Diseases: Review Aneuploidy in the normal and diseased brain , 2006 .

[13]  S. Anderson,et al.  The origin and specification of cortical interneurons , 2006, Nature Reviews Neuroscience.

[14]  Takaaki Kuwajima,et al.  Necdin Promotes GABAergic Neuron Differentiation in Cooperation with Dlx Homeodomain Proteins , 2006, The Journal of Neuroscience.

[15]  Takaaki Kuwajima,et al.  Necdin Downregulates Cdc2 Expression to Attenuate Neuronal Apoptosis , 2006, The Journal of Neuroscience.

[16]  A. Kriegstein,et al.  Patterns of neural stem and progenitor cell division may underlie evolutionary cortical expansion , 2006, Nature Reviews Neuroscience.

[17]  Jill P. Mesirov,et al.  GSEA-P: a desktop application for Gene Set Enrichment Analysis , 2007, Bioinform..

[18]  Allan R. Jones,et al.  Genome-wide atlas of gene expression in the adult mouse brain , 2007, Nature.

[19]  Jane Y. Wu,et al.  Slit‐2/Robo‐1 modulates the CXCL12/CXCR4‐induced chemotaxis of T cells , 2007, Journal of leukocyte biology.

[20]  H. Ueda,et al.  Single-cell gene profiling defines differential progenitor subclasses in mammalian neurogenesis , 2008, Development.

[21]  W. Huber,et al.  which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. MAnorm: a robust model for quantitative comparison of ChIP-Seq data sets , 2011 .

[22]  Y. Yung,et al.  Lysophosphatidic Acid Signaling May Initiate Fetal Hydrocephalus , 2011, Science Translational Medicine.

[23]  Pradeep S Rajendran,et al.  Single-cell dissection of transcriptional heterogeneity in human colon tumors , 2011, Nature Biotechnology.

[24]  Caroline Lee,et al.  Deterministic and Stochastic Allele Specific Gene Expression in Single Mouse Blastomeres , 2011, PloS one.

[25]  D. Herr,et al.  Stereotyped fetal brain disorganization is induced by hypoxia and requires lysophosphatidic acid receptor 1 (LPA1) signaling , 2011, Proceedings of the National Academy of Sciences.

[26]  Stephen Bailey,et al.  Principal Component Analysis with Noisy and/or Missing Data , 2012, 1208.4122.

[27]  J. Rubenstein,et al.  A subpopulation of dorsal lateral/caudal ganglionic eminence-derived neocortical interneurons expresses the transcription factor Sp8. , 2012, Cerebral cortex.

[28]  Y. Yung,et al.  Aneuploid Cells Are Differentially Susceptible to Caspase-Mediated Death during Embryonic Cerebral Cortical Development , 2012, The Journal of Neuroscience.

[29]  T. Hashimshony,et al.  CEL-Seq: single-cell RNA-Seq by multiplexed linear amplification. , 2012, Cell reports.

[30]  A. van Oudenaarden,et al.  Using Gene Expression Noise to Understand Gene Regulation , 2012, Science.

[31]  Ruiqiang Li,et al.  Single-cell RNA-Seq profiling of human preimplantation embryos and embryonic stem cells , 2013, Nature Structural &Molecular Biology.

[32]  Aleksandra A. Kolodziejczyk,et al.  Accounting for technical noise in single-cell RNA-seq experiments , 2013, Nature Methods.

[33]  Åsa K. Björklund,et al.  Smart-seq2 for sensitive full-length transcriptome profiling in single cells , 2013, Nature Methods.

[34]  K. Yoshikawa,et al.  Necdin Controls Proliferation and Apoptosis of Embryonic Neural Stem Cells in an Oxygen Tension-Dependent Manner , 2013, The Journal of Neuroscience.

[35]  Rona S. Gertner,et al.  Single-cell transcriptomics reveals bimodality in expression and splicing in immune cells , 2013, Nature.

[36]  R. Sandberg,et al.  Single-Cell RNA-Seq Reveals Dynamic, Random Monoallelic Gene Expression in Mammalian Cells , 2014, Science.

[37]  P. Kharchenko,et al.  Bayesian approach to single-cell differential expression analysis , 2014, Nature Methods.

[38]  Gioele La Manno,et al.  Quantitative single-cell RNA-seq with unique molecular identifiers , 2013, Nature Methods.

[39]  Cole Trapnell,et al.  The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells , 2014, Nature Biotechnology.

[40]  K. Yoshikawa,et al.  Antagonistic Interplay between Necdin and Bmi1 Controls Proliferation of Neural Precursor Cells in the Embryonic Mouse Neocortex , 2014, PloS one.

[41]  I. Amit,et al.  Massively Parallel Single-Cell RNA-Seq for Marker-Free Decomposition of Tissues into Cell Types , 2014, Science.

[42]  A. Oudenaarden,et al.  Validation of noise models for single-cell transcriptomics , 2014, Nature Methods.

[43]  Alex A. Pollen,et al.  Low-coverage single-cell mRNA sequencing reveals cellular heterogeneity and activated signaling pathways in developing cerebral cortex , 2014, Nature Biotechnology.

[44]  Rona S. Gertner,et al.  Single cell RNA Seq reveals dynamic paracrine control of cellular variation , 2014, Nature.

[45]  Shawn M. Gillespie,et al.  Single-cell RNA-seq highlights intratumoral heterogeneity in primary glioblastoma , 2014, Science.

[46]  S. Linnarsson,et al.  Cell types in the mouse cortex and hippocampus revealed by single-cell RNA-seq , 2015, Science.

[47]  Y. Yung,et al.  Lysophosphatidic Acid Signaling in the Nervous System , 2015, Neuron.

[48]  Fabian J Theis,et al.  Computational analysis of cell-to-cell heterogeneity in single-cell RNA-sequencing data reveals hidden subpopulations of cells , 2015, Nature Biotechnology.

[49]  S. Linnarsson,et al.  Unbiased classification of sensory neuron types by large-scale single-cell RNA sequencing , 2014, Nature Neuroscience.

[50]  Kutay D Atabay,et al.  Single-cell analysis reveals transcriptional heterogeneity of neural progenitors in human cortex , 2015, Nature Neuroscience.

[51]  J. Chun,et al.  LPA signaling initiates schizophrenia-like brain and behavioral changes in a mouse model of prenatal brain hemorrhage , 2015, Translational Psychiatry.