Scaling single cell transcriptomics through split pool barcoding

Constructing an atlas of cell types in complex organisms will require a collective effort to characterize billions of individual cells. Single cell RNA sequencing (scRNA-seq) has emerged as the main tool for characterizing cellular diversity, but current methods use custom microfluidics or microwells to compartmentalize single cells, limiting scalability and widespread adoption. Here we present Split Pool Ligation-based Transcriptome sequencing (SPLiT-seq), a scRNA-seq method that labels the cellular origin of RNA through combinatorial indexing. SPLiT-seq is compatible with fixed cells, scales exponentially, uses only basic laboratory equipment, and costs one cent per cell. We used this approach to analyze 109,069 single cell transcriptomes from an entire postnatal day 5 mouse brain, providing the first global snapshot at this stage of development. We identified 13 main populations comprising different types of neurons, glia, immune cells, endothelia, as well as types in the blood-brain-barrier. Moreover, we resolve substructure within these clusters corresponding to cells at different stages of development. As sequencing capacity increases, SPLiT-seq will enable profiling of billions of cells in a single experiment.

[1]  Robert C. Wolpert,et al.  A Review of the , 1985 .

[2]  V. Perry,et al.  Turnover of resident microglia in the normal adult mouse brain , 1992, Neuroscience.

[3]  M. Ekker,et al.  Expression from a Dlx gene enhancer marks adult mouse cortical GABAergic neurons. , 2002, Cerebral cortex.

[4]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[5]  G. Fishell,et al.  Fibroblast Growth Factor Receptor Signaling Promotes Radial Glial Identity and Interacts with Notch1 Signaling in Telencephalic Progenitors , 2004, The Journal of Neuroscience.

[6]  Gavin D. Grant,et al.  Common markers of proliferation , 2006, Nature Reviews Cancer.

[7]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[8]  K. Davis,et al.  Molecular dissection of NRG1-ERBB4 signaling implicates PTPRZ1 as a potential schizophrenia susceptibility gene , 2008, Molecular Psychiatry.

[9]  H. Monyer,et al.  Major Signaling Pathways in Migrating Neuroblasts , 2009, Front. Mol. Neurosci..

[10]  Tjerk P. Straatsma,et al.  NWChem: A comprehensive and scalable open-source solution for large scale molecular simulations , 2010, Comput. Phys. Commun..

[11]  A. Kriegstein,et al.  Developmental genetics of vertebrate glial–cell specification , 2010, Nature.

[12]  V. Gallo,et al.  Notch and EGFR pathway interaction regulates neural stem cell number and self-renewal , 2010, Nature.

[13]  Fang Liu,et al.  Identification and characterization of neuroblasts in the subventricular zone and rostral migratory stream of the adult human brain , 2011, Cell Research.

[14]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[15]  T. Hashimshony,et al.  CEL-Seq: single-cell RNA-Seq by multiplexed linear amplification. , 2012, Cell reports.

[16]  N. Rose,et al.  Macrophage diversity in cardiac inflammation: a review. , 2012, Immunobiology.

[17]  Ó. González-Pérez,et al.  Role of fibroblast growth factor receptors in astrocytic stem cells. , 2012, Current signal transduction therapy.

[18]  Åsa K. Björklund,et al.  Smart-seq2 for sensitive full-length transcriptome profiling in single cells , 2013, Nature Methods.

[19]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[20]  Rona S. Gertner,et al.  Single-cell transcriptomics reveals bimodality in expression and splicing in immune cells , 2013, Nature.

[21]  Laurens van der Maaten,et al.  Accelerating t-SNE using tree-based algorithms , 2014, J. Mach. Learn. Res..

[22]  Cole Trapnell,et al.  The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells , 2014, Nature Biotechnology.

[23]  S. Gygi,et al.  Identification of a Unique TGF-β Dependent Molecular and Functional Signature in Microglia , 2013, Nature Neuroscience.

[24]  Thomas Tørring,et al.  Template-directed covalent conjugation of DNA to native antibodies, transferrin and other metal-binding proteins. , 2014, Nature chemistry.

[25]  Juan Carlos Fernández,et al.  Multiobjective evolutionary algorithms to identify highly autocorrelated areas: the case of spatial distribution in financially compromised farms , 2014, Ann. Oper. Res..

[26]  T. Maniatis,et al.  An RNA-Sequencing Transcriptome and Splicing Database of Glia, Neurons, and Vascular Cells of the Cerebral Cortex , 2014, The Journal of Neuroscience.

[27]  E. Hol,et al.  GFAP and vimentin deficiency alters gene expression in astrocytes and microglia in wild‐type mice and changes the transcriptional response of reactive glia in mouse model for Alzheimer's disease , 2015, Glia.

[28]  Guillaume J. Filion,et al.  Starcode: sequence clustering based on all-pairs search , 2015, Bioinform..

[29]  S. Linnarsson,et al.  Cell types in the mouse cortex and hippocampus revealed by single-cell RNA-seq , 2015, Science.

[30]  Andrew C. Adey,et al.  Multiplex single-cell profiling of chromatin accessibility by combinatorial cellular indexing , 2015, Science.

[31]  S. Quake,et al.  A survey of human brain transcriptome diversity at the single cell level , 2015, Proceedings of the National Academy of Sciences.

[32]  Allon M. Klein,et al.  Droplet Barcoding for Single-Cell Transcriptomics Applied to Embryonic Stem Cells , 2015, Cell.

[33]  Mingxiang Teng,et al.  On the widespread and critical impact of systematic bias and batch effects in single-cell RNA-Seq data , 2015 .

[34]  Evan Z. Macosko,et al.  Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets , 2015, Cell.

[35]  G. von Heijne,et al.  Tissue-based map of the human proteome , 2015, Science.

[36]  S. P. Fodor,et al.  Combinatorial labeling of single cells for gene expression cytometry , 2015, Science.

[37]  Staci A. Sorensen,et al.  Adult Mouse Cortical Cell Taxonomy Revealed by Single Cell Transcriptomics , 2016 .

[38]  Joseph L. Herman,et al.  Characterizing transcriptional heterogeneity through pathway and gene set overdispersion analysis , 2015, Nature Methods.

[39]  M. Ronaghi,et al.  Neuronal subtypes and diversity revealed by single-nucleus RNA sequencing of the human brain , 2016, Science.

[40]  Jens Hjerling-Leffler,et al.  Oligodendrocyte heterogeneity in the mouse juvenile and adult central nervous system , 2016, Science.

[41]  Matt Thomson,et al.  Low Dimensionality in Gene Expression Data Enables the Accurate Extraction of Transcriptional Programs from Shallow Sequencing. , 2016, Cell systems.

[42]  Grace X. Y. Zheng,et al.  Massively parallel digital transcriptional profiling of single cells , 2016, Nature Communications.

[43]  Andrew C. Adey,et al.  Sequencing thousands of single-cell genomes with combinatorial indexing , 2017 .

[44]  Christopher. Simons,et al.  Machine learning with Python , 2017 .