Clustering single cells: a review of approaches on high-and low-depth single-cell RNA-seq data.

Advances in single-cell RNA-sequencing technology have resulted in a wealth of studies aiming to identify transcriptomic cell types in various biological systems. There are multiple experimental approaches to isolate and profile single cells, which provide different levels of cellular and tissue coverage. In addition, multiple computational strategies have been proposed to identify putative cell types from single-cell data. From a data generation perspective, recent single-cell studies can be classified into two groups: those that distribute reads shallowly over large numbers of cells and those that distribute reads more deeply over a smaller cell population. Although there are advantages to both approaches in terms of cellular and tissue coverage, it is unclear whether different computational cell type identification methods are better suited to one or the other experimental paradigm. This study reviews three cell type clustering algorithms, each representing one of three broad approaches, and finds that PCA-based algorithms appear most suited to low read depth data sets, whereas gene clustering-based and biclustering algorithms perform better on high read depth data sets. In addition, highly related cell classes are better distinguished by higher-depth data, given the same total number of reads; however, simultaneous discovery of distinct and similar types is better served by lower-depth, higher cell number data. Overall, this study suggests that the depth of profiling should be determined by initial assumptions about the diversity of cells in the population, and that the selection of clustering algorithm(s) subsequently based on the depth of profiling will allow for better identification of putative transcriptomic cell types.

[1]  Madeline A. Lancaster,et al.  Human cerebral organoids recapitulate gene expression programs of fetal neocortex development , 2015, Proceedings of the National Academy of Sciences.

[2]  S. Linnarsson,et al.  Cell types in the mouse cortex and hippocampus revealed by single-cell RNA-seq , 2015, Science.

[3]  Evan Z. Macosko,et al.  Comprehensive Classification of Retinal Bipolar Neurons by Single-Cell Transcriptomics , 2016, Cell.

[4]  Jens Hjerling-Leffler,et al.  Disentangling neural cell diversity using single-cell transcriptomics , 2016, Nature Neuroscience.

[5]  S. Quake,et al.  A survey of human brain transcriptome diversity at the single cell level , 2015, Proceedings of the National Academy of Sciences.

[6]  Evan Z. Macosko,et al.  A Molecular Census of Arcuate Hypothalamus and Median Eminence Cell Types , 2017, Nature Neuroscience.

[7]  Christof Koch,et al.  Adult Mouse Cortical Cell Taxonomy by Single Cell Transcriptomics , 2016, Nature Neuroscience.

[8]  Evan Z. Macosko,et al.  Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets , 2015, Cell.

[9]  I. Amit,et al.  Massively Parallel Single-Cell RNA-Seq for Marker-Free Decomposition of Tissues into Cell Types , 2014, Science.

[10]  Adele M Doyle,et al.  Fixed single-cell transcriptomic characterization of human radial glial diversity , 2015, Nature Methods.

[11]  Joseph L. Herman,et al.  Characterizing transcriptional heterogeneity through pathway and gene set overdispersion analysis , 2015, Nature Methods.

[12]  Cynthia C. Hession,et al.  Div-Seq: Single-nucleus RNA-Seq reveals dynamics of rare adult newborn neurons , 2016, Science.

[13]  Trygve E Bakken,et al.  Single-Cell Profiling of an In Vitro Model of Human Interneuron Development Reveals Temporal Dynamics of Cell Type Production and Maturation , 2017, Neuron.

[14]  J. C. Kim,et al.  Multi-Scale Molecular Deconstruction of the Serotonin Neuron System , 2015, Neuron.

[15]  M. Ronaghi,et al.  Neuronal subtypes and diversity revealed by single-nucleus RNA sequencing of the human brain , 2016, Science.

[16]  Athanasia G. Palasantza,et al.  Electrophysiological, transcriptomic and morphologic profiling of single neurons using Patch-seq , 2015, Nature Biotechnology.

[17]  N. Neff,et al.  Reconstructing lineage hierarchies of the distal lung epithelium using single cell RNA-seq , 2014, Nature.

[18]  Alex A. Pollen,et al.  Low-coverage single-cell mRNA sequencing reveals cellular heterogeneity and activated signaling pathways in developing cerebral cortex , 2014, Nature Biotechnology.

[19]  Yuchio Yanagawa,et al.  Molecular interrogation of hypothalamic organization reveals distinct dopamine neuronal subtypes , 2016, Nature Neuroscience.

[20]  Yi Zhang,et al.  Single-Cell RNA-Seq Reveals Hypothalamic Cell Diversity. , 2017, Cell reports.

[21]  M. Schaub,et al.  SC3 - consensus clustering of single-cell RNA-Seq data , 2016, Nature Methods.

[22]  Mauro J. Muraro,et al.  A Single-Cell Transcriptome Atlas of the Human Pancreas , 2016, Cell systems.

[23]  Assaf Gottlieb,et al.  Reconstruction of the Mouse Otocyst and Early Neuroblast Lineage at Single-Cell Resolution , 2014, Cell.

[24]  Lars E. Borm,et al.  Molecular Diversity of Midbrain Development in Mouse, Human, and Stem Cells , 2016, Cell.

[25]  Alex A. Pollen,et al.  Molecular Identity of Human Outer Radial Glia during Cortical Development , 2015, Cell.

[26]  S. Linnarsson,et al.  Unbiased classification of sensory neuron types by large-scale single-cell RNA sequencing , 2014, Nature Neuroscience.

[27]  A. Regev,et al.  Spatial reconstruction of single-cell gene expression , 2015, Nature Biotechnology.

[28]  Dan Tsafrir,et al.  Sorting points into neighborhoods (SPIN): data analysis and visualization by ordering distance matrices , 2005, Bioinform..

[29]  V. Menon,et al.  Discovering sparse transcription factor codes for cell states and state transitions during development , 2017, eLife.

[30]  Jens Hjerling-Leffler,et al.  Oligodendrocyte heterogeneity in the mouse juvenile and adult central nervous system , 2016, Science.

[31]  Steve Horvath,et al.  WGCNA: an R package for weighted correlation network analysis , 2008, BMC Bioinformatics.

[32]  Rebecca D Hodge,et al.  A Single-Cell Roadmap of Lineage Bifurcation in Human ESC Models of Embryonic Brain Development. , 2017, Cell stem cell.

[33]  Stephen R Quake,et al.  Cellular Taxonomy of the Mouse Striatum as Revealed by Single-Cell RNA-Seq. , 2016, Cell reports.

[34]  Xu Zhang,et al.  Somatosensory neuron types identified by high-coverage single-cell RNA-sequencing and functional heterogeneity , 2015, Cell Research.