An Introduction to the Analysis of Single-Cell RNA-Sequencing Data

The recent development of single-cell RNA sequencing has deepened our understanding of the cell as a functional unit. It provides new insights based on the gene expression profiles of hundreds to hundreds of thousands of individual cells, and it reveals new populations of cells with distinct gene expression profiles that were previously hidden in analyses of bulk cell populations. However, appropriate analysis and use of the massive amounts of data generated by single-cell RNA-sequencing experiments are challenging and require an understanding of the experimental and computational steps taken between preparation of input cells and output of interpretable data. In this review, we discuss the basic principles of these new technologies, focusing on concepts important in the analysis of single-cell RNA-sequencing data. Specifically, we summarize quality-control measures for determining which single cells to include in further analysis, methods of data normalization and scaling that compensate for the relatively inefficient capture of mRNA from each cell, and clustering and visualization algorithms used to reduce the dimensionality of the data to a two-dimensional plot.
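The three analysis stages named above (cell-level quality control, normalization and scaling, and reduction to two dimensions) can be illustrated with a minimal sketch on simulated counts. Everything in it is an assumption made for illustration rather than a method taken from this review: the function names, the thresholds (`min_genes`, `max_mito_frac`), the per-cell scale factor of 10,000, and the use of a mitochondrial-read fraction as a quality metric are all hypothetical choices, and plain PCA stands in for whichever visualization algorithm is actually used.

```python
import numpy as np

def qc_filter(counts, mito_mask, min_genes=200, max_mito_frac=0.2):
    """Boolean mask of cells passing QC; counts is a cells x genes matrix.

    Thresholds are illustrative assumptions, not recommended values.
    """
    genes_detected = (counts > 0).sum(axis=1)
    total = counts.sum(axis=1)
    # Guard against division by zero for empty cells.
    mito_frac = counts[:, mito_mask].sum(axis=1) / np.maximum(total, 1)
    return (genes_detected >= min_genes) & (mito_frac <= max_mito_frac)

def normalize_log(counts, scale=1e4):
    """Scale each cell to the same total count, then apply log(1 + x)."""
    total = counts.sum(axis=1, keepdims=True)
    return np.log1p(counts / np.maximum(total, 1) * scale)

def pca_2d(x):
    """Project cells onto the first two principal components via SVD."""
    centered = x - x.mean(axis=0)
    u, s, _ = np.linalg.svd(centered, full_matrices=False)
    return u[:, :2] * s[:2]

# Simulated data: 50 cells x 300 genes, the first 10 genes marked as mitochondrial.
rng = np.random.default_rng(0)
counts = rng.poisson(1.0, size=(50, 300)).astype(float)
mito_mask = np.zeros(300, dtype=bool)
mito_mask[:10] = True

counts[0, :] = 0              # an empty "cell" with no detected genes
counts[1, ~mito_mask] = 0     # a "cell" whose reads are all mitochondrial
counts[1, mito_mask] = 50

keep = qc_filter(counts, mito_mask, min_genes=100, max_mito_frac=0.2)
normalized = normalize_log(counts[keep])
embedding = pca_2d(normalized)  # one (x, y) coordinate pair per retained cell
```

The two deliberately corrupted cells are removed by the filter, and each remaining cell ends up with a pair of coordinates that could be drawn as a scatter plot. In practice, clustering would be run on the normalized data or on a larger number of principal components before plotting.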
