scCATCH: Automatic Annotation on Cell Types of Clusters from Single-Cell RNA Sequencing Data

Summary Recent advancements in single-cell RNA sequencing (scRNA-seq) have facilitated the classification of thousands of cells through transcriptome profiling, wherein accurate cell type identification is critical for mechanistic studies. In most current analysis protocols, cell type-based cluster annotation is manually performed and heavily relies on prior knowledge, resulting in poor replicability of cell type annotation. This study aimed to introduce a single-cell Cluster-based Automatic Annotation Toolkit for Cellular Heterogeneity (scCATCH, https://github.com/ZJUFanLab/scCATCH). Using three benchmark datasets, the feasibility of evidence-based scoring and tissue-specific cellular annotation strategies were demonstrated by high concordance among cell types, and scCATCH outperformed Seurat, a popular method for marker genes identification, and cell-based annotation methods. Furthermore, scCATCH accurately annotated 67%–100% (average, 83%) clusters in six published scRNA-seq datasets originating from various tissues. The present results show that scCATCH accurately revealed cell identities with high reproducibility, thus potentially providing insights into mechanisms underlying disease pathogenesis and progression.

[1]  Patrick J. Whelan,et al.  Single-Cell Transcriptomics and Fate Mapping of Ependymal Cells Reveals an Absence of Neural Stem Cell Function , 2018, Cell.

[2]  Wei Liu,et al.  CancerSEA: a cancer single-cell state atlas , 2018, Nucleic Acids Res..

[3]  Hugo de Jonge,et al.  Faculty of 1000 evaluation for A single-cell atlas of the airway epithelium reveals the CFTR-rich pulmonary ionocyte. , 2018 .

[4]  S. Horvath,et al.  Single-Cell Transcriptome Analyses Reveal Signals to Activate Dormant Neural Stem Cells , 2015, Cell.

[5]  Kieran R. Campbell,et al.  Probabilistic cell-type assignment of single-cell RNA-seq for tumor microenvironment profiling , 2019, Nature Methods.

[6]  Fabian J Theis,et al.  The Human Cell Atlas , 2017, bioRxiv.

[7]  N. Ohkohchi,et al.  Isolation of mouse pancreatic ductal progenitor cells expressing CD133 and c-Met by flow cytometric cell sorting. , 2007, Gastroenterology.

[8]  N. Hacohen,et al.  Single-cell RNA-seq reveals new types of human blood dendritic cells, monocytes, and progenitors , 2017, Science.

[9]  Mark A. Knepper,et al.  Transcriptomes of major renal collecting duct cell types in mouse identified by single-cell RNA-seq , 2017, Proceedings of the National Academy of Sciences.

[10]  Kieran R. Campbell,et al.  Probabilistic cell type assignment of single-cell transcriptomic data reveals spatiotemporal microenvironment dynamics in human cancers , 2019, bioRxiv.

[11]  J. Nathans,et al.  Hypoxia tolerance in the Norrin-deficient retina and the chronically hypoxic brain studied at single-cell resolution , 2019, Proceedings of the National Academy of Sciences.

[12]  Allon M. Klein,et al.  A single cell atlas of the tracheal epithelium reveals the CFTR-rich pulmonary ionocyte , 2018, Nature.

[13]  N. Fleischer,et al.  Switching of mesodermal and endodermal properties in hTERT-modified and expanded fetal human pancreatic progenitor cells , 2010, Stem Cell Research & Therapy.

[14]  Atul J. Butte,et al.  Reference-based analysis of lung single-cell sequencing reveals a transitional profibrotic macrophage , 2018, Nature Immunology.

[15]  Philip Lijnzaad,et al.  CHETAH: a selective, hierarchical cell type identification method for single-cell RNA sequencing , 2019, Nucleic acids research.

[16]  Paul Hoffman,et al.  Integrating single-cell transcriptomic data across different conditions, technologies, and species , 2018, Nature Biotechnology.

[17]  O. Rotzschke,et al.  Large-scale Isolation of Highly Pure “Untouched” Regulatory T Cells in a GMP Environment for Adoptive Cell Therapy , 2015, Journal of immunotherapy.

[18]  Evan Z. Macosko,et al.  Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets , 2015, Cell.

[19]  Marcel J. T. Reinders,et al.  A comparison of automatic cell identification methods for single-cell RNA sequencing data , 2019, Genome Biology.

[20]  G. Korbutt,et al.  Epithelial cells within the human pancreas do not coexpress mesenchymal antigens: epithelial–mesenchymal transition is an artifact of cell culture , 2009, Laboratory Investigation.

[21]  Boxi Kang,et al.  Landscape of Infiltrating T Cells in Liver Cancer Revealed by Single-Cell Sequencing , 2017, Cell.

[22]  A. Murphy,et al.  RNA Sequencing of Single Human Islet Cells Reveals Type 2 Diabetes Genes. , 2016, Cell metabolism.

[23]  Sanghamitra Bandyopadhyay,et al.  dropClust: efficient clustering of ultra-large scRNA-seq data , 2017, bioRxiv.

[24]  J. C. Love,et al.  Seq-Well: A Portable, Low-Cost Platform for High-Throughput Single-Cell RNA-Seq of Low-Input Samples , 2017, Nature Methods.

[25]  Grace X. Y. Zheng,et al.  Massively parallel digital transcriptional profiling of single cells , 2016, Nature Communications.

[26]  S. Orkin,et al.  Mapping the Mouse Cell Atlas by Microwell-Seq , 2018, Cell.

[27]  S. Linnarsson,et al.  Cell types in the mouse cortex and hippocampus revealed by single-cell RNA-seq , 2015, Science.

[28]  L. Goodrich,et al.  Sensory Neuron Diversity in the Inner Ear Is Shaped by Activity , 2018, Cell.

[29]  Feng Li,et al.  CellMarker: a manually curated resource of cell markers in human and mouse , 2018, Nucleic Acids Res..

[30]  Weizhe Hong,et al.  Detecting Activated Cell Populations Using Single-Cell RNA-Seq , 2017, Neuron.

[31]  M. Hemberg,et al.  scmap: projection of single-cell RNA-seq data across data sets , 2018, Nature Methods.

[32]  J. C. Love,et al.  Seq-Well: portable, low-cost RNA sequencing of single cells at high throughput , 2017, Nature Methods.

[33]  S. Teichmann,et al.  A practical guide to single-cell RNA-sequencing for biomedical research and clinical applications , 2017, Genome Medicine.

[34]  S. Quake,et al.  Single-Cell Analysis of Human Pancreas Reveals Transcriptional Signatures of Aging and Somatic Mutation Patterns , 2017, Cell.

[35]  S. Potter,et al.  Single-cell RNA sequencing for the study of development, physiology and disease , 2018, Nature Reviews Nephrology.

[36]  M. Chopp,et al.  Patterns and Dynamics of Subventricular Zone Neuroblast Migration in the Ischemic Striatum of the Adult Mouse , 2009, Journal of cerebral blood flow and metabolism : official journal of the International Society of Cerebral Blood Flow and Metabolism.

[37]  Cole Trapnell,et al.  Supervised classification enables rapid annotation of cell atlases , 2019, Nature Methods.

[38]  G. Opelz,et al.  Plasticity and overlap of in vitro-induced regulatory T-cell markers in healthy humans. , 2013, Transplantation proceedings.