DoubletDecon: Cell-State Aware Removal of Single-Cell RNA-Seq Doublets

Methods for single-cell RNA sequencing (scRNA-Seq) have greatly advanced in recent years. While droplet- and well-based methods have increased the capture frequency of cells for scRNA-Seq, these technologies readily produce technical artifacts, such as doublet-cell and multiplet-cell captures. Doublets occurring between distinct cell-types can appear as hybrid scRNA-Seq profiles, but do not have distinct transcriptomes from individual cell states. We introduce DoubletDecon, an approach that detects doublets with a combination of deconvolution analyses and the identification of unique cell-state gene expression. We demonstrate the ability of DoubletDecon to identify synthetic and cell-hashing cell singlets and doublets from scRNA-Seq datasets of varying cellular complexity. DoubletDecon is able to account for cell-cycle effects and is compatible with diverse species and unsupervised population detection algorithms (e.g., ICGS, Seurat). We believe this approach has the potential to become a standard quality control step for the accurate delineation of cell states.

[1]  Ting Gong,et al.  DeconRNASeq: a statistical framework for deconvolution of heterogeneous tissue samples based on mRNA-Seq data , 2013, Bioinform..

[2]  N. Salomonis,et al.  Cross-platform single cell analysis of kidney development shows stromal cells express Gdnf. , 2017, Developmental biology.

[3]  L. Steinmetz,et al.  Human haematopoietic stem cell lineage commitment is a continuous process , 2017, Nature Cell Biology.

[4]  Bruce J. Aronow,et al.  Single-cell analysis of mixed-lineage states leading to a binary cell fate choice , 2016, Nature.

[5]  Katherine S. Pollard,et al.  New methods for identifying significant clusters in gene expression data , 2022 .

[6]  Mark J. van der Laan,et al.  A Method to Identify Significant Clusters in Gene Expression Data , 2002 .

[7]  Stijn van Dongen,et al.  Graph Clustering Via a Discrete Uncoupling Process , 2008, SIAM J. Matrix Anal. Appl..

[8]  Sarah A Teichmann,et al.  Computational assignment of cell-cycle stage from single-cell transcriptome data. , 2015, Methods.

[9]  Charles H. Yoon,et al.  Dissecting the multicellular ecosystem of metastatic melanoma by single-cell RNA-seq , 2016, Science.

[10]  Guillaume Devailly,et al.  Gene expression variability in mammalian embryonic stem cells using single cell RNA-seq data , 2016, Comput. Biol. Chem..

[11]  Chun Jimmie Ye,et al.  Multiplexed droplet single-cell RNA-sequencing using natural genetic variation , 2017, Nature Biotechnology.

[12]  Somasekar Seshagiri,et al.  Massively parallel nanowell-based single-cell gene expression profiling , 2017, BMC Genomics.

[13]  Evan Z. Macosko,et al.  Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets , 2015, Cell.

[14]  David E. Muench,et al.  Granulocyte‐Monocyte Progenitors and Monocyte‐Dendritic Cell Progenitors Independently Produce Functionally Distinct Monocytes , 2017, Immunity.

[15]  N. Hacohen,et al.  Single-cell RNA-seq reveals new types of human blood dendritic cells, monocytes, and progenitors , 2017, Science.

[16]  Shiwei Zheng,et al.  Cell “hashing” with barcoded antibodies enables multiplexing and doublet detection for single cell genomics , 2017, bioRxiv.

[17]  S. Sugano,et al.  Combinatory use of distinct single-cell RNA-seq analytical platforms reveals the heterogeneous transcriptome response , 2018, Scientific Reports.

[18]  Zev J. Gartner,et al.  DoubletFinder: Doublet detection in single-cell RNA sequencing data using artificial nearest neighbors , 2018, bioRxiv.