DoubletDecon: Deconvoluting Doublets from Single-Cell RNA-Sequencing Data

SUMMARY Methods for single-cell RNA sequencing (scRNA-seq) have advanced greatly in recent years. While droplet- and well-based methods have increased the capture frequency of cells for scRNA-seq, these technologies readily produce technical artifacts, such as doublet cell captures. Doublets occurring between distinct cell types can appear as hybrid scRNA-seq profiles, but do not have transcriptomes distinct from those of the individual cell states. We introduce DoubletDecon, an approach that detects doublets with a combination of deconvolution analyses and the identification of unique cell-state gene expression. We demonstrate the ability of DoubletDecon to identify synthetic, mixed-species, genetic, and cell-hashing cell doublets from scRNA-seq datasets of varying cellular complexity with high sensitivity relative to alternative approaches. Importantly, this algorithm avoids predicting valid mixed-lineage and transitional cell states as doublets by considering their unique gene expression. DoubletDecon has an easy-to-use graphical user interface and is compatible with diverse species and unsupervised population detection algorithms.
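The deconvolution idea mentioned in the summary can be illustrated with a minimal sketch. This is not DoubletDecon's implementation: it assumes each cluster is represented by a centroid expression vector and estimates non-negative mixing weights for one cell by projected gradient descent. The function names `deconvolve` and `is_candidate_doublet` and the `min_fraction` threshold are hypothetical, chosen only for illustration. The check for unique cell-state gene expression is not sketched here.

```python
def deconvolve(profile, centroids, steps=500):
    """Estimate non-negative weights of cluster centroids that best
    reconstruct one cell's expression profile (least squares)."""
    k = len(centroids)
    # Step size 1/trace(C C^T) is no larger than 1/(largest eigenvalue),
    # so the gradient iteration cannot diverge.
    trace = sum(v * v for c in centroids for v in c)
    lr = 1.0 / trace
    weights = [1.0 / k] * k
    for _ in range(steps):
        recon = [sum(w * c[g] for w, c in zip(weights, centroids))
                 for g in range(len(profile))]
        resid = [r - p for r, p in zip(recon, profile)]
        grads = [sum(res * c[g] for g, res in enumerate(resid))
                 for c in centroids]
        # Project back onto the non-negative orthant after each step.
        weights = [max(0.0, w - lr * gr) for w, gr in zip(weights, grads)]
    total = sum(weights)
    return [w / total for w in weights] if total > 0 else weights


def is_candidate_doublet(weights, min_fraction=0.3):
    """Flag a cell when at least two clusters each contribute a large
    share (hypothetical threshold, for illustration only)."""
    return sum(1 for w in weights if w >= min_fraction) >= 2


# Toy data: three clusters, each marked by one gene.
centroids = [[10.0, 0.0, 0.0], [0.0, 10.0, 0.0], [0.0, 0.0, 10.0]]
singlet = [10.0, 0.0, 0.0]
# A synthetic doublet built as the average of clusters 0 and 1.
doublet = [(a + b) / 2 for a, b in zip(centroids[0], centroids[1])]

singlet_weights = deconvolve(singlet, centroids)
doublet_weights = deconvolve(doublet, centroids)
```

On this toy input the synthetic doublet is assigned roughly equal weight from the two clusters it was built from, while the singlet is assigned almost entirely to its own cluster.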
