Robust decomposition of cell type mixtures in spatial transcriptomics

Spatial transcriptomic technologies measure gene expression at increasing spatial resolution, approaching individual cells. However, a limitation of current technologies is that spatial measurements may contain contributions from multiple cells, hindering the discovery of cell type-specific spatial patterns of localization and expression. Here, we develop Robust Cell Type Decomposition (RCTD, https://github.com/dmcable/RCTD), a computational method that leverages cell type profiles learned from single-cell RNA sequencing data to decompose mixtures, such as those observed in spatial transcriptomic technologies. Our approach accounts for platform effects introduced by systematic technical variability inherent to different sequencing modalities. We demonstrate RCTD provides substantial improvement in cell type assignment in Slide-seq data by accurately reproducing known cell type and subtype localization patterns in the cerebellum and hippocampus. We further show the advantages of RCTD by its ability to detect mixtures and identify cell types on an assessment dataset. Finally, we show how RCTD’s recovery of cell type localization uniquely enables the discovery of genes within a cell type whose expression depends on spatial environment. Spatial mapping of cell types with RCTD has the potential to enable the definition of spatial components of cellular identity, uncovering new principles of cellular organization in biological tissue.

[1]  Ashwinikumar Kulkarni,et al.  Beyond bulk: a review of single cell transcriptomics methodologies and applications. , 2019, Current opinion in biotechnology.

[2]  Richard Bonneau,et al.  High-density spatial transcriptomics arrays for in situ tissue profiling , 2019, bioRxiv.

[3]  Jason C. Wester,et al.  Hippocampal GABAergic Inhibitory Interneurons. , 2017, Physiological reviews.

[4]  Mark S. Cembrowski,et al.  The subiculum is a patchwork of discrete subregions , 2018, eLife.

[5]  David B. Dunson,et al.  Lognormal and Gamma Mixed Negative Binomial Regression , 2012, ICML.

[6]  S. Phinn,et al.  Australian vegetated coastal ecosystems as global hotspots for climate change mitigation , 2019, Nature Communications.

[7]  Evan Z. Macosko,et al.  Slide-seq: A scalable technology for measuring genome-wide expression at high spatial resolution , 2019, Science.

[8]  Dennis D. Spencer,et al.  GAT1 and GAT3 expression are differently localized in the human epileptogenic hippocampus , 2006, Acta Neuropathologica.

[9]  Lydia Ng,et al.  Allen Brain Atlas: an integrated spatio-temporal portal for exploring the central nervous system , 2012, Nucleic Acids Res..

[10]  Guo-Cheng Yuan,et al.  Accurate estimation of cell-type composition from gene expression data , 2019, Nature Communications.

[11]  Fabian J Theis,et al.  The Human Cell Atlas , 2017, bioRxiv.

[12]  Aviv Regev,et al.  A transcriptomic atlas of the mouse cerebellum reveals regional specializations and novel cell types , 2020, bioRxiv.

[13]  Evan Z. Macosko,et al.  Molecular Diversity and Specializations among the Cells of the Adult Mouse Brain , 2018, Cell.

[14]  Cole Trapnell,et al.  Supervised classification enables rapid annotation of cell atlases , 2019, Nature Methods.

[15]  G. Enikolopov,et al.  NTPDase2 and Purinergic Signaling Control Progenitor Cell Proliferation in Neurogenic Niches of the Adult Mouse Brain , 2015, Stem cells.

[16]  I. Amit,et al.  Paired-cell sequencing enables spatial gene expression mapping of liver endothelial cells , 2018, Nature Biotechnology.

[17]  Christoph Hafemeister,et al.  Comprehensive integration of single cell data , 2018, bioRxiv.

[18]  Rickard Sandberg,et al.  Identification of spatial expression trends in single-cell gene expression data , 2018, Nature Methods.

[19]  Roy V. Sillitoe,et al.  Molecular layer interneurons shape the spike activity of cerebellar Purkinje cells , 2018, Scientific Reports.

[20]  M. Scharf,et al.  Rapid evolutionary responses to insecticide resistance management interventions by the German cockroach (Blattella germanica L.) , 2019, Scientific Reports.

[21]  Ananthram Swami,et al.  Non-Gaussian mixture models for detection and estimation in heavy-tailed noise , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[22]  石黒 真木夫,et al.  Akaike information criterion statistics , 1986 .

[23]  A. Regev,et al.  Revealing the vectors of cellular identity with single-cell genomics , 2016, Nature Biotechnology.

[24]  Adriano B. L. Tort,et al.  OLM interneurons differentially modulate CA3 and entorhinal inputs to hippocampal CA1 neurons , 2012, Nature Neuroscience.

[25]  Trygve E Bakken,et al.  Single-nucleus and single-cell transcriptomes compared in matched cortical cell types , 2018, PloS one.

[26]  David M. Simcha,et al.  Tackling the widespread and critical impact of batch effects in high-throughput data , 2010, Nature Reviews Genetics.

[27]  A. Jauch,et al.  3p25.3 microdeletion of GABA transporters SLC6A1 and SLC6A11 results in intellectual disability, epilepsy and stereotypic behavior , 2014, American journal of medical genetics. Part A.

[28]  Evan Z. Macosko,et al.  Sensitive spatial genome wide expression profiling at cellular resolution , 2020, bioRxiv.

[29]  S. Teichmann,et al.  SpatialDE: identification of spatially variable genes , 2018, Nature Methods.

[30]  Shiquan Sun,et al.  Statistical analysis of spatial expression patterns for spatially resolved transcriptomic studies , 2020, Nature Methods.

[31]  M. Capogna Neurogliaform cells and other interneurons of stratum lacunosum‐moleculare gate entorhinal–hippocampal dialogue , 2011, The Journal of physiology.