Unambiguous detection of SARS-CoV-2 subgenomic mRNAs with single cell RNA sequencing

ABSTRACT Single-cell RNA sequencing (scRNA-Seq) studies have provided critical insight into the pathogenesis of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the causative agent of coronavirus disease 2019 (COVID-19). scRNA-Seq library preparation methods and data processing workflows are generally designed for the detection and quantification of eukaryotic host mRNAs and not viral RNAs. Here, we compare different scRNA-Seq library preparation methods for their ability to quantify and detect SARS-CoV-2 RNAs with a focus on subgenomic mRNAs (sgmRNAs). We show that compared to 10X Genomics Chromium Next GEM Single Cell 3′ (10X 3′) libraries or 10X Genomics Chromium Next GEM Single Cell V(D)J (10X 5′) libraries sequenced with standard read configurations, 10X 5′ libraries sequenced with an extended length read 1 (R1) that covers both cell barcode and transcript sequence (termed “10X 5′ with extended R1”) increase the number of unambiguous reads spanning leader-sgmRNA junction sites. We further present a data processing workflow, single-cell coronavirus sequencing (scCoVseq), which quantifies reads unambiguously assigned to viral sgmRNAs or viral genomic RNA (gRNA). We find that combining 10X 5′ with extended R1 library preparation/sequencing and scCoVseq data processing maximizes the number of viral UMIs per cell quantified by scRNA-Seq. Corresponding sgmRNA expression levels are highly correlated with expression in matched bulk RNA-Seq data sets quantified with established tools for SARS-CoV-2 analysis. Using this scRNA-Seq approach, we find that SARS-CoV-2 gene expression is highly correlated across individual infected cells, which suggests that the proportion of viral sgmRNAs remains generally consistent throughout infection. Taken together, these results and corresponding data processing workflow enable robust quantification of coronavirus sgmRNA expression at single-cell resolution, thereby supporting high-resolution studies of viral RNA processes in individual cells. IMPORTANCE Single-cell RNA sequencing (scRNA-Seq) has emerged as a valuable tool to study host-virus interactions, especially for coronavirus disease 2019 (COVID-19). Here we compare the performance of different scRNA-Seq library preparation methods and sequencing strategies to detect SARS-CoV-2 RNAs and develop a data processing workflow to quantify unambiguous sequence reads derived from SARS-CoV-2 genomic RNA and subgenomic mRNAs. After establishing a workflow that maximizes the detection of SARS-CoV-2 subgenomic mRNAs, we explore patterns of SARS-CoV-2 gene expression across cells with variable levels of total viral RNA, assess host gene expression differences between infected and bystander cells, and identify non-canonical and lowly abundant SARS-CoV-2 RNAs. The sequencing and data processing strategies developed here can enhance studies of coronavirus RNA biology at single-cell resolution and thereby contribute to our understanding of viral pathogenesis. Single-cell RNA sequencing (scRNA-Seq) has emerged as a valuable tool to study host-virus interactions, especially for coronavirus disease 2019 (COVID-19). Here we compare the performance of different scRNA-Seq library preparation methods and sequencing strategies to detect SARS-CoV-2 RNAs and develop a data processing workflow to quantify unambiguous sequence reads derived from SARS-CoV-2 genomic RNA and subgenomic mRNAs. After establishing a workflow that maximizes the detection of SARS-CoV-2 subgenomic mRNAs, we explore patterns of SARS-CoV-2 gene expression across cells with variable levels of total viral RNA, assess host gene expression differences between infected and bystander cells, and identify non-canonical and lowly abundant SARS-CoV-2 RNAs. The sequencing and data processing strategies developed here can enhance studies of coronavirus RNA biology at single-cell resolution and thereby contribute to our understanding of viral pathogenesis.

[1]  Benjamin J. Polacco,et al.  Impact of SARS-CoV-2 ORF6 and its variant polymorphisms on host responses and viral pathogenesis , 2022, bioRxiv.

[2]  J. V. Falvo,et al.  The oral drug nitazoxanide restricts SARS-CoV-2 infection and attenuates disease pathogenesis in Syrian hamsters , 2022, bioRxiv.

[3]  N. Krogan,et al.  Mutations in SARS-CoV-2 variants of concern link to increased spike cleavage and virus transmission , 2022, Cell Host & Microbe.

[4]  Xinzheng V. Guo,et al.  Author Correction: A molecular single-cell lung atlas of lethal COVID-19 , 2021, Nature.

[5]  A. Dobin,et al.  STARsolo: accurate, fast and versatile mapping/quantification of single-cell and single-nucleus RNA-seq data , 2021, bioRxiv.

[6]  Timothy L. Tickle,et al.  COVID-19 tissue atlases reveal SARS-CoV-2 pathology and cellular targets , 2021, Nature.

[7]  André F. Rendeiro,et al.  A molecular single-cell lung atlas of lethal COVID-19 , 2021, Nature.

[8]  S. Mallal,et al.  Subgenomic RNA identification in SARS-CoV-2 genomic sequencing data , 2021, Genome research.

[9]  A. Iwasaki,et al.  The first 12 months of COVID-19: a timeline of immunological insights , 2021, Nature reviews. Immunology.

[10]  A. Iwasaki,et al.  The first 12 months of COVID-19: a timeline of immunological insights , 2021, Nature Reviews Immunology.

[11]  Ansuman T. Satpathy,et al.  Discovery and functional interrogation of SARS-CoV-2 RNA-host protein interactions , 2021, Cell.

[12]  S. Mallal,et al.  Altered Subgenomic RNA Expression in SARS-CoV-2 B.1.1.7 Infections , 2021, bioRxiv.

[13]  Y. Ponty,et al.  The SARS-CoV-2 subgenome landscape and its novel regulatory features , 2021, Molecular Cell.

[14]  J. Sáez-Rodríguez,et al.  Dissecting CD8+ T cell pathology of severe SARS-CoV-2 infection by single-cell epitope mapping , 2021, bioRxiv.

[15]  David S. Fischer,et al.  Single-cell meta-analysis of SARS-CoV-2 entry genes across tissues and demographics , 2021, Nature Medicine.

[16]  Ruth R. Montgomery,et al.  Single-cell longitudinal analysis of SARS-CoV-2 infection in human airway epithelium identifies target cells, alterations in gene expression, and cell state changes , 2021, PLoS biology.

[17]  Kyle J. Gaulton,et al.  Single-cell meta-analysis of SARS-CoV-2 entry genes across tissues and demographics , 2021, Nature Medicine.

[18]  Cédric R. Weber,et al.  A Single-Cell Atlas of Lymphocyte Adaptive Immune Repertoires and Transcriptomes Reveals Age-Related Differences in Convalescent COVID-19 Patients , 2021, bioRxiv.

[19]  Salah Ayoub,et al.  Transcriptomic profiling of SARS-CoV-2 infected human cell lines identifies HSP90 as target for COVID-19 therapy , 2021, iScience.

[20]  L. Coin,et al.  Transcriptional and epi-transcriptional dynamics of SARS-CoV-2 during cellular infection , 2020, bioRxiv.

[21]  E. Lander,et al.  The SARS-CoV-2 RNA–protein interactome in infected human cells , 2020, Nature Microbiology.

[22]  Thomas M. Keane,et al.  Twelve years of SAMtools and BCFtools , 2020, GigaScience.

[23]  C. Yao,et al.  Cell-Type-Specific Immune Dysregulation in Severely Ill COVID-19 Patients , 2020, Cell Reports.

[24]  I. Rodriguez,et al.  SARS-CoV-2 Receptors and Entry Genes Are Expressed in the Human Olfactory Neuroepithelium and Brain , 2020, iScience.

[25]  V. Thiel,et al.  Coronavirus biology and replication: implications for SARS-CoV-2 , 2020, Nature Reviews Microbiology.

[26]  Silva Kasela,et al.  Identification of Required Host Factors for SARS-CoV-2 Infection in Human Cells , 2020, Cell.

[27]  S. Chanda,et al.  SARS-CoV-2 Orf6 hijacks Nup98 to block STAT nuclear import and antagonize interferon signaling , 2020, Proceedings of the National Academy of Sciences.

[28]  Penghua Wang,et al.  Mechanisms of SARS-CoV-2 Transmission and Pathogenesis , 2020, Trends in Immunology.

[29]  Hadley Wickham,et al.  Read Rectangular Text Data [R package readr version 1.4.0] , 2020 .

[30]  H. Achdout,et al.  The coding capacity of SARS-CoV-2 , 2020, Nature.

[31]  Jiyuan Zhang,et al.  Single-cell landscape of immunological responses in patients with COVID-19 , 2020, Nature Immunology.

[32]  T. Otto,et al.  COVID-19 and Rheumatoid Arthritis share myeloid pathogenic and resolving pathways , 2020, bioRxiv.

[33]  Jiyuan Zhang,et al.  Single-cell landscape of immunological responses in COVID-19 patients , 2020, bioRxiv.

[34]  C. Yao,et al.  Cell type-specific immune dysregulation in severely ill COVID-19 patients , 2020, medRxiv.

[35]  E. Miska,et al.  The Short- and Long-Range RNA-RNA Interactome of SARS-CoV-2 , 2020, bioRxiv.

[36]  Inkyung Jung,et al.  Immunophenotyping of COVID-19 and influenza highlights the role of type I interferons in development of severe COVID-19 , 2020, Science Immunology.

[37]  Andrew R. Leach,et al.  The Global Phosphorylation Landscape of SARS-CoV-2 Infection , 2020, Cell.

[38]  Hongyang Wang,et al.  Author Correction: Immune cell profiling of COVID-19 patients in the recovery stage by single-cell sequencing , 2020, Cell Discovery.

[39]  Aaron J. Wilk,et al.  A single-cell atlas of the peripheral immune response in patients with severe COVID-19 , 2020, Nature Medicine.

[40]  M. Ono,et al.  T-Cell Hyperactivation and Paralysis in Severe COVID-19 Infection Revealed by Single-Cell Analysis , 2020, bioRxiv.

[41]  S. Ciesek,et al.  Proteomics of SARS-CoV-2-infected host cells reveals therapy targets , 2020, Nature.

[42]  I. Amit,et al.  Host-Viral Infection Maps Reveal Signatures of Severe COVID-19 Patients , 2020, Cell.

[43]  Ruth R. Montgomery,et al.  Single-cell longitudinal analysis of SARS-CoV-2 infection in human airway epithelium , 2020, bioRxiv.

[44]  M. Alexander,et al.  Bulk and single-cell gene expression profiling of SARS-CoV-2 infected human cell lines identifies molecular targets for therapeutic intervention , 2020, bioRxiv.

[45]  David Haussler,et al.  The UCSC SARS-CoV-2 Genome Browser , 2020, Nature Genetics.

[46]  R. Schwartz,et al.  Imbalanced Host Response to SARS-CoV-2 Drives Development of COVID-19 , 2020, Cell.

[47]  Benjamin J. Polacco,et al.  A SARS-CoV-2 Protein Interaction Map Reveals Targets for Drug-Repurposing , 2020, Nature.

[48]  Gintaras Deikus,et al.  Introductions and early spread of SARS-CoV-2 in the New York City area , 2020, Science.

[49]  Zebao He,et al.  Proteomic and Metabolomic Characterization of COVID-19 Patient Sera , 2020, Cell.

[50]  Roland Eils,et al.  SARS‐CoV‐2 receptor ACE2 and TMPRSS2 are primarily expressed in bronchial transient secretory cells , 2020, The EMBO journal.

[51]  I. Rodriguez,et al.  SARS-CoV-2 receptor and entry genes are expressed by sustentacular cells in the human olfactory neuroepithelium , 2020, bioRxiv.

[52]  Hongyang Wang,et al.  Immune cell profiling of COVID-19 patients in the recovery stage by single-cell sequencing , 2020, Cell Discovery.

[53]  H. Shan,et al.  Viral Invasion and Type I Interferon Response Characterize the Immunophenotypes during COVID-19 Infection , 2020, SSRN Electronic Journal.

[54]  Hyeshik Chang,et al.  The Architecture of SARS-CoV-2 Transcriptome , 2020, Cell.

[55]  Fabian J Theis,et al.  SARS-CoV-2 Entry Genes Are Most Highly Expressed in Nasal Goblet and Ciliated Cells within Human Airways , 2020, Nature Medicine.

[56]  S. Ciesek,et al.  SARS-CoV-2 infected host cell proteomics reveal potential therapy targets , 2020 .

[57]  K. Kim,et al.  What Is COVID-19? , 2020, Frontiers for Young Minds.

[58]  Shuye Zhang,et al.  Single cell RNA sequencing of 13 human tissues identify cell types and receptors of human coronaviruses , 2020, bioRxiv.

[59]  E. Dong,et al.  An interactive web-based dashboard to track COVID-19 in real time , 2020, The Lancet Infectious Diseases.

[60]  G. Gao,et al.  A Novel Coronavirus from Patients with Pneumonia in China, 2019 , 2020, The New England journal of medicine.

[61]  Douglas M. Bates,et al.  Sparse and Dense Matrix Classes and Methods [R package Matrix version 1.2-18] , 2019 .

[62]  Marco Y. Hein,et al.  Functional single-cell genomics of human cytomegalovirus infection , 2019, Nature Biotechnology.

[63]  Lior Pachter,et al.  Modular and efficient pre-processing of single-cell RNA-seq , 2019, bioRxiv.

[64]  J. Bloom,et al.  Single-Cell Virus Sequencing of Influenza Infections That Trigger Innate Immunity , 2019, Journal of Virology.

[65]  R. Satija,et al.  Normalization and variance stabilization of single-cell RNA-seq data using regularized negative binomial regression , 2019, Genome Biology.

[66]  S. Tay,et al.  HSV-1 single-cell analysis reveals the activation of anti-viral and developmental programs in distinct sub-populations , 2019, bioRxiv.

[67]  S. Quake,et al.  Virus-inclusive single-cell RNA sequencing reveals the molecular signature of progression to severe dengue , 2018, Proceedings of the National Academy of Sciences.

[68]  Amit A. Upadhyay,et al.  West Nile Virus-Inclusive Single-Cell RNA Sequencing Reveals Heterogeneity in the Type I Interferon Response within Single Cells , 2018, Journal of Virology.

[69]  Diego Garrido-Martín,et al.  ggsashimi: Sashimi plot revised for browser- and annotation-independent splicing visualization , 2018, PLoS Comput. Biol..

[70]  Anne E Carpenter,et al.  CellProfiler 3.0: Next-generation image processing for biology , 2018, PLoS biology.

[71]  P. Fisher,et al.  Methods and Protocols. , 2018, Current developments in nutrition.

[72]  Paul Hoffman,et al.  Integrating single-cell transcriptomic data across different conditions, technologies, and species , 2018, Nature Biotechnology.

[73]  Charlotte Soneson,et al.  Bias, robustness and scalability in single-cell differential expression analysis , 2018, Nature Methods.

[74]  S. Quake,et al.  Single-cell transcriptional dynamics of flavivirus infection , 2017, bioRxiv.

[75]  J. Bloom,et al.  Extreme heterogeneity of influenza virus infection in single cells , 2017, bioRxiv.

[76]  A. Heger,et al.  UMI-tools: modeling sequencing errors in Unique Molecular Identifiers to improve quantification accuracy , 2016, bioRxiv.

[77]  I. Sola,et al.  Continuous and Discontinuous RNA Synthesis in Coronaviruses. , 2015, Annual review of virology.

[78]  Evan Z. Macosko,et al.  Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets , 2015, Cell.

[79]  Davis J. McCarthy,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[80]  R. Rueckert Picornaviridae: The viruses and their replication , 1996 .

[81]  J. Mosca,et al.  Transcriptional and posttranscriptional regulation of exogenous human beta interferon gene in simian cells defective in interferon synthesis , 1986, Molecular and cellular biology.

[82]  M J Morgan,et al.  Regulation of the interferon system: evidence that Vero cells have a genetic defect in interferon production. , 1979, The Journal of general virology.

[83]  J. Melnick,et al.  Defectiveness of Interferon Production and of Rubella Virus Interference in a Line of African Green Monkey Kidney Cells (Vero) , 1968, Journal of virology.