Structure of the full SARS-CoV-2 RNA genome in infected cells

SARS-CoV-2 is a betacoronavirus with a single-stranded, positive-sense, 30-kilobase RNA genome responsible for the ongoing COVID-19 pandemic. Currently, there are no antiviral drugs or vaccines with proven efficacy, and development of these treatments are hampered by our limited understanding of the molecular and structural biology of the virus. Like many other RNA viruses, RNA structures in coronaviruses regulate gene expression and are crucial for viral replication. Although genome and transcriptome data were recently reported, there is to date little experimental data on predicted RNA structures in SARS-CoV-2 and most putative regulatory sequences are uncharacterized. Here we report the secondary structure of the entire SARS-CoV-2 genome in infected cells at single nucleotide resolution using dimethyl sulfate mutational profiling with sequencing (DMS-MaPseq). Our results reveal previously undescribed structures within critical regulatory elements such as the genomic transcription-regulating sequences (TRSs). Contrary to previous studies, our in-cell data show that the structure of the frameshift element, which is a major drug target, is drastically different from prevailing in vitro models. The genomic structure detailed here lays the groundwork for coronavirus RNA biology and will guide the design of SARS-CoV-2 RNA-based therapeutics.

[1]  Rhiju Das,et al.  RNA genome conservation and secondary structure in SARS-CoV-2 and SARS-related viruses: a first look , 2020, RNA.

[2]  Hafeez S Haniff,et al.  An in silico map of the SARS-CoV-2 RNA Structurome , 2020, bioRxiv.

[3]  Hyeshik Chang,et al.  The Architecture of SARS-CoV-2 Transcriptome , 2020, Cell.

[4]  Matthew D. Edwards,et al.  Determination of RNA structural diversity and its role in HIV-1 RNA splicing , 2020, Nature.

[5]  Vineet D. Menachery,et al.  Severe Acute Respiratory Syndrome Coronavirus 2 from Patient with Coronavirus Disease, United States , 2020, Emerging infectious diseases.

[6]  Federico M Giorgi,et al.  Genomic variance of the 2019‐nCoV coronavirus , 2020, Journal of medical virology.

[7]  K. Black,et al.  bioRxiv: the preprint server for biology , 2019, bioRxiv.

[8]  Günter Mayer,et al.  Systematic evaluation of error rates and causes in short samples in next-generation sequencing , 2018, Scientific Reports.

[9]  J. Ziebuhr,et al.  Structural and functional conservation of cis-acting RNA elements in coronavirus 5'-terminal genome regions , 2017, Virology.

[10]  Evan Bolton,et al.  Database resources of the National Center for Biotechnology Information , 2017, Nucleic Acids Res..

[11]  Robert D. Finn,et al.  Rfam 13.0: shifting to a genome-centric resource for non-coding RNA families , 2017, Nucleic Acids Res..

[12]  J. Weissman,et al.  DMS-MaPseq for genome-wide or targeted RNA structure probing in vivo , 2016, Nature Methods.

[13]  Wen J. Li,et al.  Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation , 2015, Nucleic Acids Res..

[14]  I. Sola,et al.  Continuous and Discontinuous RNA Synthesis in Coronaviruses. , 2015, Annual review of virology.

[15]  J. Leibowitz,et al.  The structure and functions of coronavirus genomic 3′ and 5′ ends , 2015, Virus Research.

[16]  Howard Y. Chang,et al.  Structural imprints in vivo decode RNA regulatory mechanisms , 2015, Nature.

[17]  D. Giedroc,et al.  SHAPE analysis of the RNA secondary structure of the Mouse Hepatitis Virus 5’ untranslated region and N-terminal nsp1 coding sequences , 2014, Virology.

[18]  J. Ziebuhr,et al.  RNA structure analysis of alphacoronavirus terminal genome regions , 2014, Virus Research.

[19]  Steven Busan,et al.  RNA motif discovery by SHAPE and mutational profiling (SHAPE-MaP) , 2014, Nature Methods.

[20]  Manolis Kellis,et al.  Genome-wide probing of RNA structure reveals active unfolding of mRNA structures in vivo , 2013, Nature.

[21]  J. Doudna,et al.  Molecular mechanisms of RNA interference. , 2013, Annual review of biophysics.

[22]  D. Mathews,et al.  Accurate SHAPE-directed RNA secondary structure modeling, including pseudoknots , 2013, Proceedings of the National Academy of Sciences.

[23]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[24]  Rolf Backofen,et al.  Global or local? Predicting secondary structure and accessibility in mRNAs , 2012, Nucleic acids research.

[25]  D. Giedroc,et al.  Mouse Hepatitis Virus Stem-Loop 4 Functions as a Spacer Element Required To Drive Subgenomic RNA Synthesis , 2011, Journal of Virology.

[26]  J. Dinman,et al.  Achieving a Golden Mean: Mechanisms by Which Coronaviruses Ensure Synthesis of the Correct Stoichiometric Ratios of Viral Proteins , 2010, Journal of Virology.

[27]  Peter F. Stadler,et al.  RNAz 2.0: Improved Noncoding RNA Detection , 2010, Pacific Symposium on Biocomputing.

[28]  D. Giedroc,et al.  Coronavirus N Protein N-Terminal Domain (NTD) Specifically Binds the Transcriptional Regulatory Sequence (TRS) and Melts TRS-cTRS RNA Duplexes , 2009, Journal of Molecular Biology.

[29]  D. Giedroc,et al.  Mouse Hepatitis Virus Stem-Loop 2 Adopts a uYNMG(U)a-Like Tetraloop Structure That Is Highly Functionally Tolerant of Base Substitutions , 2009, Journal of Virology.

[30]  Kristen K. Dang,et al.  Architecture and Secondary Structure of an Entire HIV-1 RNA Genome , 2009, Nature.

[31]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[32]  Yann Ponty,et al.  VARNA: Interactive drawing and editing of the RNA secondary structure , 2009, Bioinform..

[33]  Jonathan D Dinman,et al.  The role of programmed-1 ribosomal frameshifting in coronavirus propagation. , 2008, Frontiers in bioscience : a journal and virtual library.

[34]  D. Giedroc,et al.  Structural Lability in Stem–Loop 1 Drives a 5′ UTR–3′ UTR Interaction in Coronavirus Replication , 2008, Journal of Molecular Biology.

[35]  D. Giedroc,et al.  A U-turn motif-containing stem-loop in the coronavirus 5' untranslated region plays a functional role in replication. , 2007, RNA.

[36]  P. Masters,et al.  The Molecular Biology of Coronaviruses , 2006, Advances in Virus Research.

[37]  Serafim Batzoglou,et al.  CONTRAfold: RNA secondary structure prediction without physics-based models , 2006, ISMB.

[38]  Ching-Hsiu Tsai,et al.  An atypical RNA pseudoknot stimulator and an upstream attenuation signal for −1 ribosomal frameshifting of SARS coronavirus , 2005, Nucleic acids research.

[39]  Jonathan D Dinman,et al.  A Three-Stemmed mRNA Pseudoknot in the SARS Coronavirus Frameshift Signal , 2005, PLoS biology.

[40]  Pavel V Baranov,et al.  Programmed ribosomal frameshifting in decoding the SARS-CoV genome , 2005, Virology.

[41]  D. Mathews Using an RNA secondary structure partition function to determine confidence in base pairs predicted by free energy minimization. , 2004, RNA.

[42]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[43]  Gregory D. Schuler,et al.  Database resources of the National Center for Biotechnology Information: update , 2004, Nucleic acids research.

[44]  Ralph S. Baric,et al.  Reverse genetics with a full-length infectious cDNA of severe acute respiratory syndrome coronavirus , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[45]  C. Lawrence,et al.  Statistical prediction of single-stranded regions in RNA secondary structure and application to predicting effective antisense target sites and beyond. , 2001, Nucleic acids research.

[46]  P. Romby,et al.  Implications of RNA structure on the annealing of a potent antisense RNA directed against the human immunodeficiency virus type 1. , 1997, Biochemistry.

[47]  I. Brierley,et al.  Characterization of an efficient coronavirus ribosomal frameshifting signal: Requirement for an RNA pseudoknot , 1989, Cell.

[48]  I. Brierley,et al.  An efficient ribosomal frame-shifting signal in the polymerase-encoding region of the coronavirus IBV. , 1987, The EMBO journal.

[49]  C. Mallows,et al.  A Method for Comparing Two Hierarchical Clusterings , 1983 .

[50]  W. Dixon Simplified Estimation from Censored Normal Samples , 1960 .