Pervasive RNA Secondary Structure in the Genomes of SARS-CoV-2 and Other Coronaviruses

ABSTRACT The ultimate outcome of the coronavirus disease 2019 (COVID-19) pandemic is unknown and is dependent on a complex interplay of the pathogenicity and transmissibility of the virus and population immunity. In the current study, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) was investigated for the presence of large-scale internal RNA base pairing in its genome. This property, termed genome-scale ordered RNA structure (GORS), has been previously associated with host persistence in other positive-strand RNA viruses, potentially through its shielding effect on viral RNA recognition in the cell.
Genomes of SARS-CoV-2 were remarkably structured, with minimum folding energy differences (MFEDs) of 15%, substantially greater than those of previously examined viruses such as hepatitis C virus (HCV) (MFED of 7 to 9%). High MFED values were shared by all coronavirus genomes analyzed and were created by several hundred consecutive energetically favored stem-loops throughout the genome. In contrast to replication-associated RNA structure, GORS was poorly conserved in the positions and identities of base pairing with other sarbecoviruses; even similarly positioned stem-loops in SARS-CoV-2 and SARS-CoV rarely shared homologous pairings, indicative of more rapid evolutionary change in RNA structure than in the underlying coding sequences. Sites predicted to be base paired in SARS-CoV-2 showed less sequence diversity than unpaired sites, suggesting that disruption of RNA structure by mutation imposes a fitness cost on the virus that potentially restricts its longer-term evolution. Although functionally uncharacterized, GORS in SARS-CoV-2 and other coronaviruses represents an important element in their cellular interactions that may contribute to their persistence and transmissibility.

IMPORTANCE The detection and characterization of large-scale RNA secondary structure in the genome of SARS-CoV-2 indicate an extraordinary and unsuspected degree of genome structural organization; this could be effectively visualized through a newly developed contour plotting method that displays positions, structural features, and conservation of RNA secondary structure between related viruses. Such RNA structure imposes a substantial evolutionary cost; paired sites showed greater restriction in diversity and represent a substantial additional constraint in reconstructing the molecular epidemiology of SARS-CoV-2.
The biological relevance of this structure arises from previously documented associations between possession of structured genomes and persistence, as reported for HCV and several other RNA viruses infecting humans and other mammals. Shared properties potentially conferred by large-scale structure in SARS-CoV-2 include prolonged infections and induced immune dysfunction that prevents development of protective immunity, for which evidence is increasing. The findings identify an additional element in cellular interactions that potentially influences the natural history of SARS-CoV-2, its pathogenicity, and its transmission.
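
As a rough illustration of how an MFED percentage such as those quoted in the abstract can be derived, the sketch below assumes the commonly used definition: the percentage by which the minimum folding energy (MFE) of the native sequence exceeds the mean MFE of sequence-order-randomized controls. The abstract does not state the formula, so this definition, the function name `mfed_percent`, and the example numbers are all assumptions made for illustration; in practice the MFE values would come from an RNA folding program.

```python
from statistics import mean
from typing import Sequence


def mfed_percent(native_mfe: float, control_mfes: Sequence[float]) -> float:
    """Return the MFED as a percentage.

    Assumed definition (not stated in the abstract):
        MFED (%) = (MFE_native / mean(MFE_controls) - 1) * 100
    MFE values are folding free energies (e.g. kcal/mol) and are
    normally negative, so a more stable native fold gives MFED > 0.
    """
    if not control_mfes:
        raise ValueError("at least one shuffled control MFE is required")
    control_mean = mean(control_mfes)
    if control_mean >= 0:
        raise ValueError("mean control MFE must be negative")
    return (native_mfe / control_mean - 1.0) * 100.0


# Illustrative numbers only; they are not taken from the study.
example = mfed_percent(-115.0, [-100.0, -98.0, -102.0])
print(f"MFED = {example:.1f}%")  # prints "MFED = 15.0%"
```

Under this definition, an MFED of 15% means the native genome folds about 15% more stably than shuffled sequences of the same composition. The choice of shuffling algorithm for the controls affects the result and is left outside this sketch.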
