Deletions across the SARS-CoV-2 Genome: Molecular Mechanisms and Putative Functional Consequences of Deletions in Accessory Genes

The analysis of deletions may reveal evolutionary trends and provide new insight into the surprising variability and rapidly spreading capability that SARS-CoV-2 has shown since its emergence. To understand the factors governing genomic stability, it is important to define the molecular mechanisms of deletions in the viral genome. In this work, we performed a statistical analysis of deletions. Specifically, we analyzed correlations between deletions in the SARS-CoV-2 genome and repetitive elements and documented a significant association of deletions with runs of identical (poly-) nucleotides and direct repeats. Our analyses of deletions in the accessory genes of SARS-CoV-2 suggested that there may be a hypervariability in ORF7A and ORF8 that is not associated with repetitive elements. Such recurrent search in a “sequence space” of accessory genes (that might be driven by natural selection) did not yet cause increased viability of the SARS-CoV-2 variants. However, deletions in the accessory genes may ultimately produce new variants that are more successful compared to the viral strains with the conventional architecture of the SARS-CoV-2 accessory genes.

[1]  A. Godzik,et al.  Increased Frequency of Indels in Hypervariable Regions of SARS-CoV-2 Proteins—A Possible Signature of Adaptive Selection , 2022, Frontiers in Genetics.

[2]  C. Sorhouet,et al.  Consecutive deletions in a unique Uruguayan SARS-CoV-2 lineage evidence the genetic variability potential of accessory genes , 2022, PloS one.

[3]  E. Koonin,et al.  Template switching and duplications in SARS-CoV-2 genomes give rise to insertion variants that merit monitoring , 2021, Communications Biology.

[4]  I. Rogozin,et al.  The Functional Consequences of the Novel Ribosomal Pausing Site in SARS-CoV-2 Spike Glycoprotein RNA , 2021, International journal of molecular sciences.

[5]  M. Mohammed The percentages of SARS-CoV-2 protein similarity and identity with SARS-CoV and BatCoV RaTG13 proteins can be used as indicators of virus origin , 2021, Journal of Proteins and Proteomics.

[6]  T. Peacock,et al.  SARS-CoV-2 one year on: evidence for ongoing viral adaptation , 2021, The Journal of general virology.

[7]  G. Cheng,et al.  One year of SARS-CoV-2 evolution , 2021, Cell Host & Microbe.

[8]  W. P. Duprex,et al.  Recurrent deletions in the SARS-CoV-2 spike glycoprotein drive antibody escape , 2021, Science.

[9]  C. Sorhouet,et al.  A deletion in SARS‐CoV‐2 ORF7 identified in COVID‐19 outbreak in Uruguay , 2021, Transboundary and emerging diseases.

[10]  A. Rasmussen On the origins of SARS-CoV-2 , 2021, Nature Medicine.

[11]  L. Zinzula Lost in deletion: The enigmatic ORF8 protein of SARS-CoV-2 , 2020, Biochemical and Biophysical Research Communications.

[12]  V. Uversky,et al.  Questions concerning the proximal origin of SARS‐CoV‐2 , 2020, Journal of medical virology.

[13]  M. Sanak,et al.  The SARS-CoV-2 ORF10 is not essential in vitro or in vivo in humans , 2020, bioRxiv.

[14]  M. Rashid,et al.  SARS-CoV-2 ORF8 and SARS-CoV ORF8ab: Genomic Divergence and Functional Convergence , 2020, Pathogens.

[15]  Rui Luo,et al.  The ORF6, ORF8 and nucleocapsid proteins of SARS-CoV-2 inhibit type I interferon signaling pathway , 2020, Virus Research.

[16]  A. Addetia,et al.  Identification of multiple large deletions in ORF7a resulting in in-frame gene fusions in clinical SARS-CoV-2 isolates , 2020, Journal of Clinical Virology.

[17]  L. Aravind,et al.  Novel Immunoglobulin Domain Proteins Provide Insights into Evolution and Pathogenesis of SARS-CoV-2-Related Viruses , 2020, mBio.

[18]  J. Thompson,et al.  Characterization of accessory genes in coronavirus genomes , 2020, Virology Journal.

[19]  E. Holmes,et al.  The proximal origin of SARS-CoV-2 , 2020, Nature Medicine.

[20]  E. Holmes,et al.  We shouldn’t worry when a virus mutates during disease outbreaks , 2020, Nature Microbiology.

[21]  Federico M Giorgi,et al.  Genomic variance of the 2019‐nCoV coronavirus , 2020, Journal of medical virology.

[22]  A. Pfeifer,et al.  Attenuation of replication by a 29 nucleotide deletion in SARS-coronavirus acquired during the early stages of human-to-human transmission , 2018, Scientific Reports.

[23]  Samson S. Y. Wong,et al.  Severe Acute Respiratory Syndrome (SARS) Coronavirus ORF8 Protein Is Acquired from SARS-Related Coronavirus from Greater Horseshoe Bats through Recombination , 2015, Journal of Virology.

[24]  E. Koonin,et al.  Impairment of translation in neurons as a putative causative factor for autism , 2014, Biology Direct.

[25]  Rolf Hilgenfeld,et al.  Accessory proteins of SARS-CoV and other coronaviruses , 2014, Antiviral Research.

[26]  Krishna Shankara Narayanan,et al.  SARS coronavirus accessory proteins , 2007, Virus Research.

[27]  P. Rottier,et al.  The 29-Nucleotide Deletion Present in Human but Not in Animal Severe Acute Respiratory Syndrome Coronaviruses Disrupts the Functional Expression of Open Reading Frame 8 , 2007, Journal of Virology.

[28]  Andrew Pekosz,et al.  Structure and Intracellular Targeting of the SARS-Coronavirus Orf7a Accessory Protein , 2005, Structure.

[29]  S. Lovett Encoded errors: mutations and rearrangements mediated by misalignment at repetitive DNA sequences , 2004, Molecular microbiology.

[30]  Guoping Zhao,et al.  Molecular Evolution of the SARS Coronavirus During the Course of the SARS Epidemic in China , 2004, Science.

[31]  Alexey S Kondrashov,et al.  Context of deletions and insertions in human coding sequences , 2004, Human mutation.

[32]  S. Lovett,et al.  Recombination between repeats in Escherichia coli by a recA-independent, proximity-sensitive mechanism , 1994, Molecular and General Genetics MGG.

[33]  G. Dianov,et al.  Molecular mechanisms of deletion formation in Escherichia coli plasmids , 1991, Molecular and General Genetics MGG.

[34]  G. Dianov,et al.  Mechanisms of deletion formation in Escherichin coli plasmids , 1991, Molecular and General Genetics MGG.

[35]  D. Cooper,et al.  Gene deletions causing human genetic disease: mechanisms of mutagenesis and the role of the local DNA sequence environment , 1991, Human Genetics.

[36]  Sung Keun Kang,et al.  Molecular evolution of the SARS coronavirus during the course of the SARS epidemic in China. , 2004, Science.

[37]  R. Rappuoli,et al.  SARS — beginning to understand a new virus , 2003, Nature Reviews Microbiology.

[38]  X. L. Liu,et al.  Isolation and Characterization of Viruses Related to the SARS Coronavirus from Animals in Southern China , 2003, Science.

[39]  Luciano Milanesi,et al.  Computational analysis of mutation spectra , 2003, Briefings Bioinform..

[40]  S. Lovett,et al.  Instability of repetitive DNA sequences: The role of replication in multiple mechanisms , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[41]  F. D. de Serres,et al.  Similarity pattern analysis in mutational distributions. , 1999, Mutation research.

[42]  S. Lovett,et al.  Slipped Misalignment Mechanisms of Deletion Formation: In Vivo Susceptibility to Nucleases , 1999, Journal of bacteriology.

[43]  B. Michel,et al.  Isolation of a dnaE mutation which enhances RecA‐independent homologous recombination in the Escherichia coli chromosome , 1997, Molecular microbiology.

[44]  S. Lovett,et al.  Enhanced deletion formation by aberrant DNA replication in Escherichia coli. , 1997, Genetics.

[45]  S. Lovett,et al.  Stabilization of diverged tandem repeats by mismatch repair: evidence for deletion formation via a misaligned replication intermediate. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[46]  S. Warren,et al.  The Expanding World of Trinucleotide Repeats , 1996, Science.

[47]  E. Dervyn,et al.  Frequency of deletion formation decreases exponentially with distance between short direct repeats , 1994, Molecular microbiology.

[48]  L. Liu,et al.  recA-independent and recA-dependent intramolecular plasmid recombination. Differential homology requirement and distance effect. , 1994, Journal of molecular biology.

[49]  Q. Chou Minimizing deletion mutagenesis artifact during Taq DNA polymerase PCR by E. coli SSB. , 1992, Nucleic acids research.

[50]  R. Worton,et al.  Partial gene duplication as a cause of human disease , 1992, Human mutation.

[51]  R. Sinden,et al.  Preferential DNA secondary structure mutagenesis in the lagging strand of replication in E. coli , 1991, Nature.

[52]  A. Albertini,et al.  On the formation of spontaneous deletions: The importance of short sequence homologies in the generation of large deletions , 1982, Cell.

[53]  Tom Maniatis,et al.  The structure and evolution of the human β-globin gene family , 1980, Cell.

[54]  M. Inouye,et al.  Frameshift mutations and the genetic code. This paper is dedicated to Professor Theodosius Dobzhansky on the occasion of his 66th birthday. , 1966, Cold Spring Harbor symposia on quantitative biology.