Whole-Genome Comparison of Representatives of All Variants of SARS-CoV-2, Including Subvariant BA.2 and the GKA Clade

Since its discovery at the end of 2019, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has rapidly evolved into many variants, including the subvariant BA.2 and the GKA clade. Genomic clarification is needed for better management of the current pandemic as well as the possible reemergence of novel variants. The sequence of the reference genome Wuhan-Hu-1 and approximately 20 representatives of each variant were downloaded from GenBank and GISAID. Two representatives with no track of in-definitive nucleotides were selected. The sequences were aligned using muscle. The location of insertion/deletion (indel) in the genome was mapped following the open reading frame (ORF) of Wuhan-Hu-1. The phylogeny of the spike protein coding region was constructed using the maximum likelihood method. Amino acid substitutions in all ORFs were analyzed separately. There are two indel sites in ORF1AB, eight in spike, and one each in ORF3A, matrix (MA), nucleoprotein (NP), and the 3′-untranslated regions (3′UTR). Some indel sites and residues/substitutions are not unique, and some are variant-specific. The phylogeny shows that Omicron, Deltacron, and BA2 are clustered together and separated from other variants with 100% bootstrap support. In conclusion, whole-genome comparison of representatives of all variants revealed indel patterns that are specific to SARS-CoV-2 variants or subvariants. Polymorphic amino acid comparison across all coding regions also showed amino acid residues shared by specific groups of variants. Finally, the higher transmissibility of BA.2 might be due at least in part to the 48 nucleotide deletions in the 3′UTR, while the seem-to-be extinction of GKA clade is due to the lack of genetic advantages as a consequence of amino acid substitutions in various genes.

[1]  K. To,et al.  Rapid spread of SARS-CoV-2 Omicron subvariant BA.2 in a single-source community outbreak , 2022, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[2]  S. Albukhaty,et al.  The relevant information about the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) using the five-question approach (when, where, what, why, and how) and its impact on the environment , 2022, Environmental Science and Pollution Research.

[3]  P. Desingu,et al.  Omicron BA.2 lineage spreads in clusters and is concentrated in Denmark , 2022, Journal of medical virology.

[4]  D. Barouch,et al.  Comparable Neutralization of the SARS-CoV-2 Omicron BA.1 and BA.2 Variants , 2022, medRxiv.

[5]  G. Viglietto,et al.  Whole-genome analysis of SARS-CoV-2 in a 2020 infection cluster in a nursing home of Southern Italy , 2022, Infection, Genetics and Evolution.

[6]  Elisabeth Mahase Omicron sub-lineage BA.2 may have “substantial growth advantage,” UKHSA reports , 2022, BMJ.

[7]  Freda Kreier Deltacron: the story of the variant that wasn’t , 2022, Nature.

[8]  OUP accepted manuscript , 2022, Clinical Infectious Diseases.

[9]  Kimihito Ito,et al.  Relative instantaneous reproduction number of Omicron SARS‐CoV‐2 variant with respect to the Delta variant in Denmark , 2021, Journal of medical virology.

[10]  M. Kandeel,et al.  Omicron variant genome evolution and phylogenetics , 2021, Journal of medical virology.

[11]  G. Leung,et al.  Estimating the transmission advantage of the D614G mutant strain of SARS-CoV-2, December 2019 to June 2020 , 2021, Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin.

[12]  T. Pumarola,et al.  A year living with SARS-CoV-2: an epidemiological overview of viral lineage circulation by whole-genome sequencing in Barcelona city (Catalonia, Spain) , 2021, Emerging microbes & infections.

[13]  C. Chakraborty,et al.  D614G mutation and SARS-CoV-2: impact on S-protein structure, function, infectivity, and immunity , 2021, Applied Microbiology and Biotechnology.

[14]  A. Kaneda,et al.  Neutralization of the SARS-CoV-2 Mu Variant by Convalescent and Vaccine Serum , 2021, The New England journal of medicine.

[15]  S. Hsu,et al.  D614G mutation in the SARS-CoV-2 spike protein enhances viral fitness by desensitizing it to temperature-dependent denaturation , 2021, Journal of Biological Chemistry.

[16]  Y. Liu,et al.  The reproductive number of the Delta variant of SARS-CoV-2 is far higher compared to the ancestral SARS-CoV-2 virus , 2021, Journal of travel medicine.

[17]  Guohong Deng,et al.  SARS-CoV-2 Quasispecies Provides an Advantage Mutation Pool for the Epidemic Variants , 2021, Microbiology spectrum.

[18]  Pei-Jer Chen,et al.  D614G Substitution of SARS-CoV-2 Spike Protein Increases Syncytium Formation and Virus Titer via Enhanced Furin-Mediated Spike Cleavage , 2021, mBio.

[19]  Irwin Jungreis,et al.  SARS-CoV-2 gene content and COVID-19 mutation impact by comparing 44 Sarbecovirus genomes , 2021, Nature Communications.

[20]  R. Jarman,et al.  A comparative recombination analysis of human coronaviruses and implications for the SARS-CoV-2 pandemic , 2021, Scientific Reports.

[21]  Martín S Marcial-Coba,et al.  COVID-19 outbreak in Malaysia: Decoding D614G mutation of SARS-CoV-2 virus isolated from an asymptomatic case in Pahang , 2021, Materials Today: Proceedings.

[22]  J. Evanseck,et al.  Conserved Elements in the 3'-UTR of SARS-CoV-2: Involvement in Genomic Dimerization and Interactions with Cellular Micrornas , 2021, Biophysical Journal.

[23]  Mallikarjuna Reddy Doodipala,et al.  Clinical and whole genome characterization of SARS-CoV-2 in India , 2021, PloS one.

[24]  N. Berthet,et al.  Intra-Host Diversity of SARS-Cov-2 Should Not Be Neglected: Case of the State of Victoria, Australia , 2021, Viruses.

[25]  Hanlee P. Ji,et al.  Profiling SARS-CoV-2 mutation fingerprints that range from the viral pangenome to individual infection quasispecies , 2020, medRxiv.

[26]  P. Rocchi,et al.  Coronavirus RNA Proofreading: Molecular Basis and Therapeutic Targeting , 2020, Molecular Cell.

[27]  A. Rahimi,et al.  Genetics and genomics of SARS-CoV-2: A review of the literature with the special focus on genetic diversity and SARS-CoV-2 genome detection , 2020, Genomics.

[28]  K. Izadpanah,et al.  Quasi-species nature and differential gene expression of severe acute respiratory syndrome coronavirus 2 and phylogenetic analysis of a novel Iranian strain , 2020, Infection, Genetics and Evolution.

[29]  V. Calvez,et al.  Evolution of viral quasispecies during SARS-CoV-2 infection , 2020, Clinical Microbiology and Infection.

[30]  Yang Han,et al.  The ORF3a protein of SARS-CoV-2 induces apoptosis in cells , 2020, Cellular & Molecular Immunology.

[31]  Yang Han,et al.  The ORF3a protein of SARS-CoV-2 induces apoptosis in cells , 2020, Cellular & Molecular Immunology.

[32]  F. Al-Mulla,et al.  Could the D614G substitution in the SARS-CoV-2 spike (S) protein be associated with higher COVID-19 mortality? , 2020, International Journal of Infectious Diseases.

[33]  P. Bhattacharjee,et al.  Mutational spectra of SARS‐CoV‐2 orf1ab polyprotein and signature mutations in the United States of America , 2020, bioRxiv.

[34]  F. Giorgi,et al.  Geographic and Genomic Distribution of SARS-CoV-2 Mutations , 2020, Frontiers in Microbiology.

[35]  Lianzi Wang,et al.  The genetic sequence, origin, and diagnosis of SARS-CoV-2 , 2020, European Journal of Clinical Microbiology & Infectious Diseases.

[36]  Marco D'Arienzo,et al.  Assessment of the SARS-CoV-2 basic reproduction number, R0, based on the early phase of COVID-19 outbreak in Italy , 2020, Biosafety and Health.

[37]  Weidong Wu,et al.  Virology, Epidemiology, Pathogenesis, and Control of COVID-19 , 2020, Viruses.

[38]  E. Holmes,et al.  A new coronavirus associated with human respiratory disease in China , 2020, Nature.

[39]  E. Domingo,et al.  Viral quasispecies , 2019, PLoS genetics.

[40]  Sudhir Kumar,et al.  MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms. , 2018, Molecular biology and evolution.

[41]  J. Leibowitz,et al.  The structure and functions of coronavirus genomic 3′ and 5′ ends , 2015, Virus Research.

[42]  C. Kao,et al.  Factors regulating template switch in vitro by viral RNA-dependent RNA polymerases: Implications for RNA–RNA recombination , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[43]  M. Lai,et al.  The 3' untranslated region of coronavirus RNA is required for subgenomic mRNA transcription from a defective interfering RNA , 1996, Journal of virology.

[44]  William R. Taylor,et al.  The rapid generation of mutation data matrices from protein sequences , 1992, Comput. Appl. Biosci..

[45]  H. Temin,et al.  Characterization of large deletions occurring during a single round of retrovirus vector replication: novel deletion mechanism involving errors in strand transfer , 1991, Journal of virology.