Global Spread of SARS-CoV-2 Subtype with Spike Protein Mutation D614G is Shaped by Human Genomic Variations that Regulate Expression of TMPRSS2 and MX1 Genes

COVID-19 pandemic is a major human tragedy. Worldwide, SARS-CoV-2 has already infected over 3 million and has killed about 230,000 people. SARS-CoV-2 originated in China and, within three months, has evolved to an additional 10 subtypes. One particular subtype with a non-silent (Aspartate to Glycine) mutation at 614th position of the Spike protein (D614G) rapidly outcompeted other pre-existing subtypes, including the ancestral. We assessed that D614G mutation generates an additional serine protease (Elastase) cleavage site near the S1-S2 junction of the Spike protein. We also identified that a single nucleotide deletion (delC) at a known variant site (rs35074065) in a cis-eQTL of TMPRSS2, is extremely rare in East Asians but is common in Europeans and North Americans. The delC allele facilitates entry of the 614G subtype into host cells, thus accelerating the spread of 614G subtype in Europe and North America where the delC allele is common. The delC allele at the cis-eQTL locus rs35074065 of TMPRSS2 leads to overexpression of both TMPRSS2 and a nearby gene MX1. The cis-eQTL site, rs35074065 overlaps with a transcription factor binding site of an activator (IRF1) and a repressor (IRF2). IRF1 activator can bind to variant delC allele, but IRF2 repressor fails to bind. Thus, in an individual carrying the delC allele, there is only activation, but no repression. On viral entry, IRF1 mediated upregulation of MX1 leads to neutrophil infiltration and processing of 614G mutated Spike protein by neutrophil Elastase. The simultaneous processing of 614G spike protein by TMPRSS2 and Elastase serine proteases facilitates the entry of the 614G subtype into host cells. Thus, SARS-CoV-2, particularly the 614G subtype, has spread more easily and with higher frequency to Europe and North America where the delC allele regulating expression of TMPRSS2 and MX1 host proteins is common, but not to East Asia where this allele is rare.

[1]  P. Majumder,et al.  Analysis of RNA sequences of 3636 SARS-CoV-2 collected from 55 countries reveals selective sweep of one virus type , 2020, The Indian journal of medical research.

[2]  Hans Clevers,et al.  SARS-CoV-2 productively infects human gut enterocytes , 2020, Science.

[3]  R. Woods,et al.  Neutrophil extracellular traps in COVID-19. , 2020, JCI insight.

[4]  David S. Fischer,et al.  Integrated analyses of single-cell atlases reveal age, gender, and smoking status associations with cell type-specific expression of mediators of SARS-CoV-2 viral entry and highlights inflammatory programs in putative target cells , 2020, bioRxiv.

[5]  Kari Stefansson,et al.  Spread of SARS-CoV-2 in the Icelandic Population , 2020, The New England journal of medicine.

[6]  Kyle J. Gaulton,et al.  Single Nucleus Multiomic Profiling Reveals Age-Dynamic Regulation of Host Genes Associated with SARS-CoV-2 Infection , 2020, bioRxiv.

[7]  Frederic A. Fellouse,et al.  Human ACE2 receptor polymorphisms predict SARS-CoV-2 susceptibility , 2020, bioRxiv.

[8]  Yan Zhao,et al.  Neutrophil-to-lymphocyte ratio as an independent risk factor for mortality in hospitalized patients with COVID-19 , 2020, Journal of Infection.

[9]  T. Skoff,et al.  Coronavirus Disease 2019 in Children — United States, February 12–April 2, 2020 , 2020, MMWR. Morbidity and mortality weekly report.

[10]  Colin Renfrew,et al.  Phylogenetic network analysis of SARS-CoV-2 genomes , 2020, Proceedings of the National Academy of Sciences.

[11]  K. Yuen,et al.  Structural and Functional Basis of SARS-CoV-2 Entry by Using Human ACE2 , 2020, Cell.

[12]  Tartaglia Marco,et al.  ACE2 variants underlie interindividual variability and susceptibility to COVID-19 in Italian population , 2020, medRxiv.

[13]  F. A. Lagunas-Rangel Neutrophil‐to‐lymphocyte ratio and lymphocyte‐to‐C‐reactive protein ratio in patients with severe coronavirus disease 2019 (COVID‐19): A meta‐analysis , 2020, Journal of medical virology.

[14]  Morteza Abdullatif Khafaie,et al.  Cross-Country Comparison of Case Fatality Rates of COVID-19/SARS-COV-2 , 2020, Osong public health and research perspectives.

[15]  Xiliang Wang,et al.  COVID-19: a new challenge for human beings , 2020, Cellular & Molecular Immunology.

[16]  Jarek Kobiela,et al.  Estimating case fatality rates of COVID-19 , 2020, The Lancet Infectious Diseases.

[17]  C. Whittaker,et al.  Estimates of the severity of coronavirus disease 2019: a model-based analysis , 2020, The Lancet Infectious Diseases.

[18]  Lanjuan Li,et al.  SARS-CoV-2: virus dynamics and host response , 2020, The Lancet Infectious Diseases.

[19]  G. Onder,et al.  Case-Fatality Rate and Characteristics of Patients Dying in Relation to COVID-19 in Italy. , 2020, JAMA.

[20]  Jin Tian,et al.  COVID-19: Epidemiology, Evolution, and Cross-Disciplinary Perspectives , 2020, Trends in Molecular Medicine.

[21]  E. Holmes,et al.  The proximal origin of SARS-CoV-2 , 2020, Nature Medicine.

[22]  Fumihiro Kato,et al.  Enhanced isolation of SARS-CoV-2 by TMPRSS2-expressing cells , 2020, Proceedings of the National Academy of Sciences.

[23]  R. Lu,et al.  Detection of SARS-CoV-2 in Different Types of Clinical Specimens. , 2020, JAMA.

[24]  A. Walls,et al.  Structure, Function, and Antigenicity of the SARS-CoV-2 Spike Glycoprotein , 2020, Cell.

[25]  G. Herrler,et al.  SARS-CoV-2 Cell Entry Depends on ACE2 and TMPRSS2 and Is Blocked by a Clinically Proven Protease Inhibitor , 2020, Cell.

[26]  A. M. Leontovich,et al.  The species Severe acute respiratory syndrome-related coronavirus: classifying 2019-nCoV and naming it SARS-CoV-2 , 2020, Nature Microbiology.

[27]  Chuan Qin,et al.  Dysregulation of immune response in patients with COVID-19 in Wuhan, China , 2020, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[28]  P. Horby,et al.  A novel coronavirus outbreak of global health concern , 2020, The Lancet.

[29]  G. Gao,et al.  A Novel Coronavirus from Patients with Pneumonia in China, 2019 , 2020, The New England journal of medicine.

[30]  A. Mócsai,et al.  Neutrophils as emerging therapeutic targets , 2020, Nature Reviews Drug Discovery.

[31]  Phillip A. Richmond,et al.  JASPAR 2020: update of the open-access database of transcription factor binding profiles , 2019, Nucleic Acids Res..

[32]  R. Rabin,et al.  IRF1 Maintains Optimal Constitutive Expression of Antiviral Genes and Regulates the Early Antiviral Response , 2019, Front. Immunol..

[33]  Sara R. Selitsky,et al.  Basal expression of interferon regulatory factor 1 drives intrinsic hepatocyte resistance to multiple RNA viruses , 2019, Nature Microbiology.

[34]  Lu Lu,et al.  A pan-coronavirus fusion inhibitor targeting the HR1 domain of human coronavirus spike , 2019, Science Advances.

[35]  C. D. Dela Cruz,et al.  BPIFA1 regulates lung neutrophil recruitment and interferon signaling during acute inflammation. , 2019, American journal of physiology. Lung cellular and molecular physiology.

[36]  Gholamreza Haffari,et al.  PROSPERous: high-throughput prediction of substrate cleavage sites for 90 proteases with improved accuracy , 2018, Bioinform..

[37]  K. Shirato,et al.  Wild-type human coronaviruses prefer cell-surface TMPRSS2 to endosomal cathepsins for cell entry , 2017, Virology.

[38]  J. Nyengaard,et al.  The TLR9 agonist MGN1703 triggers a potent type I interferon response in the sigmoid colon , 2017, Mucosal Immunology.

[39]  Trevor Bedford,et al.  Nextstrain: real-time tracking of pathogen evolution , 2017, bioRxiv.

[40]  Kimberly J. Hassett,et al.  Efficient Targeting and Activation of Antigen-Presenting Cells In Vivo after Modified mRNA Vaccine Administration in Rhesus Macaques , 2017, Molecular therapy : the journal of the American Society of Gene Therapy.

[41]  S. Kotenko,et al.  Interferon‐&lgr; Mediates Non‐redundant Front‐Line Antiviral Protection against Influenza Virus Infection without Compromising Host Fitness , 2017, Immunity.

[42]  Yuelong Shu,et al.  GISAID: Global initiative on sharing all influenza data – from vision to reality , 2017, Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin.

[43]  C. Johansson,et al.  Type I Interferons as Regulators of Lung Inflammation , 2017, Front. Immunol..

[44]  J. Homola,et al.  The Scavenger Receptor SSc5D Physically Interacts with Bacteria through the SRCR-Containing N-Terminal Domain , 2016, Front. Immunol..

[45]  S. Perlman,et al.  Proteolytic processing of Middle East respiratory syndrome coronavirus spikes expands virus tropism , 2016, Proceedings of the National Academy of Sciences.

[46]  B. Schilling,et al.  Type I IFNs induce anti‐tumor polarization of tumor associated neutrophils in mice and human , 2016, International journal of cancer.

[47]  Latarsha J. Carithers,et al.  The Genotype-Tissue Expression (GTEx) Project. , 2015, Biopreservation and biobanking.

[48]  Kairong Cui,et al.  Division of labor between IRF1 and IRF2 in regulating different stages of transcriptional activation in cellular antiviral activities , 2015, Cell & Bioscience.

[49]  A. von Haeseler,et al.  IQ-TREE: A Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood Phylogenies , 2014, Molecular biology and evolution.

[50]  Carson C Chow,et al.  Second-generation PLINK: rising to the challenge of larger and richer datasets , 2014, GigaScience.

[51]  Mark A. Miller,et al.  Neutrophil Elastase Causes Tissue Damage That Decreases Host Tolerance to Lung Infection with Burkholderia Species , 2014, PLoS pathogens.

[52]  Bo-guang Sun,et al.  Identification and characterization of a cell surface scavenger receptor cysteine-rich protein of Sciaenops ocellatus: bacterial interaction and its dependence on the conserved structural features of the SRCR domain. , 2013, Fish & shellfish immunology.

[53]  Trevor Bedford,et al.  Viral Phylodynamics , 2013, PLoS Comput. Biol..

[54]  K. Katoh,et al.  MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability , 2013, Molecular biology and evolution.

[55]  Christophe Fraser,et al.  Integrating Phylodynamics and Epidemiology to Estimate Transmission Diversity in Viral Epidemics , 2013, PLoS Comput. Biol..

[56]  Geoffrey I. Webb,et al.  PROSPER: An Integrated Feature-Based Tool for Predicting Protease Substrate Cleavage Sites , 2012, PloS one.

[57]  A. Tsung,et al.  Interferon regulatory factor-2 is protective against hepatic ischemia-reperfusion injury. , 2012, American journal of physiology. Gastrointestinal and liver physiology.

[58]  S. Moestrup,et al.  The Conserved Scavenger Receptor Cysteine-Rich Superfamily in Therapy and Diagnosis , 2011, Pharmacological Reviews.

[59]  Heng Li,et al.  Tabix: fast retrieval of sequence features from generic TAB-delimited files , 2011, Bioinform..

[60]  H. Hakonarson,et al.  ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data , 2010, Nucleic acids research.

[61]  Yang Zhang,et al.  I-TASSER: a unified platform for automated protein structure and function prediction , 2010, Nature Protocols.

[62]  Markus Eickmann,et al.  Cleavage of Influenza Virus Hemagglutinin by Airway Proteases TMPRSS2 and HAT Differs in Subcellular Localization and Susceptibility to Protease Inhibitors , 2010, Journal of Virology.

[63]  Christopher D. Paddock,et al.  Cellular Immune Responses to Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV) Infection in Senescent BALB/c Mice: CD4+ T Cells Are Important in Control of SARS-CoV Infection , 2009, Journal of Virology.

[64]  Pablo Librado,et al.  DnaSP v5: a software for comprehensive analysis of DNA polymorphism data , 2009, Bioinform..

[65]  G. Whittaker,et al.  Activation of the SARS coronavirus spike protein via sequential proteolytic cleavage at two distinct sites , 2009, Proceedings of the National Academy of Sciences.

[66]  C. Hsiao,et al.  Modeling the Early Events of Severe Acute Respiratory Syndrome Coronavirus Infection In Vitro , 2006, Journal of Virology.

[67]  David E. Swayne,et al.  Characterization of the Reconstructed 1918 Spanish Influenza Pandemic Virus , 2005, Science.

[68]  S. Morikawa,et al.  Protease-mediated enhancement of severe acute respiratory syndrome coronavirus infection. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[69]  Xi Rao,et al.  Identification of Two Critical Amino Acid Residues of the Severe Acute Respiratory Syndrome Coronavirus Spike Protein for Its Variation in Zoonotic Tropism Transition via a Double Substitution Strategy , 2005, Journal of Biological Chemistry.

[70]  J. R. Somoza,et al.  The structure of the extracellular region of human hepsin reveals a serine protease domain and a novel scavenger receptor cysteine-rich (SRCR) domain. , 2003, Structure.

[71]  H. Ohbayashi Neutrophil elastase inhibitors as treatment for COPD , 2002, Expert opinion on investigational drugs.

[72]  M. Gordon,et al.  Expression of interferon regulatory factor (IRF) genes and response to interferon-α in chronic myeloid leukaemia , 1997, Leukemia.

[73]  Sudhir Kumar,et al.  MEGA: Molecular Evolutionary Genetics Analysis software for microcomputers , 1994, Comput. Appl. Biosci..

[74]  F. Tajima Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. , 1989, Genetics.

[75]  Takashi Miyata,et al.  Structurally similar but functionally distinct factors, IRF-1 and IRF-2, bind to the same regulatory elements of IFN and IFN-inducible genes , 1989, Cell.

[76]  B. Weir,et al.  ESTIMATING F‐STATISTICS FOR THE ANALYSIS OF POPULATION STRUCTURE , 1984, Evolution; international journal of organic evolution.