Genomic epidemiology of SARS-CoV-2 in the United Arab Emirates reveals novel virus mutation, patterns of co-infection and tissue specific host responses

Background: The United Arab Emirates is a major business hub with substantial amount of international travel. Like many other countries, it was greatly affected by the COVID-19 pandemic since late January 2020, with recurring waves of infection. This study aimed at combining genomic and epidemiological data to unravel the source of SARS-CoV-2 introduction, transmission and evolution in the country. Methods: We performed meta-transcriptomic sequencing of 1,067 nasopharyngeal swab samples collected from qRT-PCR positive COVID-19 patients in Abu Dhabi, UAE, between May 9th and June 29th 2020. We investigated the genetic diversity and transmission dynamics of the viral population and analyzed the infection and transmission potential of novel genomic clusters. Within-host SARS-CoV-2 genetic variation was analyzed to determine the occurrence and prevalence of multiple infections. Finally, we evaluated innate host responses during the prolonged period of local infection. Results: All globally known SARS-CoV-2 clades were identified within the UAE sequenced strains, with a higher occurrence of European and East Asian clades. We defined 5 subclades based on 11 unique genetic variants within the UAE strains, which were associated with no significantly different viral loads. Multiple infection of different SARS-CoV-2 strains was observed for at least 5% of the patients. We also discovered an enrichment of cytosine-to-uracil mutation among the viral population collected from the nasopharynx, that is different from the adenosine-to-inosine change previously observed in the bronchoalveolar lavage fluid samples. This observation is accompanied with an upregulation of APOBEC4, an under-studied putative cytidine-uridine editing enzyme in the infected nasopharynx. Conclusions: The genomic epidemiological and molecular biological knowledge obtained in the study provides new insights for the SARS-CoV-2 evolution and transmission. We highlight the importance of sustained surveillance of the virus mutation using genomic sequencing as a public health strategy. Keywords: SARS-CoV-2, meta-transcriptomic sequencing, novel mutations and subclades, co-infection, cyosine depletion, host RNA editing

[1]  M. Memoli,et al.  Navigating the Quagmire: Comparison and Interpretation of COVID-19 Vaccine Phase 1/2 Clinical Trials , 2020, Vaccines.

[2]  A. Alsheikh-Ali,et al.  Multiple early introductions of SARS-CoV-2 into a global travel hub in the Middle East , 2020, Scientific Reports.

[3]  Yiming Bao,et al.  The Global Landscape of SARS-CoV-2 Genomes, Variants, and Haplotypes in 2019nCoVR , 2020, bioRxiv.

[4]  Gavin J. D. Smith,et al.  Discovery and Genomic Characterization of a 382-Nucleotide Deletion in ORF7b and ORF8 during the Early Evolution of SARS-CoV-2 , 2020, mBio.

[5]  S. Rowland-Jones,et al.  Tracking Changes in SARS-CoV-2 Spike: Evidence that D614G Increases Infectivity of the COVID-19 Virus , 2020, Cell.

[6]  Samir Bhatt,et al.  Evolution and epidemic spread of SARS-CoV-2 in Brazil , 2020, Science.

[7]  M. Nandagopal,et al.  COVID-19: An Update on the Epidemiological, Genomic Origin, Phylogenetic study, India centric to Worldwide current status , 2020, medRxiv.

[8]  C. Eastin,et al.  Clinical Characteristics of Coronavirus Disease 2019 in China , 2020, The Journal of Emergency Medicine.

[9]  Yonatan H. Grad,et al.  Projecting the transmission dynamics of SARS-CoV-2 through the postpandemic period , 2020, Science.

[10]  Xiang Li,et al.  On the origin and continuing evolution of SARS-CoV-2 , 2020, National science review.

[11]  G. Gao,et al.  A Novel Coronavirus from Patients with Pneumonia in China, 2019 , 2020, The New England journal of medicine.

[12]  Trevor Bedford,et al.  Nextstrain: real-time tracking of pathogen evolution , 2017, bioRxiv.

[13]  HaroldC. Smith,et al.  The APOBEC Protein Family: United by Structure, Divergent in Function. , 2016, Trends in biochemical sciences.

[14]  Timothy B. Stockwell,et al.  Quantifying influenza virus diversity and transmission in humans , 2016, Nature Genetics.

[15]  Berk Hess,et al.  GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers , 2015 .

[16]  J. Dudley,et al.  APOBECs and virus restriction. , 2015, Virology.

[17]  Ernesto Picardi,et al.  REDItools: high-throughput RNA editing detection made easy , 2013, Bioinform..

[18]  Ellen T. Gelfand,et al.  The Genotype-Tissue Expression (GTEx) project , 2013, Nature Genetics.

[19]  Niall Johnson,et al.  Updating the Accounts: Global Mortality of the 1918-1920 "Spanish" Influenza Pandemic , 2002, Bulletin of the history of medicine.

[20]  B. Peters,et al.  Accurate whole genome sequencing as the ultimate genetic test. , 2015, Clinical Chemistry.