SARS-CoV-2 transmission chains from genetic data: a Danish case study

Background The COVID-19 pandemic caused by the SARS-CoV-2 virus started in China in December 2019 and has since spread globally. Information about the spread of the virus in a country can inform the gradual reopening of a country and help to avoid a second wave of infections. Denmark is currently opening up after a lockdown in mid-March. Methods We perform a phylogenetic analysis of 742 publicly available Danish SARS-CoV-2 genome sequences and put them into context using sequences from other countries. Result Our findings are consistent with several introductions of the virus to Denmark from independent sources. We identify several chains of mutations that occurred in Denmark and in at least one case find evidence that the virus spread from Denmark to other countries. A number of the mutations found in Denmark are non-synonymous, and in general there is a considerable variety of strains. The proportions of the most common haplotypes is stable after lockdown. Conclusion Our work shows how genetic data can be used to identify routes of introduction of a virus into a region and provide alternative means for verifying existing assumptions. For example, our analysis supports the hypothesis that the virus was brought to Denmark by skiers returning from Ischgl. On the other hand, we identify transmission chains suggesting that Denmark was part of a network of countries among which the virus was being transmitted; thus challenging the common narrative that Denmark only got infected from abroad. Our analysis does not indicate that the major haplotypes appearing in Denmark have a different degree of virality. Our methods can be applied to other countries, regions or even highly localised outbreaks. When used in real-time, we believe they can serve to identify transmission events and supplement traditional methods such as contact tracing.

[1]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[2]  Stefan Elbe,et al.  Data, disease and diplomacy: GISAID's innovative contribution to global health , 2017, Global challenges.

[3]  Erik Wright,et al.  DECIPHER: harnessing local sequence context to improve protein multiple sequence alignment , 2015, BMC Bioinformatics.

[4]  Emmanuel Paradis,et al.  ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R , 2018, Bioinform..

[5]  Mehmet Ozaslan,et al.  Genomic characterization of a novel SARS-CoV-2 , 2020, Gene Reports.

[6]  David Robertson,et al.  CoV-GLUE: A Web Application for Tracking SARS-CoV-2 Genomic Variation , 2020 .

[7]  Y. Hu,et al.  Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China , 2020, The Lancet.

[8]  C. Campèse,et al.  First cases of coronavirus disease 2019 (COVID-19) in France: surveillance, investigations and control measures, January 2020 , 2020, Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin.

[9]  Yuelong Shu,et al.  GISAID: Global initiative on sharing all influenza data – from vision to reality , 2017, Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin.

[10]  F. Balloux,et al.  Emergence of genomic diversity and recurrent mutations in SARS-CoV-2 , 2020, Infection, Genetics and Evolution.

[11]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[12]  S. Tokajian,et al.  SARS-CoV-2 and ORF3a: Nonsynonymous Mutations, Functional Domains, and Viral Pathogenesis , 2020, mSystems.

[13]  Edward C. Holmes,et al.  A dynamic nomenclature proposal for SARS-CoV-2 to assist genomic epidemiology , 2020, bioRxiv.

[14]  Trevor Bedford,et al.  Nextstrain: real-time tracking of pathogen evolution , 2017, bioRxiv.

[15]  Klaus Peter Schliep,et al.  phangorn: phylogenetic analysis in R , 2010, Bioinform..

[16]  Erik S. Wright,et al.  Using DECIPHER v2.0 to Analyze Big Biological Sequence Data in R , 2016, R J..

[17]  Kari Stefansson,et al.  Spread of SARS-CoV-2 in the Icelandic Population , 2020, The New England journal of medicine.

[18]  Guangchuang Yu,et al.  Using ggtree to Visualize Data on Tree‐Like Structures , 2020, Current protocols in bioinformatics.