A Phylodynamic Workflow to Rapidly Gain Insights into the Dispersal History and Dynamics of SARS-CoV-2 Lineages

Since the start of the COVID-19 pandemic, an unprecedented number of genomic sequences of the causative virus (SARS-CoV-2) have been generated and shared with the scientific community. The unparalleled volume of available genetic data presents a unique opportunity to gain real-time insights into the virus transmission during the pandemic, but also a daunting computational hurdle if analysed with gold-standard phylogeographic approaches. We here describe and apply an analytical pipeline that is a compromise between fast and rigorous analytical steps. As a proof of concept, we focus on the Belgium epidemic, with one of the highest spatial density of available SARS-CoV-2 genomes. At the global scale, our analyses confirm the importance of external introduction events in establishing multiple transmission chains in the country. At the country scale, our spatially-explicit phylogeographic analyses highlight that the national lockdown had a relatively low impact on both the lineage dispersal velocity and the long-distance dispersal events within Belgium. Our pipeline has the potential to be quickly applied to other countries or regions, with key benefits in complementing epidemiological analyses in assessing the impact of intervention measures or their progressive easement.

[1]  M. Mckee,et al.  If the world fails to protect the economy, COVID-19 will damage health not just now but also in the future , 2020, Nature Medicine.

[2]  William L. Hamilton,et al.  Screening of healthcare workers for SARS-CoV-2 highlights the role of asymptomatic carriage in COVID-19 transmission , 2020, medRxiv.

[3]  Daniel L. Ayres,et al.  Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10 , 2018, Virus evolution.

[4]  Guy Baele,et al.  Phylodynamic assessment of intervention strategies for the West African Ebola virus outbreak , 2018, Nature Communications.

[5]  Yuelong Shu,et al.  GISAID: Global initiative on sharing all influenza data – from vision to reality , 2017, Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin.

[6]  Wenjun Ma,et al.  Genomic Epidemiology of SARS-CoV-2 in Guangdong Province, China , 2020, Cell.

[7]  Z. Yang,et al.  A space-time process model for the evolution of DNA sequences. , 1995, Genetics.

[8]  Richard A Neher,et al.  TreeTime: Maximum-likelihood phylodynamic analysis , 2017, bioRxiv.

[9]  M. Suchard,et al.  Phylogeography takes a relaxed random walk in continuous space and time. , 2010, Molecular biology and evolution.

[10]  Rebecca Rose,et al.  Explaining the geographic spread of emerging epidemics: a framework for comparing viral phylogenies and environmental landscape data , 2016, BMC Bioinformatics.

[11]  Rebecca Rose,et al.  SERAPHIM: studying environmental rasters and phylogenetically informed movements , 2016, Bioinform..

[12]  Kai Zhao,et al.  A pneumonia outbreak associated with a new coronavirus of probable bat origin , 2020, Nature.

[13]  S. Tavaré Some probabilistic and statistical problems in the analysis of DNA sequences , 1986 .

[14]  E. Holmes,et al.  The proximal origin of SARS-CoV-2 , 2020, Nature Medicine.

[15]  Forrest W. Crawford,et al.  Unifying the spatial epidemiology and molecular evolution of emerging epidemics , 2012, Proceedings of the National Academy of Sciences.

[16]  Nuno R. Faria,et al.  A Genomic Survey of SARS-CoV-2 Reveals Multiple Introductions into Northern California without a Predominant Lineage , 2020, medRxiv.

[17]  Andrew K. Przybylski,et al.  Multidisciplinary research priorities for the COVID-19 pandemic: a call for action for mental health science , 2020, The Lancet Psychiatry.

[18]  E. Holmes,et al.  Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding , 2020, The Lancet.

[19]  G. Gao,et al.  A Novel Coronavirus from Patients with Pneumonia in China, 2019 , 2020, The New England journal of medicine.

[20]  E. Holmes,et al.  An emergent clade of SARS-CoV-2 linked to returned travellers from Iran , 2020, bioRxiv.

[21]  P. Sagulenko Maximum likelihood phylodynamic analysis , 2017 .

[22]  Guy Baele,et al.  Recent advances in computational phylodynamics. , 2018, Current opinion in virology.

[23]  Wenjun Ma,et al.  Genomic Epidemiology of SARS-CoV-2 in Guangdong Province, China , 2020, Cell.

[24]  D. Hamer Faculty Opinions recommendation of Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding. , 2020, Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature.

[25]  Trevor Bedford,et al.  Nextstrain: real-time tracking of pathogen evolution , 2017, bioRxiv.

[26]  Daniel L. Ayres,et al.  BEAGLE 3: Improved Performance, Scaling, and Usability for a High-Performance Computing Library for Statistical Phylogenetics , 2019, Systematic biology.

[27]  Joshua B. Singer,et al.  GLUE: a flexible software system for virus sequence data , 2018, BMC Bioinformatics.

[28]  Alexei J. Drummond,et al.  Bayesian Phylogeography Finds Its Roots , 2009, PLoS Comput. Biol..

[29]  V. Patel Faculty Opinions recommendation of Multidisciplinary research priorities for the COVID-19 pandemic: a call for action for mental health science. , 2020, Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature.

[30]  Andrew Rambaut,et al.  Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen) , 2016, Virus evolution.

[31]  Olga Chernomor,et al.  IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era , 2020, Molecular biology and evolution.