How much should we sequence? An analysis of the Swiss SARS- CoV-2 surveillance effort

Background During the SARS-CoV-2 pandemic, many countries directed substantial resources towards genomic surveillance to detect and track viral variants. There is a debate over how much sequencing effort is necessary in national surveillance programs for SARS-CoV-2 and future pandemic threats. Aim We aimed to investigate the effect of reduced sequencing on surveillance outcomes in a large genomic dataset from Switzerland, comprising more than 143k sequences. Methods We employed a uniform downsampling strategy using 100 iterations each to investigate the effects of fewer available sequences on the surveillance outcomes: (i) first detection of variants of concern (VOCs), (ii) speed of introduction of VOCs, (iii) diversity of lineages, (iv) first cluster detection of VOCs, (v) density of active clusters, and (vi) geographic spread of clusters. Results The impact of downsampling on VOC detection is disparate for the three VOC lineages , but many outcomes including introduction and cluster detection could be recapitulated even with only 35% of the original sequencing effort. The effect on the observed speed of introduction and first detection of clusters was more sensitive to reduced sequencing effort for some VOCs, in particular Omicron and Delta, respectively. Conclusion A genomic surveillance program needs a balance between societal benefits and costs. While the overall national dynamics of the pandemic could be recapitulated by a reduced sequencing effort, the effect is strongly lineage dependent - something that is unknown at the time of sequencing - and comes at the cost of accuracy, in particular for tracking the emergence of potential VOCs.

[1]  Stephen L. Leib,et al.  The Swiss Pathogen Surveillance Platform – towards a nation-wide One Health data exchange platform for bacterial, viral and fungal genomics and associated metadata , 2023, Microbial genomics.

[2]  D. Montefiori,et al.  A Bivalent Omicron-Containing Booster Vaccine against Covid-19 , 2022, The New England journal of medicine.

[3]  G. Lozanski,et al.  Durability of Booster mRNA Vaccine against SARS-CoV-2 BA.2.12.1, BA.4, and BA.5 Subvariants , 2022, The New England journal of medicine.

[4]  J. Joy,et al.  Genomic epidemiology of the first two waves of SARS-CoV-2 in Canada , 2022, eLife.

[5]  A. Gesierich,et al.  Virus variant–specific clinical performance of SARS coronavirus two rapid antigen tests in point-of-care use, from November 2020 to January 2022 , 2022, Clinical Microbiology and Infection.

[6]  Fei Shao,et al.  BA.2.12.1, BA.4 and BA.5 escape antibodies elicited by Omicron infection , 2022, Nature.

[7]  T. Woodruff,et al.  Nucleocapsid Specific Diagnostics for the Detection of Divergent SARS-CoV-2 Variants , 2022, Frontiers in Immunology.

[8]  P. Corbisier,et al.  A qualitative RT-PCR assay for the specific identification of the SARS-CoV-2 B.1.1.529 (Omicron) Variant of Concern , 2022, Journal of Clinical Virology.

[9]  Nuno R. Faria,et al.  Using multiple sampling strategies to estimate SARS-CoV-2 epidemiological parameters from genomic sequencing data , 2022, Nature Communications.

[10]  Nuno R. Faria,et al.  Global disparities in SARS-CoV-2 genomic surveillance , 2021, medRxiv.

[11]  G. Saberwal,et al.  The lag in SARS-CoV-2 genome submissions to GISAID , 2021, Nature Biotechnology.

[12]  Y. Liu,et al.  The reproductive number of the Delta variant of SARS-CoV-2 is far higher compared to the ancestral SARS-CoV-2 virus , 2021, Journal of travel medicine.

[13]  J. Nkengasong,et al.  Genomic-informed pathogen surveillance in Africa: opportunities and challenges , 2021, The Lancet Infectious Diseases.

[14]  E. Hodcroft,et al.  Public health actions to control new SARS-CoV-2 variants , 2021, Cell.

[15]  P. Febbo,et al.  Genomic surveillance at scale is required to detect newly emerging strains at an early timepoint , 2021 .

[16]  Michael T. Wolfinger,et al.  Genomic epidemiology of superspreading events in Austria reveals mutational dynamics and transmission properties of SARS-CoV-2 , 2020, Science Translational Medicine.

[17]  A. Rambaut,et al.  Author Correction: Rapid SARS-CoV-2 whole-genome sequencing and analysis for informed public health decision-making in the Netherlands , 2020, Nature Medicine.

[18]  Edward C. Holmes,et al.  A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology , 2020, Nature Microbiology.

[19]  P. Lemey,et al.  Temporal signal and the phylodynamic threshold of SARS-CoV-2 , 2020, bioRxiv.

[20]  P. Lemey,et al.  Tracking virus outbreaks in the twenty-first century , 2018, Nature Microbiology.

[21]  I. Xenarios,et al.  Improving the quality and workflow of bacterial genome sequencing and analysis: paving the way for a Switzerland-wide molecular epidemiological surveillance platform. , 2018, Swiss medical weekly.

[22]  Jennifer L. Gardy,et al.  Towards a genomics-informed, real-time, global pathogen surveillance system , 2017, Nature Reviews Genetics.

[23]  K. Katoh,et al.  MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability , 2013, Molecular biology and evolution.

[24]  Guidance for representative and targeted genomic SARS-CoV-2 monitoring , 2021 .

[25]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .