Early pandemic molecular diversity of SARS-CoV-2 in children

Background: In the US, community circulation of SARS-CoV-2 likely began in February 2020, following a period of mostly travel-related cases. Children's Hospital of Philadelphia began testing pediatric and adult patients on March 9, 2020, and all admitted patients on April 1, 2020, allowing an early glimpse into the local molecular epidemiology of the virus.

Methods: We obtained 169 SARS-CoV-2 samples (83 from patients <21 years old) collected from March through May 2020 and produced whole genome sequences. We used genotyping tools to track variants over time and to test for possible genotype-associated clinical presentations and outcomes in children.

Results: Our analysis uncovered 13 major lineages that changed in relative abundance as cases peaked in mid-April in Philadelphia. We detected at least 6 introductions of distinct viral variants into the population. As a group, children had more diverse virus genotypes than the adults tested. No strong differences in clinical variables were associated with genotypes.

Conclusions: Whole genome analysis revealed unexpected diversity and distinct circulating viral variants within the initial peak of cases in Philadelphia. Most introductions appeared to come from nearby states. Although limited by sample size, we found no evidence that different genotypes had different clinical impacts in children in this study.
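The abstract does not say how genotype diversity was compared between children and adults. One common approach, shown here only as a minimal sketch, is to compute the Shannon diversity index for each group's lineage counts and compare the two with Hutcheson's t-test. The function names and the lineage counts below are hypothetical and chosen for illustration; they are not data from the study.

```python
import math

def shannon_index(counts):
    """Shannon diversity H = -sum(p_i * ln p_i) over non-zero counts."""
    n = sum(counts)
    return -sum((c / n) * math.log(c / n) for c in counts if c > 0)

def shannon_variance(counts):
    """Hutcheson's approximate variance of the Shannon index."""
    n = sum(counts)
    props = [c / n for c in counts if c > 0]
    s = len(props)  # number of observed genotypes
    sum_p_ln_sq = sum(p * math.log(p) ** 2 for p in props)
    sum_p_ln = sum(p * math.log(p) for p in props)
    return (sum_p_ln_sq - sum_p_ln ** 2) / n + (s - 1) / (2 * n ** 2)

def hutcheson_t(counts_a, counts_b):
    """Return (t statistic, approximate degrees of freedom) for H_a vs H_b."""
    h_a, h_b = shannon_index(counts_a), shannon_index(counts_b)
    v_a, v_b = shannon_variance(counts_a), shannon_variance(counts_b)
    n_a, n_b = sum(counts_a), sum(counts_b)
    t = (h_a - h_b) / math.sqrt(v_a + v_b)
    df = (v_a + v_b) ** 2 / (v_a ** 2 / n_a + v_b ** 2 / n_b)
    return t, df

# Hypothetical samples-per-lineage counts (illustrative only, not study data).
children_counts = [10, 8, 6, 4, 2]
adult_counts = [20, 5, 3, 2]

t_stat, dof = hutcheson_t(children_counts, adult_counts)
print(f"H children = {shannon_index(children_counts):.3f}")
print(f"H adults   = {shannon_index(adult_counts):.3f}")
print(f"t = {t_stat:.3f}, df = {dof:.1f}")
```

A positive t statistic here indicates higher diversity in the first group; a p-value would come from a t distribution with the returned degrees of freedom. With small samples such as those in this study, the approximation should be read cautiously.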
