Whole genome analysis of local Kenyan and global sequences unravels the epidemiological and molecular evolutionary dynamics of RSV genotype ON1 strains

The respiratory syncytial virus (RSV) group A variant with the 72-nucleotide duplication in the G gene, genotype ON1, was first detected in Kilifi in 2012 and has almost completely replaced previously circulating genotype GA2 strains. This replacement suggests some fitness advantage of ON1 over the GA2 viruses, and might be accompanied by important genomic substitutions in ON1 viruses. Close observation of such a new virus introduction over time provides an opportunity to better understand the transmission and evolutionary dynamics of the pathogen. We have generated and analyzed 184 RSV-A whole genome sequences (WGS) from Kilifi (Kenya) collected between 2011 and 2016, the first ON1 genomes from Africa and the largest collection globally from a single location. Phylogenetic analysis indicates that RSV-A transmission into this coastal Kenya location is characterized by multiple introductions of viral lineages from diverse origins but with varied success in local transmission. We identify signature amino acid substitutions between ON1 and GA2 viruses within genes encoding the surface proteins (G, F), polymerase (L) and matrix M2-1 proteins, some of which were identified as positively selected, and thereby provide an enhanced picture of RSV-A diversity. Furthermore, five of the eleven RSV open reading frames (ORF) (i.e. G, F, L, N and P), analyzed separately, formed distinct phylogenetic clusters for the two genotypes, which might suggest that coding regions outside of the most frequently studied G ORF play a role in the adaptation of RSV to host populations with the alternative possibility that some of the substitutions are nothing more than genetic hitchhikers. Our analysis provides insight into the epidemiological processes that define RSV spread, highlights the genetic substitutions that characterize emerging strains, and demonstrates the utility of large-scale WGS in molecular epidemiological studies. Author summary Respiratory syncytial virus (RSV) is the leading viral cause of severe pneumonia and bronchiolitis among infants and children globally. No vaccine exists to date. The high genetic variability of this RNA virus, characterized by group (A or B), genotype (within group) and variant (within genotype) replacement in populations, may pose a challenge to effective vaccine design by enabling immune response escape. To date most sequence data exists for the highly variable G gene encoding the RSV attachment protein, and there is little globally-sampled RSV genomic data to provide a fine resolution of the epidemiology and evolutionary dynamics of the pathogen. Here we use long-term RSV surveillance in coastal Kenya to track the introduction, spread and evolution of a new RSV genotype known as ON1 (having a 72-nucleotide duplication in the G gene). We present a set of 184 RSV-A whole genomes, including 176 of RSV ON1 (the first from Africa), describe patterns of local ON1 spread and show genome-wide changes between the two major RSV-A genotypes that may define the pathogen’s adaptation to the host. These findings have implications for vaccine design and improved understanding of RSV epidemiology and evolution.

[1]  C. Agoti,et al.  Surveillance of respiratory viruses in the outpatient setting in rural coastal Kenya: baseline epidemiological observations , 2018, Wellcome open research.

[2]  D. Noyola,et al.  Respiratory Syncytial Virus-A ON1 Genotype Emergence in Central Mexico in 2009 and Evidence of Multiple Duplication Events , 2018, The Journal of infectious diseases.

[3]  Stephanie J. Spielman phyphy: Python package for facilitating the execution and parsing of HyPhy standard analyses , 2018, J. Open Source Softw..

[4]  S. Madhi,et al.  Global, regional, and national disease burden estimates of acute lower respiratory infections due to respiratory syncytial virus in young children in 2015: a systematic review and modelling study , 2017, The Lancet.

[5]  Robin N. Thompson,et al.  Selection on non-antigenic gene segments of seasonal influenza A virus and its impact on adaptive evolution , 2017, bioRxiv.

[6]  C. Shaw,et al.  Sequence variability of the respiratory syncytial virus (RSV) fusion gene among contemporary and historical genotypes of RSV/A and RSV/B , 2017, PloS one.

[7]  R. Nielsen,et al.  The Evolutionary Pathway to Virulence of an RNA Virus , 2017, Cell.

[8]  Kristian G. Andersen,et al.  Experimental Evolution to Study Virus Emergence , 2017, Cell.

[9]  P. Cane,et al.  Spread and Evolution of Respiratory Syncytial Virus A Genotype ON1, Coastal Kenya, 2010–2015 , 2017, Emerging infectious diseases.

[10]  P. Kellam,et al.  Transmission patterns and evolution of respiratory syncytial virus in a community outbreak identified by genomic analysis , 2017, Virus evolution.

[11]  Måns Magnusson,et al.  MultiQC: summarize analysis results for multiple tools and samples in a single report , 2016, Bioinform..

[12]  Minh Nhat Le,et al.  Association of RSV-A ON1 genotype with Increased Pediatric Acute Lower Respiratory Tract Infection in Vietnam , 2016, Scientific Reports.

[13]  Olga Chernomor,et al.  Terrace Aware Data Structure for Phylogenomic Inference from Supermatrices , 2016, Systematic biology.

[14]  S. Khurana,et al.  Antigenic Fingerprinting following Primary RSV Infection in Young Children Identifies Novel Antigenic Sites and Reveals Unlinked Evolution of Human Antibody Repertoires to Fusion and Attachment Glycoproteins , 2016, PLoS pathogens.

[15]  P. Cane,et al.  Molecular Evolutionary Dynamics of Respiratory Syncytial Virus Group A in Recurrent Epidemics in Coastal Kenya , 2016, Journal of Virology.

[16]  O. Pybus,et al.  Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen) , 2016, Virus evolution.

[17]  V. Duvvuri,et al.  Genetic diversity and evolutionary insights of respiratory syncytial virus A ON1 genotype: global and local transmission dynamics , 2015, Scientific Reports.

[18]  P. Cane,et al.  Successive Respiratory Syncytial Virus Epidemics in Local Populations Arise from Multiple Variant Introductions, Providing Insights into Virus Persistence , 2015, Journal of Virology.

[19]  Trevor Bedford,et al.  Ebola Virus Epidemiology, Transmission, and Evolution during Seven Months in Sierra Leone , 2015, Cell.

[20]  T. Hartert,et al.  Functional Analysis of the 60-Nucleotide Duplication in the Respiratory Syncytial Virus Buenos Aires Strain Attachment Glycoprotein , 2015, Journal of Virology.

[21]  J. Pascale,et al.  Conservation of G-Protein Epitopes in Respiratory Syncytial Virus (Group A) Despite Broad Genetic Diversity: Is Antibody Selection Involved in Virus Evolution? , 2015, Journal of Virology.

[22]  Ben Murrell,et al.  Gene-wide identification of episodic selection. , 2015, Molecular biology and evolution.

[23]  P. Piedra,et al.  Infection with novel respiratory syncytial virus genotype Ontario (ON1) in adult hematopoietic cell transplant recipients, Texas, 2011-2013. , 2015, The Journal of infectious diseases.

[24]  P. Kellam,et al.  Local Evolutionary Patterns of Human Respiratory Syncytial Virus Derived from Whole-Genome Sequencing , 2015, Journal of Virology.

[25]  Ben Murrell,et al.  IDEPI: Rapid Prediction of HIV-1 Antibody Epitopes and Other Phenotypic Features from Sequence Data Using a Flexible Machine Learning Platform , 2014, PLoS Comput. Biol..

[26]  Anders Larsson,et al.  AliView: a fast and lightweight alignment viewer and editor for large datasets , 2014, Bioinform..

[27]  C. Scagnolari,et al.  Rapid spread of the novel respiratory syncytial virus A ON1 genotype, central Italy, 2011 to 2013. , 2014, Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin.

[28]  P. Cane,et al.  Rapid Spread and Diversification of Respiratory Syncytial Virus Genotype ON1, Kenya , 2014, Emerging infectious diseases.

[29]  Derrick E. Wood,et al.  Kraken: ultrafast metagenomic sequence classification using exact alignments , 2014, Genome Biology.

[30]  J. Richter,et al.  Epidemiology of respiratory syncytial virus in children in Cyprus during three consecutive winter seasons (2010–2013): age distribution, seasonality and association between prevalent genotypes and disease severity , 2014, Epidemiology and Infection.

[31]  K. Suwannakarn,et al.  Molecular characterization of human respiratory syncytial virus, 2010-2011: identification of genotype ON1 and a new subgroup B genotype in Thailand , 2013, Archives of Virology.

[32]  N. Yamamoto,et al.  Genetic analysis of attachment glycoprotein (G) gene in new genotype ON1 of human respiratory syncytial virus detected in Japan , 2013, Microbiology and immunology.

[33]  J. Liese,et al.  Novel Respiratory Syncytial Virus A Genotype, Germany, 2011–2012 , 2013, Emerging infectious diseases.

[34]  P. Lemey,et al.  The Comparative Genomics of Human Respiratory Syncytial Virus Subgroups A and B: Genetic Variability and Molecular Evolutionary Dynamics , 2013, Journal of Virology.

[35]  B. Eley,et al.  Novel Respiratory Syncytial Virus Subtype ON1 among Children, Cape Town, South Africa, 2012 , 2013, Emerging infectious diseases.

[36]  K. Katoh,et al.  MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability , 2013, Molecular biology and evolution.

[37]  P. Lemey,et al.  Genetic Variability among Complete Human Respiratory Syncytial Virus Subgroup A Genomes: Bridging Molecular Evolutionary Dynamics and Epidemiology , 2012, PloS one.

[38]  P. Cane,et al.  Group- and Genotype-Specific Neutralizing Antibody Responses Against Respiratory Syncytial Virus in Infants and Young Children With Severe Pneumonia , 2012, The Journal of infectious diseases.

[39]  Wai Lok Sibon Li,et al.  Accurate model selection of relaxed molecular clocks in bayesian phylogenetics. , 2012, Molecular biology and evolution.

[40]  P. Cane,et al.  Genetic Relatedness of Infecting and Reinfecting Respiratory Syncytial Virus Strains Identified in a Birth Cohort From Rural Kenya , 2012, The Journal of infectious diseases.

[41]  M. Suchard,et al.  Improving the accuracy of demographic and molecular clock model comparison while accommodating phylogenetic uncertainty. , 2012, Molecular biology and evolution.

[42]  Sergei L. Kosakovsky Pond,et al.  Detecting Individual Sites Subject to Episodic Diversifying Selection , 2012, PLoS genetics.

[43]  Sergey I. Nikolenko,et al.  SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing , 2012, J. Comput. Biol..

[44]  K. Marsh,et al.  HEALTH AND DEMOGRAPHIC SURVEILLANCE SYSTEM PROFILE Profile: The Kilifi Health and Demographic Surveillance System (KHDSS) , 2012 .

[45]  Shane S. Sturrock,et al.  Geneious Basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data , 2012, Bioinform..

[46]  Samir N. Patel,et al.  Genetic Variability of Human Respiratory Syncytial Virus A Strains Circulating in Ontario: A Novel Genotype with a 72 Nucleotide G Gene Duplication , 2012, PloS one.

[47]  J. Melero,et al.  Ten Years of Global Evolution of the Human Respiratory Syncytial Virus BA Genotype with a 60-Nucleotide Duplication in the G Protein Gene , 2010, Journal of Virology.

[48]  S. Madhi,et al.  Global burden of acute lower respiratory infections due to respiratory syncytial virus in young children: a systematic review and meta-analysis , 2010, The Lancet.

[49]  D. Nokes,et al.  Incidence and severity of respiratory syncytial virus pneumonia in rural Kenyan children identified through hospital surveillance. , 2009, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[50]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[51]  Cole Trapnell,et al.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome , 2009, Genome Biology.

[52]  Marco A. R. Ferreira,et al.  Bayesian analysis of elapsed times in continuous‐time Markov chains , 2008 .

[53]  Sébastien Lê,et al.  FactoMineR: An R Package for Multivariate Analysis , 2008 .

[54]  S. Ho,et al.  Relaxed Phylogenetics and Dating with Confidence , 2006, PLoS biology.

[55]  Anne-Mieke Vandamme,et al.  Genetic Variability and Molecular Evolution of the Human Respiratory Syncytial Virus Subgroup B Attachment G Protein , 2005, Journal of Virology.

[56]  Sergei L. Kosakovsky Pond,et al.  Not so different after all: a comparison of methods for detecting amino acid sites under selection. , 2005, Molecular biology and evolution.

[57]  O. Pybus,et al.  Bayesian coalescent inference of past population dynamics from molecular sequences. , 2005, Molecular biology and evolution.

[58]  Anne-Mieke Vandamme,et al.  Molecular Evolution and Circulation Patterns of Human Respiratory Syncytial Virus Subgroup A: Positively Selected Sites in the Attachment G Glycoprotein , 2004, Journal of Virology.

[59]  M. Galiano,et al.  Major changes in the G protein of human respiratory syncytial virus isolates introduced by a duplication of 60 nucleotides. , 2003, The Journal of general virology.

[60]  Ghislain Fournous,et al.  Prophage Genomics , 2003, Microbiology and Molecular Biology Reviews.

[61]  H. Shou,et al.  Molecular epidemiology of respiratory syncytial virus. , 2001, Chinese medical journal.

[62]  C B Hall,et al.  Circulation patterns of genetically distinct group A and B strains of human respiratory syncytial virus in a community. , 1998, The Journal of general virology.

[63]  J. Dopazo,et al.  Antigenic structure of the human respiratory syncytial virus G glycoprotein and relevance of hypermutation events for the generation of antigenic variants. , 1997, The Journal of general virology.

[64]  P. Cane Analysis of linear epitopes recognised by the primary human antibody response to a variable region of the attachment (G) protein of respiratory syncytial virus , 1997, Journal of medical virology.

[65]  M. Kashlev,et al.  Transcriptional arrest: Escherichia coli RNA polymerase translocates backward, leaving the 3' end of the RNA intact and extruded. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[66]  P. Cane,et al.  Evolution of subgroup A respiratory syncytial virus: evidence for progressive accumulation of amino acid changes in the attachment protein , 1995, Journal of virology.

[67]  J. Dopazo,et al.  Evolutionary pattern of human respiratory syncytial virus (subgroup A): cocirculating lineages and correlation of genetic and antigenic changes in the G glycoprotein , 1994, Journal of virology.

[68]  E. Norrby,et al.  Two distinct subtypes of human respiratory syncytial virus. , 1985, The Journal of general virology.

[69]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[70]  M. Suchard,et al.  phylogenetics with , 2012 .

[71]  P. Cane,et al.  Respiratory syncytial virus infection and disease in infants and young children observed from birth in Kilifi District, Kenya. , 2008, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[72]  Alexei J Drummond,et al.  Choosing appropriate substitution models for the phylogenetic analysis of protein-coding sequences. , 2006, Molecular biology and evolution.

[73]  Claude-Alain H. Roten,et al.  Theoretical and practical advances in genome halving , 2004 .

[74]  C. Lottaz,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2001 .

[75]  W. Sullender Respiratory syncytial virus genetic and antigenic diversity. , 2000, Clinical microbiology reviews.

[76]  T. A. Hall,et al.  BIOEDIT: A USER-FRIENDLY BIOLOGICAL SEQUENCE ALIGNMENT EDITOR AND ANALYSIS PROGRAM FOR WINDOWS 95/98/ NT , 1999 .

[77]  Sergei L. Kosakovsky Pond,et al.  BIOINFORMATICS APPLICATIONS , 2022 .