Computational Identification of Antigenicity-Associated Sites in the Hemagglutinin Protein of A/H1N1 Seasonal Influenza Virus

The antigenic variability of influenza viruses has always made influenza vaccine development challenging. The punctuated nature of antigenic drift of influenza virus suggests that a relatively small number of genetic changes or combinations of genetic changes may drive changes in antigenic phenotype. The present study aimed to identify antigenicity-associated sites in the hemagglutinin protein of A/H1N1 seasonal influenza virus using computational approaches. Random Forest Regression (RFR) and Support Vector Regression based on Recursive Feature Elimination (SVR-RFE) were applied to H1N1 seasonal influenza viruses and used to analyze the associations between amino acid changes in the HA1 polypeptide and antigenic variation based on hemagglutination-inhibition (HI) assay data. Twenty-three and twenty antigenicity-associated sites were identified by RFR and SVR-RFE, respectively, by considering the joint effects of amino acid residues on antigenic drift. Our proposed approaches were further validated with the H3N2 dataset. The prediction models developed in this study can quantitatively predict antigenic differences with high prediction accuracy based only on HA1 sequences. Application of the study results can increase understanding of H1N1 seasonal influenza virus antigenic evolution and accelerate the selection of vaccine strains.

[1]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[2]  Gleider Hernández,et al.  Effectiveness of vaccines in preventing hospitalization due to COVID-19: A multicenter hospital-based case-control study, Germany, June 2021 to January 2022 , 2022, Vaccine.

[3]  Yu-Chieh Liao,et al.  Identifying potential immunodominant positions and predicting antigenic variants of influenza A/H3N2 viruses. , 2007, Vaccine.

[4]  Chao A. Hsiung,et al.  Bioinformatics models for predicting antigenic variants of influenza A/H3N2 virus , 2008, Bioinform..

[5]  Jinn-Moon Yang,et al.  Antigenic sites of H1N1 influenza virus hemagglutinin revealed by natural isolates and inhibition assays. , 2012, Vaccine.

[6]  Épidémiologique Hebdomadaire Recommended composition of influenza virus vaccines for use in 2000. , 1999, Releve epidemiologique hebdomadaire.

[7]  G G Brownlee,et al.  The predicted antigenicity of the haemagglutinin of the 1918 Spanish influenza pandemic suggests an avian origin. , 2001, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[8]  Frank L. Horsfall,et al.  PERSISTENT ANTIGENIC VARIATION OF INFLUENZA A VIRUSES AFTER INCOMPLETE NEUTRALIZATION IN OVO WITH HETEROLOGOUS IMMUNE SERUM , 1950, The Journal of experimental medicine.

[9]  Influenza, Seasonal , 2016, My Child Is Sick!, 2nd Ed.

[10]  E. D. Kilbourne Influenza Pandemics of the 20th Century , 2006, Emerging infectious diseases.

[11]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[12]  Q. Jiang,et al.  Antigenic and genetic variation in the hemagglutinins of H1N1 and H3N2 human influenza a viruses in the Shanghai area from 2005 to 2008 , 2011, Journal of medical virology.

[13]  Richard A. Goldstein,et al.  Changing Selective Pressure during Antigenic Changes in Human Influenza H3 , 2008, PLoS pathogens.

[14]  J. Plotkin,et al.  Single Hemagglutinin Mutations That Alter both Antigenicity and Receptor Binding Avidity Influence Influenza Virus Antigenic Clustering , 2013, Journal of Virology.

[15]  Tong Zhang,et al.  Using Sequence Data To Infer the Antigenicity of Influenza Virus , 2013, mBio.

[16]  Wilfred Ndifon,et al.  On the use of hemagglutination-inhibition for influenza surveillance: surveillance data are predictive of influenza vaccine effectiveness. , 2009, Vaccine.

[17]  A. Kelso,et al.  Influenza viruses received and tested by the Melbourne WHO Collaborating Centre for Reference and Research on Influenza annual report, 2014. , 2015, Communicable diseases intelligence quarterly report.

[18]  A. Lapedes,et al.  Mapping the Antigenic and Genetic Evolution of Influenza Virus , 2004, Science.

[19]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[20]  Ting Wang,et al.  Application of Breiman's Random Forest to Modeling Structure-Activity Relationships of Pharmaceutical Molecules , 2004, Multiple Classifier Systems.

[21]  I. Wilson,et al.  Networks link antigenic and receptor-binding sites of influenza hemagglutinin: Mechanistic insight into fitter strain propagation , 2011, Scientific reports.

[22]  T. Tatusova,et al.  The Influenza Virus Resource at the National Center for Biotechnology Information , 2007, Journal of Virology.

[23]  N. Cox,et al.  Antigenic drift in the evolution of H1N1 influenza A viruses resulting from deletion of a single amino acid in the haemagglutinin gene. , 2007, The Journal of general virology.

[24]  Aixia Guo,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2014 .

[25]  D. Basak,et al.  Support Vector Regression , 2008 .

[26]  A. McQuarrie,et al.  Regression and Time Series Model Selection , 1998 .

[27]  M. Catton,et al.  Strengthening Australia?s WHO Collaborating Centre for Reference and Research on Influenza , 2006 .

[28]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[29]  Min-Shi Lee,et al.  Predicting Antigenic Variants of Influenza A/H3N2 Viruses , 2004, Emerging infectious diseases.

[30]  Katsuhisa Nakajima,et al.  Recent human influenza A (H1N1) viruses are closely related genetically to strains isolated in 1950 , 1978, Nature.

[31]  D. Burke,et al.  Substitutions Near the Receptor Binding Site Determine Major Antigenic Change During Influenza Virus Evolution , 2013, Science.

[32]  Shyh-Huei Chen,et al.  A support vector machine approach for detecting gene‐gene interaction , 2008, Genetic epidemiology.

[33]  J. Tam,et al.  Molecular Evolution of Human Influenza A/H3N2 Virus in Asia and Europe from 2001 to 2003 , 2005, Journal of Clinical Microbiology.

[34]  Jinn-Moon Yang,et al.  Co-evolution positions and rules for antigenic variants of human influenza A/H3N2 viruses , 2009, BMC Bioinformatics.

[35]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .