Machine Learning Classification of Time since BNT162b2 COVID-19 Vaccination Based on Array-Measured Antibody Activity

Vaccines trigger an immunological response that includes B and T cells, with B cells producing antibodies. SARS-CoV-2 immunity weakens over time after vaccination. Discovering key changes in antigen-reactive antibodies over time after vaccination could help improve vaccine efficiency. In this study, we collected data on blood antibody levels in a cohort of healthcare workers vaccinated for COVID-19 and obtained 73 antigens in samples from four groups according to the duration after vaccination, including 104 unvaccinated healthcare workers, 534 healthcare workers within 60 days after vaccination, 594 healthcare workers between 60 and 180 days after vaccination, and 141 healthcare workers over 180 days after vaccination. Our work was a reanalysis of the data originally collected at Irvine University. This data was obtained in Orange County, California, USA, with the collection process commencing in December 2020. British variant (B.1.1.7), South African variant (B.1.351), and Brazilian/Japanese variant (P.1) were the most prevalent strains during the sampling period. An efficient machine learning based framework containing four feature selection methods (least absolute shrinkage and selection operator, light gradient boosting machine, Monte Carlo feature selection, and maximum relevance minimum redundancy) and four classification algorithms (decision tree, k-nearest neighbor, random forest, and support vector machine) was designed to select essential antibodies against specific antigens. Several efficient classifiers with a weighted F1 value around 0.75 were constructed. The antigen microarray used for identifying antibody levels in the coronavirus features ten distinct SARS-CoV-2 antigens, comprising various segments of both nucleocapsid protein (NP) and spike protein (S). This study revealed that S1 + S2, S1.mFcTag, S1.HisTag, S1, S2, Spike.RBD.His.Bac, Spike.RBD.rFc, and S1.RBD.mFc were most highly ranked among all features, where S1 and S2 are the subunits of Spike, and the suffixes represent the tagging information of different recombinant proteins. Meanwhile, the classification rules were obtained from the optimal decision tree to explain quantitatively the roles of antigens in the classification. This study identified antibodies associated with decreased clinical immunity based on populations with different time spans after vaccination. These antibodies have important implications for maintaining long-term immunity to SARS-CoV-2.

[1]  Tao Huang,et al.  Identification of Genes Associated with the Impairment of Olfactory and Gustatory Functions in COVID-19 via Machine-Learning Methods , 2023, Life.

[2]  Lei Chen,et al.  PMPTCE-HNEA: Predicting metabolic pathway types of chemicals and enzymes with a heterogeneous network embedding algorithm , 2023, Current Bioinformatics.

[3]  J. Eiros,et al.  Evolution of antibody profiles against SARS-CoV-2 in experienced and naïve vaccinated elderly people , 2023, Frontiers in Immunology.

[4]  Fei Huang,et al.  Identification of Smoking-Associated Transcriptome Aberration in Blood with Machine Learning Methods , 2023, BioMed research international.

[5]  Fei Huang,et al.  Analysis and prediction of protein stability based on interaction network, gene ontology, and KEGG pathway enrichment scores. , 2023, Biochimica et biophysica acta. Proteins and proteomics.

[6]  Lei Chen,et al.  A model with deep analysis on a large drug network for drug classification. , 2023, Mathematical biosciences and engineering : MBE.

[7]  Tao Huang,et al.  Identification of Methylation Signatures and Rules for Sarcoma Subtypes by Machine Learning Methods , 2022, BioMed research international.

[8]  P. Horváth,et al.  Analysis and comparison of SARS-CoV-2 variant antibodies and neutralizing activity for 6 months after a booster mRNA vaccine in a healthcare worker population , 2022, Frontiers in Immunology.

[9]  A. Varizhuk,et al.  Microarray Profiling of Vaccination-Induced Antibody Responses to SARS-CoV-2 Variants of Interest and Concern , 2022, International journal of molecular sciences.

[10]  Minal K. Patel,et al.  Duration of effectiveness of vaccination against COVID-19 caused by the omicron variant , 2022, The Lancet Infectious Diseases.

[11]  Lei Chen,et al.  Identification of human protein subcellular location with multiple networks , 2022, Current Protoemics.

[12]  A. Sette,et al.  Humoral and cellular immune memory to four COVID-19 vaccines , 2022, Cell.

[13]  Lei Chen,et al.  Drug-Drug Interactions Prediction Using Fingerprint Only , 2022, Computational and mathematical methods in medicine.

[14]  S. Ünal,et al.  Antibody response with SARS-CoV-2 inactivated vaccine (CoronaVac) in Turkish geriatric population , 2022, Age and ageing.

[15]  E. Dolgin Pan-coronavirus vaccine pipeline takes form , 2022, Nature Reviews Drug Discovery.

[16]  A. Linē,et al.  Early and strong antibody responses to SARS-CoV-2 predict disease severity in COVID-19 patients , 2022, Journal of Translational Medicine.

[17]  P. Horváth,et al.  Persistence of SARS-CoV-2 Antibodies in Vaccinated Health Care Workers Analyzed by Coronavirus Antigen Microarray , 2022, Frontiers in Immunology.

[18]  Tao Huang,et al.  Identification of Cell Markers and Their Expression Patterns in Skin Based on Single-Cell RNA-Sequencing Profiles , 2022, Life.

[19]  Lei Chen,et al.  iATC-NFMLP: Identifying classes of anatomical therapeutic chemicals based on drug networks, fingerprints and multilayer perceptron , 2022, Current Bioinformatics.

[20]  D. Altmann,et al.  COVID-19 vaccination: The road ahead , 2022, Science.

[21]  J. Knight,et al.  SARS-CoV-2-specific antibody and T-cell responses 1 year after infection in people recovered from COVID-19: a longitudinal cohort study , 2022, The Lancet Microbe.

[22]  A. Mechaly,et al.  A Novel Quantitative Multi-Component Serological Assay for SARS-CoV-2 Vaccine Evaluation , 2022, Analytical chemistry.

[23]  Xinxin Zhang,et al.  SARS-CoV-2 vaccine research and immunization strategies for improved control of the COVID-19 pandemic , 2022, Frontiers of Medicine.

[24]  S. Whelan,et al.  Germinal centre-driven maturation of B cell response to mRNA vaccination , 2022, Nature.

[25]  S. Zeger,et al.  Duration of effectiveness of vaccines against SARS-CoV-2 infection and COVID-19 disease: results of a systematic review and meta-regression , 2022, The Lancet.

[26]  J. Suh,et al.  Comparing SARS-CoV-2 Antibody Responses after Various COVID-19 Vaccinations in Healthcare Workers , 2022, Vaccines.

[27]  K. Zheng,et al.  mRNA Vaccine: How to Meet the Challenge of SARS-CoV-2 , 2022, Frontiers in Immunology.

[28]  C. Zheng,et al.  Omicron variant of SARS‐CoV‐2: Genomics, transmissibility, and responses to current COVID‐19 vaccines , 2022, Journal of medical virology.

[29]  A. Linē,et al.  Early and strong antibody responses to SARS-CoV-2 predict disease severity in COVID-19 patients , 2022, Journal of translational medicine.

[30]  Vikram Thakur,et al.  OMICRON (B.1.1.529): A new SARS‐CoV‐2 variant of concern mounting worldwide fear , 2021, Journal of medical virology.

[31]  D. Easton,et al.  Covid-19 Vaccine Effectiveness in New York State , 2021, The New England journal of medicine.

[32]  H. Scheiblauer,et al.  Antibody response to SARS-CoV-2 for more than one year − kinetics and persistence of detection are predominantly determined by avidity progression and test design , 2021, Journal of Clinical Virology.

[33]  J. Koeleman,et al.  SARS-CoV-2 antibody and T cell responses one year after COVID-19 and the booster effect of vaccination: A prospective cohort study , 2021, Journal of Infection.

[34]  Yicheng Guo,et al.  A monoclonal antibody that neutralizes SARS-CoV-2 variants, SARS-CoV, and other sarbecoviruses , 2021, Emerging microbes & infections.

[35]  S. Schubl,et al.  Distinct SARS-CoV-2 antibody reactivity patterns elicited by natural infection and mRNA vaccination , 2021, NPJ vaccines.

[36]  S. Bernardini,et al.  Antibody response to COVID-19 vaccine: A point of view that can help to optimize dose distribution , 2021, International Immunopharmacology.

[37]  H. Legido-Quigley,et al.  How an outbreak became a pandemic: a chronological analysis of crucial junctures and international obligations in the early months of the COVID-19 pandemic , 2021, The Lancet.

[38]  Y. Kreiss,et al.  Waning Immune Humoral Response to BNT162b2 Covid-19 Vaccine over 6 Months , 2021, The New England journal of medicine.

[39]  I. Suriapranata,et al.  Antibody response to the inactivated SARS-CoV-2 vaccine among healthcare workers, Indonesia , 2021, International Journal of Infectious Diseases.

[40]  M. Takeda Proteolytic activation of SARS‐CoV‐2 spike protein , 2021, Microbiology and immunology.

[41]  P. Dormitzer,et al.  Safety and Efficacy of the BNT162b2 mRNA Covid-19 Vaccine through 6 Months , 2021, The New England journal of medicine.

[42]  Anna E. Merrill,et al.  SARS-CoV-2 antibody changes in patients receiving COVID-19 convalescent plasma from normal and vaccinated donors , 2021, Transfusion and Apheresis Science.

[43]  G. Shurin,et al.  Differential Antibody Response to mRNA COVID-19 Vaccines in Healthy Subjects , 2021, Microbiology spectrum.

[44]  T. Fiolet,et al.  Comparing COVID-19 vaccines for their characteristics, efficacy and effectiveness against SARS-CoV-2 and variants of concern: a narrative review , 2021, Clinical Microbiology and Infection.

[45]  S. Safiri,et al.  Potency of BNT162b2 and mRNA‐1273 vaccine‐induced neutralizing antibodies against severe acute respiratory syndrome‐CoV‐2 variants of concern: A systematic review of in vitro studies , 2021, Reviews in medical virology.

[46]  Tao Zeng,et al.  Identification of Microbiota Biomarkers With Orthologous Gene Annotation for Type 2 Diabetes , 2021, Frontiers in Microbiology.

[47]  Tao Li,et al.  Longitudinal and proteome-wide analyses of antibodies in COVID-19 patients reveal features of the humoral immune response to SARS-CoV-2 , 2021, Journal of Advanced Research.

[48]  Yu-Dong Cai,et al.  Identifying Protein Subcellular Locations With Embeddings-Based node2loc , 2021, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[49]  A. Mantovani,et al.  One dose of SARS-CoV-2 vaccine exponentially increases antibodies in recovered individuals with symptomatic COVID-19. , 2021, The Journal of clinical investigation.

[50]  J. Rossowska,et al.  Immune Response to COVID-19 mRNA Vaccine—A Pilot Study , 2021, Vaccines.

[51]  B. Chain,et al.  Prior SARS-CoV-2 infection rescues B and T cell responses to variants after first vaccine dose , 2021, Science.

[52]  Qiu Sun,et al.  Antibodies and Vaccines Target RBD of SARS-CoV-2 , 2021, Frontiers in Molecular Biosciences.

[53]  Ahmed Abdul Azim,et al.  Immunogenicity of the Ad26.COV2.S Vaccine for COVID-19. , 2021, JAMA.

[54]  M. Nussenzweig,et al.  mRNA vaccine-elicited antibodies to SARS-CoV-2 and circulating variants , 2021, Nature.

[55]  D. Klonoff,et al.  COVID-19 vaccines: comparison of biological, pharmacological characteristics and adverse effects of Pfizer/BioNTech and Moderna Vaccines. , 2021, European review for medical and pharmacological sciences.

[56]  H. Leier,et al.  Cross-reactivity of SARS-CoV structural protein antibodies against SARS-CoV-2 , 2021, Cell Reports.

[57]  M. Battegay,et al.  Analysis of SARS-CoV-2 antibodies in COVID-19 convalescent blood using a coronavirus antigen microarray , 2021, Nature communications.

[58]  P. Dormitzer,et al.  Safety and Efficacy of the BNT162b2 mRNA Covid-19 Vaccine , 2020, The New England journal of medicine.

[59]  J. Mascola,et al.  Durability of Responses after SARS-CoV-2 mRNA-1273 Vaccination , 2020, The New England journal of medicine.

[60]  Tao Zeng,et al.  Distinguishing Glioblastoma Subtypes by Methylation Signatures , 2020, Frontiers in Genetics.

[61]  V. Thiel,et al.  Coronavirus biology and replication: implications for SARS-CoV-2 , 2020, Nature Reviews Microbiology.

[62]  Min Zheng,et al.  A systematic review of SARS-CoV-2 vaccine candidates , 2020, Signal Transduction and Targeted Therapy.

[63]  S. Bartell,et al.  Estimated seroprevalence of SARS-CoV-2 antibodies among adults in Orange County, California , 2020, Scientific Reports.

[64]  P. Dormitzer,et al.  COVID-19 vaccine BNT162b1 elicits human antibody and TH1 T cell responses , 2020, Nature.

[65]  A. Parasher COVID-19: Current understanding of its Pathophysiology, Clinical presentation and Treatment , 2020, Postgraduate Medical Journal.

[66]  M. Busch,et al.  Distinct SARS-CoV-2 antibody reactivity patterns in coronavirus convalescent plasma revealed by a coronavirus antigen microarray , 2020, Scientific Reports.

[67]  S. Kang,et al.  From SARS to SARS-CoV-2, insights on structure, pathogenicity and immunity aspects of pandemic human coronaviruses , 2020, Infection, Genetics and Evolution.

[68]  A. Munasinghe,et al.  SARS-CoV-2 and the pandemic of COVID-19 , 2020, Postgraduate Medical Journal.

[69]  J. Dye,et al.  Broad neutralization of SARS-related viruses by human monoclonal antibodies , 2020, Science.

[70]  Vineet D. Menachery,et al.  Rapid Generation of Neutralizing Antibody Responses in COVID-19 Patients , 2020, Cell Reports Medicine.

[71]  K. Dev,et al.  Challenges and prospects of COVID‐19 vaccine development based on the progress made in SARS and MERS vaccine development , 2020, Transboundary and emerging diseases.

[72]  J. Peiris,et al.  Cross-reactive Antibody Response between SARS-CoV-2 and SARS-CoV Infections , 2020, Cell Reports.

[73]  M. Battegay,et al.  Analysis of SARS-CoV-2 Antibodies in COVID-19 Convalescent Blood using a Coronavirus Antigen Microarray , 2020, Nature Communications.

[74]  J. Xiang,et al.  Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study , 2020, The Lancet.

[75]  M. Letko,et al.  Functional assessment of cell entry and receptor usage for SARS-CoV-2 and other lineage B betacoronaviruses , 2020, Nature Microbiology.

[76]  W. Liu,et al.  Learning from the Past: Possible Urgent Prevention and Treatment Options for Severe Acute Respiratory Infections Caused by 2019‐nCoV , 2020, Chembiochem : a European journal of chemical biology.

[77]  Xiangbo Chen,et al.  Evaluation of Plasma Extracellular Vesicle MicroRNA Signatures for Lung Adenocarcinoma and Granuloma With Monte-Carlo Feature Selection Method , 2019, Front. Genet..

[78]  Lei Chen,et al.  Identification of gene expression signatures across different types of neural stem cells with the Monte‐Carlo feature selection method , 2018, Journal of cellular biochemistry.

[79]  Tie-Yan Liu,et al.  LightGBM: A Highly Efficient Gradient Boosting Decision Tree , 2017, NIPS.

[80]  Moon-Hyang Park,et al.  Clear Cell Carcinomas of the Ovary: A Multi-Institutional Study of 129 Cases in Korea With Prognostic Significance of Emi1 and Galectin-3 , 2013, International journal of gynecological pathology : official journal of the International Society of Gynecological Pathologists.

[81]  R. Tibshirani,et al.  Regression shrinkage and selection via the lasso: a retrospective , 2011 .

[82]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[83]  Jan Gorodkin,et al.  Comparing two K-category assignments by a K-category correlation coefficient , 2004, Comput. Biol. Chem..

[84]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[85]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[86]  C. Baird,et al.  The pilot study. , 2000, Orthopedic nursing.

[87]  Huan Liu,et al.  Incremental Feature Selection , 1998, Applied Intelligence.

[88]  L. Breiman Better subset regression using the nonnegative garrote , 1995 .

[89]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[90]  David A. Landgrebe,et al.  A survey of decision tree classifier methodology , 1991, IEEE Trans. Syst. Man Cybern..

[91]  OUP accepted manuscript , 2022, Age And Ageing.

[92]  Jan Komorowski,et al.  BIOINFORMATICS ORIGINAL PAPER doi:10.1093/bioinformatics/btm486 Data and text mining Monte Carlo , 2022 .

[93]  Vladimir Vapnik,et al.  Support-vector networks , 2004, Machine Learning.

[94]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[95]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.