Computationally Optimized SARS-CoV-2 MHC Class I and II Vaccine Formulations Predicted to Target Human Haplotype Distributions

Summary We present a combinatorial machine learning method to evaluate and optimize peptide vaccine formulations for SARS-CoV-2. Our approach optimizes the presentation likelihood of a diverse set of vaccine peptides conditioned on a target human-population HLA haplotype distribution and expected epitope drift. Our proposed SARS-CoV-2 MHC class I vaccine formulations provide 93.21% predicted population coverage with at least five vaccine peptide-HLA average hits per person (≥ 1 peptide: 99.91%) with all vaccine peptides perfectly conserved across 4,690 geographically sampled SARS-CoV-2 genomes. Our proposed MHC class II vaccine formulations provide 97.21% predicted coverage with at least five vaccine peptide-HLA average hits per person with all peptides having an observed mutation probability of ≤ 0.001. We provide an open-source implementation of our design methods (OptiVax), vaccine evaluation tool (EvalVax), as well as the data used in our design efforts here: https://github.com/gifford-lab/optivax.

[1]  Dewi Harjanto,et al.  Sequence-based prediction of vaccine targets for inducing T cell responses to SARS-CoV-2 utilizing the bioinformatics predictor RECON , 2020, bioRxiv.

[2]  J. Greenbaum,et al.  Improved methods for predicting peptide binding affinity to MHC class II molecules , 2018, Immunology.

[3]  A. Walls,et al.  Structure, Function, and Antigenicity of the SARS-CoV-2 Spike Glycoprotein , 2020, Cell.

[4]  A. Alam,et al.  Design of an Epitope-Based Peptide Vaccine against the Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2): A Vaccine-informatics Approach , 2020, bioRxiv.

[5]  John Sidney,et al.  Predicting population coverage of T-cell epitope-based diagnostics and vaccines , 2006, BMC Bioinformatics.

[6]  The UniProt Consortium,et al.  UniProt: a worldwide hub of protein knowledge , 2018, Nucleic Acids Res..

[7]  Xingyi Ge,et al.  A Unique Protease Cleavage Site Predicted in the Spike Protein of the Novel Pneumonia Coronavirus (2019-nCoV) Potentially Related to Viral Transmissibility , 2020, Virologica Sinica.

[8]  Chloe H. Lee,et al.  In silico identification of vaccine targets for 2019-nCoV , 2020, F1000Research.

[9]  Morten Nielsen,et al.  NetMHCpan-4.1 and NetMHCIIpan-4.0: improved predictions of MHC antigen presentation by concurrent motif deconvolution and integration of MS MHC eluted ligand data , 2020, Nucleic Acids Res..

[10]  B. Canard,et al.  The spike glycoprotein of the new coronavirus 2019-nCoV contains a furin-like cleavage site absent in CoV of the same clade , 2020, Antiviral Research.

[11]  Morten Nielsen,et al.  Improved prediction of MHC II antigen presentation through integration and motif deconvolution of mass spectrometry MHC eluted ligand data. , 2020, Journal of proteome research.

[12]  J. Greenbaum,et al.  Targets of T Cell Responses to SARS-CoV-2 Coronavirus in Humans with COVID-19 Disease and Unexposed Individuals , 2020, Cell.

[13]  J Sidney,et al.  An effective CTL peptide vaccine for Ebola Zaire Based on Survivors’ CD8+ targeting of a particular nucleocapsid protein epitope with potential implications for COVID-19 vaccine design , 2020, bioRxiv.

[14]  Lingling Chen,et al.  Designing of a next generation multiepitope based vaccine (MEV) against SARS-COV-2: Immunoinformatics and in silico approaches , 2020, bioRxiv.

[15]  Sajjad Ahmad,et al.  Immuno-informatics Characterization SARS-CoV-2 Spike Glycoprotein for Prioritization of Epitope based Multivalent Peptide Vaccine , 2020, bioRxiv.

[16]  A. Banerjee,et al.  Energetics based epitope screening in SARS CoV-2 (COVID 19) spike glycoprotein by Immuno-informatic analysis aiming to a suitable vaccine development , 2020, bioRxiv.

[17]  B. Graham,et al.  Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation , 2020, Science.

[18]  Uri Laserson,et al.  A model of antigen processing improves prediction of MHC I-presented peptides , 2020 .

[19]  Mujahed I. Mustafa,et al.  Design of multi epitope-based peptide vaccine against E protein of human COVID-19: An immunoinformatics approach , 2020, bioRxiv.

[20]  Nikolaos G. Sgourakis,et al.  Structure-based modeling of SARS-CoV-2 peptide/HLA-A02 antigens , 2020, bioRxiv.

[21]  James McCluskey,et al.  HLA Peptide Length Preferences Control CD8+ T Cell Responses , 2013, The Journal of Immunology.

[22]  V. Arumugaswami,et al.  Insights into Cross-species Evolution of Novel Human Coronavirus SARS-CoV-2 and Defining Immune Determinants for Vaccine Development , 2020, bioRxiv.

[23]  Tim O’Donnell Improved predictive models of peptide presentation on MHC I , 2020 .

[24]  O. Lund,et al.  novel sequence representations Reliable prediction of T-cell epitopes using neural networks with , 2003 .

[25]  Leonard Moise,et al.  iVAX: An integrated toolkit for the selection and optimization of antigens and the design of epitope-driven vaccines , 2015, Human vaccines & immunotherapeutics.

[26]  R. Mishra,et al.  Identification of potential vaccine candidates against SARS-CoV-2, A step forward to fight COVID-19: A Reverse Vaccinology Approach , 2020, bioRxiv.

[27]  M. Wolfert,et al.  Adaptive immune activation: glycosylation does matter. , 2013, Nature chemical biology.

[28]  S. Bose,et al.  Immunoinformatics‐aided identification of T cell and B cell epitopes in the surface glycoprotein of 2019‐nCoV , 2020, Journal of medical virology.

[29]  Morten Nielsen,et al.  T Cell Epitope Predictions. , 2020, Annual review of immunology.

[30]  Ashutosh Kumar Singh,et al.  Designing a multi-epitope peptide based vaccine against SARS-CoV-2 , 2020, Scientific Reports.

[31]  Alex Rubinsteyn,et al.  MHCflurry: Open-Source Class I MHC Binding Affinity Prediction. , 2018, Cell systems.

[32]  M. Hasan,et al.  Genome based Evolutionary study of SARS-CoV-2 towards the Prediction of Epitope Based Chimeric Vaccine , 2020, bioRxiv.

[33]  B. Prasad,et al.  In silico approach for designing of a multi-epitope based vaccine against novel Coronavirus (SARS-COV-2) , 2020, bioRxiv.

[34]  Russ B. Altman,et al.  Potential T-cell and B-cell Epitopes of 2019-nCoV , 2020, bioRxiv.

[35]  Yan Fu,et al.  Cryo-EM Studies of Virus-Antibody Immune Complexes , 2020, Virologica Sinica.

[36]  J. Maris,et al.  Identification of SARS-CoV-2 Vaccine Epitopes Predicted to Induce Long-Term Population-Scale Immunity , 2020, Cell Reports Medicine.

[37]  M. Nielsen,et al.  NetMHCpan-4.0: Improved Peptide–MHC Class I Interaction Predictions Integrating Eluted Ligand and Peptide Binding Affinity Data , 2017, The Journal of Immunology.

[38]  M. Kolbe,et al.  Structural basis to design multi-epitope vaccines against Novel Coronavirus 19 (COVID19) infection, the ongoing pandemic emergency: an in silico approach , 2020, bioRxiv.

[39]  M. Nielsen,et al.  The Length Distribution of Class I–Restricted T Cell Epitopes Is Determined by Both Peptide Supply and MHC Allele–Specific Binding Preference , 2016, The Journal of Immunology.

[40]  C. Chakraborty,et al.  Development of epitope‐based peptide vaccine against novel coronavirus 2019 (SARS‐COV‐2): Immunoinformatics approach , 2020, Journal of medical virology.

[41]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[42]  R. Valadan,et al.  Design an Efficient Multi-Epitope Peptide Vaccine Candidate Against SARS-CoV-2: An in silico Analysis , 2020, bioRxiv.

[43]  Ole Winther,et al.  COVID-19 vaccine candidates: Prediction and validation of 174 novel SARS-CoV-2 epitopes , 2020 .

[44]  A. Vitiello,et al.  The relationship between class I binding affinity and immunogenicity of potential cytotoxic T cell epitopes. , 1994, Journal of immunology.

[45]  R. Hilgenfeld,et al.  Crystal structure of SARS-CoV-2 main protease provides a basis for design of improved α-ketoamide inhibitors , 2020, Science.

[46]  R. Scheuermann,et al.  A Sequence Homology and Bioinformatic Approach Can Predict Candidate Targets for Immune Responses to SARS-CoV-2 , 2020, Cell Host & Microbe.

[47]  Haoyang Zeng,et al.  Quantification of Uncertainty in Peptide-MHC Binding Prediction Improves High-Affinity Peptide Selection for Therapeutic Design. , 2019, Cell systems.

[48]  S Brunak,et al.  Sensitive quantitative predictions of peptide-MHC binding by a 'Query by Committee' artificial neural network approach. , 2003, Tissue antigens.

[49]  L. Excoffier,et al.  Maximum-likelihood estimation of molecular haplotype frequencies in a diploid population. , 1995, Molecular biology and evolution.

[50]  Stefan Elbe,et al.  Data, disease and diplomacy: GISAID's innovative contribution to global health , 2017, Global challenges.

[51]  Jingqiu Cheng,et al.  Site-specific N-glycosylation Characterization of Recombinant SARS-CoV-2 Spike Proteins , 2020, bioRxiv.

[52]  Y. Vashi,et al.  Understanding the B and T cells epitopes of spike protein of severe respiratory syndrome coronavirus-2: A computational way to predict the immunogens , 2020, bioRxiv.

[53]  William S. Lane,et al.  Predominant naturally processed peptides bound to HLA-DR1 are derived from MHC-related molecules and are heterogeneous in size , 1992, Nature.

[54]  M. Schwartz,et al.  The coding capacity of SARS-CoV-2 , 2020, Nature.

[55]  Ole Winther,et al.  COVID-19 Vaccine Candidates: Prediction and Validation of 174 SARS-CoV-2 Epitopes , 2020, bioRxiv.

[56]  Syed Faraz Ahmed,et al.  Preliminary Identification of Potential Vaccine Targets for the COVID-19 Coronavirus (SARS-CoV-2) Based on SARS-CoV Immunological Studies , 2020, Viruses.

[57]  Wolfgang Helmberg,et al.  The sequencing-based typing tool of dbMHC: typing highly polymorphic gene sequences , 2004, Nucleic Acids Res..

[58]  An effective CTL peptide vaccine for Ebola Zaire Based on Survivors’ CD8+ targeting of a particular nucleocapsid protein epitope with potential implications for COVID-19 vaccine design , 2020, Vaccine.

[59]  Bjoern Peters,et al.  Most viral peptides displayed by class I MHC on infected cells are immunogenic , 2019, Proceedings of the National Academy of Sciences.

[60]  Trevor Bedford,et al.  Nextstrain: real-time tracking of pathogen evolution , 2017, bioRxiv.