Immunoinformatics and epitope prediction in the age of genomic medicine

Immunoinformatics involves the application of computational methods to immunological problems. Prediction of B- and T-cell epitopes has long been the focus of immunoinformatics, given the potential translational implications, and many tools have been developed. With the advent of next-generation sequencing (NGS) methods, an unprecedented wealth of information has become available that requires more-advanced immunoinformatics tools. Based on information from whole-genome sequencing, exome sequencing and RNA sequencing, it is possible to characterize with high accuracy an individual’s human leukocyte antigen (HLA) allotype (i.e., the individual set of HLA alleles of the patient), as well as changes arising in the HLA ligandome (the collection of peptides presented by the HLA) owing to genomic variation. This has allowed new opportunities for translational applications of epitope prediction, such as epitope-based design of prophylactic and therapeutic vaccines, and personalized cancer immunotherapies. Here, we review a wide range of immunoinformatics tools, with a focus on B- and T-cell epitope prediction. We also highlight fundamental differences in the underlying algorithms and discuss the various metrics employed to assess prediction quality, comparing their strengths and weaknesses. Finally, we discuss the new challenges and opportunities presented by high-throughput data-sets for the field of epitope prediction.

[1]  P. van Endert,et al.  Differential proteasomal processing of hydrophobic and hydrophilic protein regions: Contribution to cytotoxic T lymphocyte epitope clustering in HIV-1-Nef , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[2]  S. Brunak,et al.  Predicting proteasomal cleavage sites: a comparison of available methods. , 2003, International immunology.

[3]  A Sette,et al.  Two complementary methods for predicting peptides binding major histocompatibility complex molecules. , 1997, Journal of molecular biology.

[4]  Vasant Honavar,et al.  Predicting flexible length linear B-cell epitopes. , 2008, Computational systems bioinformatics. Computational Systems Bioinformatics Conference.

[5]  Jérôme Lane,et al.  IMGT®, the international ImMunoGeneTics information system® , 2004, Nucleic Acids Res..

[6]  Oliver Kohlbacher,et al.  A Mathematical Framework for the Selection of an Optimal Set of Peptides for Epitope-Based Vaccines , 2008, PLoS Comput. Biol..

[7]  Morten Nielsen,et al.  Peptide‐MHC class I stability is a better predictor than peptide affinity of CTL immunogenicity , 2012, European journal of immunology.

[8]  Pingping Guan,et al.  EpiJen: a server for multistep T cell epitope prediction , 2006, BMC Bioinformatics.

[9]  K. Parker,et al.  Scheme for ranking potential HLA-A2 binding peptides based on independent binding of individual peptide side-chains. , 1994, Journal of immunology.

[10]  V. Brusic,et al.  Evaluation of MHC class I peptide binding prediction servers: Applications for vaccine research , 2008, BMC Immunology.

[11]  Morten Nielsen,et al.  The PickPocket method for predicting binding specificities for receptors based on receptor pocket similarities: application to MHC-peptide binding , 2009, Bioinform..

[12]  Nora C. Toussaint,et al.  Towards in silico design of epitope-based vaccines , 2009, Expert opinion on drug discovery.

[13]  张静,et al.  Banana Ovate family protein MaOFP1 and MADS-box protein MuMADS1 antagonistically regulated banana fruit ripening , 2015 .

[14]  Ji Wan,et al.  SVRMHC prediction server for MHC-binding peptides , 2006, BMC Bioinformatics.

[15]  Morten Nielsen,et al.  Automated benchmarking of peptide-MHC class I binding predictions , 2015, Bioinform..

[16]  Morten Nielsen,et al.  Large-scale validation of methods for cytotoxic T-lymphocyte epitope prediction , 2007, BMC Bioinformatics.

[17]  Kun Yu,et al.  Methods for Prediction of Peptide Binding to MHC Molecules: A Comparative Study , 2002, Molecular medicine.

[18]  P. van Endert,et al.  Substrate selection by transporters associated with antigen processing occurs during peptide binding to TAP. , 1998, Molecular immunology.

[19]  Bo Yao,et al.  EPSVR and EPMeta: prediction of antigenic epitopes using support vector regression and multiple server results , 2010, BMC Bioinformatics.

[20]  Can Keşmir,et al.  Role of peptide processing predictions in T cell epitope identification: contribution of different prediction programs , 2014, Immunogenetics.

[21]  Oliver Kohlbacher,et al.  Multiple Instance Learning Allows MHC Class II Epitope Predictions Across Alleles , 2008, WABI.

[22]  Pierre Baldi,et al.  COBEpro: a novel system for predicting continuous B-cell epitopes. , 2009, Protein engineering, design & selection : PEDS.

[23]  Alessandro Sette,et al.  Properties of MHC Class I Presented Peptides That Enhance Immunogenicity , 2013, PLoS Comput. Biol..

[24]  Channa K. Hattotuwagama,et al.  AntiJen: a quantitative immunology database integrating functional, thermodynamic, kinetic, biophysical, and cellular data , 2005, Immunome research.

[25]  Kenny Q. Ye,et al.  An integrated map of genetic variation from 1,092 human genomes , 2012, Nature.

[26]  Morten Nielsen,et al.  Modeling the adaptive immune system: predictions and simulations , 2007, Bioinform..

[27]  P. Kloetzel,et al.  A theoretical approach towards the identification of cleavage-determining amino acid motifs of the 20 S proteasome. , 1999, Journal of molecular biology.

[28]  Søren B. Padkjær,et al.  Structural analysis of B-cell epitopes in antibody:protein complexes. , 2013, Molecular immunology.

[29]  Oliver Kohlbacher,et al.  FRED—a framework for T-cell epitope detection , 2009, Bioinform..

[30]  Benjamin Schubert,et al.  OptiType: precision HLA typing from next-generation sequencing data , 2014, Bioinform..

[32]  Deborah Hix,et al.  The immune epitope database (IEDB) 3.0 , 2014, Nucleic Acids Res..

[33]  Jean-Philippe Vert,et al.  Efficient peptide-MHC-I binding prediction for alleles with few known binders , 2008, Bioinform..

[34]  James Robinson,et al.  The IPD and IMGT/HLA database: allele variant databases , 2014, Nucleic Acids Res..

[35]  Morten Nielsen,et al.  A Community Resource Benchmarking Predictions of Peptide Binding to MHC-I Molecules , 2006, PLoS Comput. Biol..

[36]  John Sidney,et al.  A Systematic Assessment of MHC Class II Peptide Binding Predictions and Evaluation of a Consensus Approach , 2008, PLoS Comput. Biol..

[37]  L Adorini,et al.  Structural requirements for the interaction between class II MHC molecules and peptide antigens , 1990, Immunologic research.

[38]  Xiaodi Huang,et al.  MHC2MIL: a novel multiple instance learning based method for MHC-II peptide binding prediction by considering peptide flanking region and residue positions , 2014, BMC Genomics.

[39]  Morten Nielsen,et al.  NN-align. An artificial neural network-based alignment algorithm for MHC class II peptide binding prediction , 2009, BMC Bioinformatics.

[40]  E. Reinherz,et al.  Prediction of MHC class I binding peptides using profile motifs. , 2002, Human immunology.

[41]  R. Lloyd,et al.  Structural Determinants for Specific Recognition by T4 Endonuclease V* , 1996, The Journal of Biological Chemistry.

[42]  K. Cibulskis,et al.  Comprehensive analysis of cancer-associated somatic mutations in class I HLA genes , 2015, Nature Biotechnology.

[43]  Hau-San Wong,et al.  TEPITOPEpan: Extending TEPITOPE for Peptide Binding Prediction Covering over 700 HLA-DR Molecules , 2012, PloS one.

[44]  Sneh Lata,et al.  MHCBN 4.0: A database of MHC/TAP binding peptides and T-cell epitopes , 2009, BMC Research Notes.

[45]  V Brusic,et al.  Relationship between peptide selectivities of human transporters associated with antigen processing and HLA class I molecules. , 1998, Journal of immunology.

[46]  Syed Haider,et al.  International Cancer Genome Consortium Data Portal—a one-stop shop for cancer genomics data , 2011, Database J. Biol. Databases Curation.

[47]  Vladimir Brusic,et al.  Dana-Farber repository for machine learning in immunology. , 2011, Journal of immunological methods.

[48]  M. Nielsen,et al.  NetMHCstab – predicting stability of peptide–MHC‐I complexes; impacts for cytotoxic T lymphocyte epitope discovery , 2014, Immunology.

[49]  Morten Nielsen,et al.  Pan-specific MHC class I predictors: a benchmark of HLA class I pan-specific prediction methods , 2009, Bioinform..

[50]  Arne Elofsson,et al.  Prediction of MHC class I binding peptides, using SVMHC , 2002, BMC Bioinformatics.

[51]  Morten Nielsen,et al.  Prediction of MHC class II binding affinity using SMM-align, a novel stabilization matrix alignment method , 2007, BMC Bioinformatics.

[52]  Chee Keong Kwoh,et al.  PREDTAP: a system for prediction of peptide binding to the human transporter associated with antigen processing , 2006, Immunome research.

[53]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[54]  Christina Kuttler An Algorithm for the Prediction of Proteasomal Cleavages , 2000, German Conference on Bioinformatics.

[55]  H. Rammensee,et al.  The regulatory landscape for actively personalized cancer immunotherapies , 2013, Nature Biotechnology.

[56]  Gajendra P. S. Raghava,et al.  Pcleavage: an SVM based method for prediction of constitutive proteasome and immunoproteasome cleavage sites in antigenic sequences , 2005, Nucleic Acids Res..

[57]  T. Hanai,et al.  Hidden Markov model-based prediction of antigenic peptides that interact with MHC class II molecules. , 2002, Journal of bioscience and bioengineering.

[58]  M. Zody,et al.  ATHLATES: accurate typing of human leukocyte antigen through exome sequencing , 2013, Nucleic acids research.

[59]  Uthaman Gowthaman,et al.  In silico tools for predicting peptides binding to HLA-class II molecules: more confusion than conclusion. , 2008, Journal of proteome research.

[60]  Morten Nielsen,et al.  Prediction of epitopes using neural network based methods. , 2011, Journal of immunological methods.

[61]  Gajendra P. S. Raghava,et al.  ProPred: prediction of HLA-DR binding sites , 2001, Bioinform..

[62]  J. Castle,et al.  HLA typing from RNA-Seq sequence reads , 2012, Genome Medicine.

[63]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[64]  O. Lund,et al.  The role of the proteasome in generating cytotoxic T-cell epitopes: insights obtained from improved predictions of proteasomal cleavage , 2005, Immunogenetics.

[65]  Morten Nielsen,et al.  NetMHCcons: a consensus method for the major histocompatibility complex class I predictions , 2011, Immunogenetics.

[66]  Vladimir Brusic,et al.  MULTIPRED: a computational system for prediction of promiscuous HLA binding peptides , 2005, Nucleic Acids Res..

[67]  J. Hammer,et al.  Discovery of promiscuous HLA-II-restricted T cell epitopes with TEPITOPE. , 2004, Methods.

[68]  Yoram Louzoun,et al.  Virus-epitope vaccine design: informatic matching the HLA-I polymorphism to the virus genome. , 2007, Molecular immunology.

[69]  O. Lund,et al.  NetMHCpan, a Method for Quantitative Predictions of Peptide Binding to Any HLA-A and -B Locus Protein of Known Sequence , 2007, PloS one.

[70]  Ora Schueler-Furman,et al.  Learning MHC I - peptide binding , 2006, ISMB.

[71]  Morten Nielsen,et al.  Reliable B Cell Epitope Predictions: Impacts of Method Development and Improved Benchmarking , 2012, PLoS Comput. Biol..

[72]  Magdalini Moutaftsi,et al.  A consensus epitope prediction approach identifies the breadth of murine TCD8+-cell responses to vaccinia virus , 2006, Nature Biotechnology.

[73]  H. Rammensee,et al.  SYFPEITHI: database for MHC ligands and peptide motifs , 1999, Immunogenetics.

[74]  Morten Nielsen,et al.  NetCTLpan: pan-specific MHC class I pathway epitope predictions , 2010, Immunogenetics.

[75]  P. Dönnes,et al.  Integrated modeling of the major events in the MHC class I antigen processing pathway , 2005, Protein science : a publication of the Protein Society.

[76]  O. Lund,et al.  NetMHCIIpan-3.0, a common pan-specific MHC class II prediction method including all three human MHC class II isotypes, HLA-DR, HLA-DP and HLA-DQ , 2013, Immunogenetics.

[77]  Shinn-Ying Ho,et al.  POPISK: T-cell reactivity prediction using support vector machines and string kernels , 2011, BMC Bioinformatics.

[78]  Rocío Romero-Záliz,et al.  PGMRA: a web server for (phenotype × genotype) many-to-many relation analysis in GWAS , 2013, Nucleic Acids Res..

[79]  John Sidney,et al.  Examining the independent binding assumption for binding of peptide epitopes to MHC-I molecules , 2003, Bioinform..

[80]  Nora C. Toussaint,et al.  Universal peptide vaccines - optimal peptide vaccine design based on viral sequence conservation. , 2011, Vaccine.

[81]  H. Erlich,et al.  HLA DNA typing: past, present, and future. , 2012, Tissue antigens.

[82]  Morten Nielsen,et al.  NetMHC-3.0: accurate web accessible predictions of human, mouse and monkey MHC class I affinities for peptides of length 8–11 , 2008, Nucleic Acids Res..

[83]  D. Flower,et al.  Benchmarking B cell epitope prediction: Underperformance of existing methods , 2005, Protein science : a publication of the Protein Society.

[84]  Benjamin Schubert,et al.  EpiToolKit—a web-based workbench for vaccine design , 2015, Bioinform..

[85]  A. Nekrutenko,et al.  Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences , 2010, Genome Biology.

[86]  Vladimir Brusic,et al.  A neural network model approach to the study of human TAP transporter , 1998, Silico Biol..

[87]  Shinn-Ying Ho,et al.  POPI: predicting immunogenicity of MHC class I binding peptides by mining informative physicochemical properties , 2007, Bioinform..

[88]  H. Rammensee,et al.  Allele-specific motifs revealed by sequencing of self-peptides eluted from MHC molecules , 1991, Nature.

[89]  Bo Yao,et al.  Conformational B-Cell Epitope Prediction on Antigen Protein Structures: A Review of Current Algorithms and Comparison with Common Binding Site Prediction Methods , 2013, PloS one.

[90]  Oliver Kohlbacher,et al.  T-cell epitope prediction based on self-tolerance , 2011, BCB '11.

[91]  Morten Nielsen,et al.  Accurate pan-specific prediction of peptide-MHC class II binding affinity with improved binding core identification , 2015, Immunogenetics.