NetMHCpan, a Method for Quantitative Predictions of Peptide Binding to Any HLA-A and -B Locus Protein of Known Sequence

Background Binding of peptides to Major Histocompatibility Complex (MHC) molecules is the single most selective step in the recognition of pathogens by the cellular immune system. The human MHC class I system (HLA-I) is extremely polymorphic. The number of registered HLA-I molecules has now surpassed 1500. Characterizing the specificity of each separately would be a major undertaking. Principal Findings Here, we have drawn on a large database of known peptide-HLA-I interactions to develop a bioinformatics method, which takes both peptide and HLA sequence information into account, and generates quantitative predictions of the affinity of any peptide-HLA-I interaction. Prospective experimental validation of peptides predicted to bind to previously untested HLA-I molecules, cross-validation, and retrospective prediction of known HIV immune epitopes and endogenous presented peptides, all successfully validate this method. We further demonstrate that the method can be applied to perform a clustering analysis of MHC specificities and suggest using this clustering to select particularly informative novel MHC molecules for future biochemical and functional analysis. Conclusions Encompassing all HLA molecules, this high-throughput computational method lends itself to epitope searches that are not only genome- and pathogen-wide, but also HLA-wide. Thus, it offers a truly global analysis of immune responses supporting rational development of vaccines and immunotherapy. It also promises to provide new basic insights into HLA structure-function relationships. The method is available at http://www.cbs.dtu.dk/services/NetMHCpan.

[1]  Fred R. McMorris,et al.  Consensusn-trees , 1981 .

[2]  Alessandro Sette,et al.  Structural characteristics of an antigen required for its interaction with Ia and recognition by T cells , 1987, Nature.

[3]  N. Saitou,et al.  The neighbor-joining method: a new method for reconstructing phylogenetic trees. , 1987, Molecular biology and evolution.

[4]  A Sette,et al.  The relation between major histocompatibility complex (MHC) restriction and the capacity of Ia to bind immunogenic peptides , 1987, Science.

[5]  H. Grey,et al.  Prediction of major histocompatibility complex binding regions of protein antigens by sequence pattern analysis. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[6]  F. A. Seiler,et al.  Numerical Recipes in C: The Art of Scientific Computing , 1989 .

[7]  Hans-Georg Rammensee,et al.  Cellular peptide composition governed by major histocompatibility complex class I molecules , 1990, Nature.

[8]  T. D. Schneider,et al.  Sequence logos: a new way to display consensus sequences. , 1990, Nucleic acids research.

[9]  S. Henikoff,et al.  Amino acid substitution matrices from protein blocks. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[10]  U. Hobohm,et al.  Selection of representative protein data sets , 1992, Protein science : a publication of the Protein Society.

[11]  K. Parker,et al.  Scheme for ranking potential HLA-A2 binding peptides based on independent binding of individual peptide side-chains. , 1994, Journal of immunology.

[12]  R. J. Stonier,et al.  Complex Systems: Mechanism of Adaptation , 1994 .

[13]  J A Koziol,et al.  Prediction of binding to MHC class I molecules. , 1995, Journal of immunological methods.

[14]  Hans-Georg Rammensee,et al.  MHC Ligands and Peptide Motifs , 1998, Molecular Biology Intelligence Unit.

[15]  A Sette,et al.  Two complementary methods for predicting peptides binding major histocompatibility complex molecules. , 1997, Journal of molecular biology.

[16]  Huaiyu Zhu On Information and Sufficiency , 1997 .

[17]  H Mamitsuka,et al.  Predicting peptides that bind to MHC molecules using supervised learning of hidden markov models , 1998, Proteins.

[18]  J. Yewdell,et al.  Immunodominance in major histocompatibility complex class I-restricted T lymphocyte responses. , 1999, Annual review of immunology.

[19]  U. Şahin,et al.  Generation of tissue-specific and promiscuous HLA ligand databases using DNA microarrays and virtual HLA class II matrices , 1999, Nature Biotechnology.

[20]  H. Rammensee,et al.  SYFPEITHI: database for MHC ligands and peptide motifs , 1999, Immunogenetics.

[21]  J. Sidney,et al.  Nine major HLA class I supertypes account for the vast preponderance of HLA-A and -B polymorphism , 1999, Immunogenetics.

[22]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[23]  S Brunak,et al.  Identifying cytotoxic T cell epitopes from genomic and proteomic information: "The human MHC project.". , 2000, Reviews in immunogenetics.

[24]  M. Segal,et al.  Relating Amino Acid Sequence to Phenotype: Analysis of Peptide‐Binding Data , 2000, Biometrics.

[25]  B. Walker,et al.  HIV Molecular Immunology 2001 , 2001 .

[26]  Thomas G. Dietterich,et al.  Bioinformatics The Machine Learning Approach 2nd ed. , 2001 .

[27]  D. Flower,et al.  Toward the quantitative prediction of T-cell epitopes: coMFA and coMSIA studies of peptides with affinity for the class I MHC molecule HLA-A*0201. , 2001, Journal of medicinal chemistry.

[28]  David Bryant,et al.  A classification of consensus methods for phylogenetics , 2001, Bioconsensus.

[29]  Arne Elofsson,et al.  Prediction of MHC class I binding peptides, using SVMHC , 2002, BMC Bioinformatics.

[30]  S L Lauemøller,et al.  Establishment of a quantitative ELISA capable of determining peptide - MHC class I interaction. , 2002, Tissue antigens.

[31]  Vladimir Brusic,et al.  Prediction of promiscuous peptides that bind HLA class I molecules , 2002, Immunology and cell biology.

[32]  E. Reinherz,et al.  Prediction of MHC class I binding peptides using profile motifs. , 2002, Human immunology.

[33]  A. Sette,et al.  Epitope-based vaccines: an update on epitope identification, vaccine design and delivery. , 2003, Current opinion in immunology.

[34]  S Brunak,et al.  Sensitive quantitative predictions of peptide-MHC binding by a 'Query by Committee' artificial neural network approach. , 2003, Tissue antigens.

[35]  Maria Jesus Martin,et al.  The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 , 2003, Nucleic Acids Res..

[36]  O. Lund,et al.  novel sequence representations Reliable prediction of T-cell epitopes using neural networks with , 2003 .

[37]  Vladimir Brusic,et al.  Prediction of class I T-cell epitopes: evidence of presence of immunological hot spots inside antigens , 2004, ISMB/ECCB.

[38]  Simon Parsons,et al.  Bioinformatics: The Machine Learning Approach by P. Baldi and S. Brunak, 2nd edn, MIT Press, 452 pp., $60.00, ISBN 0-262-02506-X , 2004, The Knowledge Engineering Review.

[39]  O. Lund,et al.  Definition of supertypes for HLA molecules using clustering of specificity matrices , 2004, Immunogenetics.

[40]  D. Flower,et al.  Identifiying Human MHC Supertypes Using Bioinformatic Methods , 2004, The Journal of Immunology.

[41]  Søren Brunak,et al.  Improved prediction of MHC class I and class II epitopes using a novel Gibbs sampling approach , 2004, Bioinform..

[42]  P. Kloetzel,et al.  Modeling the MHC class I pathway by combining predictions of proteasomal cleavage,TAP transport and MHC class I binding , 2005, Cellular and Molecular Life Sciences CMLS.

[43]  M. Lefranc IMGT, the international ImMunoGeneTics information system®: a standardized approach for immunogenetics and immunoinformatics , 2005, Immunome research.

[44]  Bjoern Peters,et al.  A roadmap for the immunomics of category A-C pathogens. , 2005, Immunity.

[45]  John Sidney,et al.  Improving MHC binding peptide prediction by incorporating binding data of auxiliary MHC molecules , 2006, Bioinform..

[46]  Bjoern Peters,et al.  Detailed characterization of the peptide binding specificity of five common Patr class I MHC molecules , 2006, Immunogenetics.

[47]  O. Michielin,et al.  Structural prediction of peptides bound to MHC class I. , 2006, Journal of molecular biology.

[48]  Morten Nielsen,et al.  A Community Resource Benchmarking Predictions of Peptide Binding to MHC-I Molecules , 2006, PLoS Comput. Biol..

[49]  R. Abagyan,et al.  Ab initio prediction of peptide‐MHC binding geometry for diverse class I MHC allotypes , 2006, Proteins.

[50]  Thomas Lengauer,et al.  DynaPred: A structure and sequence based method for the prediction of MHC class I binding peptide sequences and conformations , 2006, ISMB.

[51]  Bjoern Peters,et al.  Immune epitope mapping in the post-genomic era: lessons for vaccine development. , 2007, Current opinion in immunology.

[52]  M. Lefranc IMGT, the international ImMunoGeneTics information system for Immunoinformatics. Methods for querying IMGT databases, tools, and Web resources in the context of immunoinformatics. , 2008, Methods in molecular biology.