HLaffy: estimating peptide affinities for Class-1 HLA molecules by learning position-specific pair potentials

MOTIVATION T-cell epitopes serve as molecular keys to initiate adaptive immune responses. Identification of T-cell epitopes is also a key step in rational vaccine design. Most available methods are driven by informatics and are critically dependent on experimentally obtained training data. Analysis of a training set from Immune Epitope Database (IEDB) for several alleles indicates that the sampling of the peptide space is extremely sparse covering a tiny fraction of the possible nonamer space, and also heavily skewed, thus restricting the range of epitope prediction. RESULTS We present a new epitope prediction method that has four distinct computational modules: (i) structural modelling, estimating statistical pair-potentials and constraint derivation, (ii) implicit modelling and interaction profiling, (iii) feature representation and binding affinity prediction and (iv) use of graphical models to extract peptide sequence signatures to predict epitopes for HLA class I alleles. CONCLUSIONS HLaffy is a novel and efficient epitope prediction method that predicts epitopes for any Class-1 HLA allele, by estimating the binding strengths of peptide-HLA complexes which is achieved through learning pair-potentials important for peptide binding. It relies on the strength of the mechanistic understanding of peptide-HLA recognition and provides an estimate of the total ligand space for each allele. The performance of HLaffy is seen to be superior to the currently available methods. AVAILABILITY AND IMPLEMENTATION The method is made accessible through a webserver http://proline.biochem.iisc.ernet.in/HLaffy CONTACT : nchandra@biochem.iisc.ernet.in SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

[1]  C. N. Liu,et al.  Approximating discrete probability distributions with dependence trees , 1968, IEEE Trans. Inf. Theory.

[2]  T. Blundell,et al.  Comparative protein modelling by satisfaction of spatial restraints. , 1993, Journal of molecular biology.

[3]  U. Hobohm,et al.  A pattern search method for putative anchor residues in T cell epitopes , 1993, European journal of immunology.

[4]  K. Parker,et al.  Scheme for ranking potential HLA-A2 binding peptides based on independent binding of individual peptide side-chains. , 1994, Journal of immunology.

[5]  H. Rammensee,et al.  SYFPEITHI: database for MHC ligands and peptide motifs , 1999, Immunogenetics.

[6]  J. Sidney,et al.  Nine major HLA class I supertypes account for the vast preponderance of HLA-A and -B polymorphism , 1999, Immunogenetics.

[7]  O. Schueler‐Furman,et al.  Structure‐based prediction of binding peptides to MHC class I molecules: Application to a broad range of MHC alleles , 2000, Protein science : a publication of the Protein Society.

[8]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[9]  Gajendra P. S. Raghava,et al.  MHCBN: a comprehensive database of MHC binding and non-binding peptides , 2003, Bioinform..

[10]  E. Hewitt The MHC class I antigen presentation pathway: strategies for viral immune evasion , 2003, Immunology.

[11]  D. Flower,et al.  Identifiying Human MHC Supertypes Using Bioinformatic Methods , 2004, The Journal of Immunology.

[12]  Gerrit Groenhof,et al.  GROMACS: Fast, flexible, and free , 2005, J. Comput. Chem..

[13]  Alessandro Sette,et al.  Generating quantitative models describing the sequence specificity of biological processes with the stabilized matrix method , 2005, BMC Bioinformatics.

[14]  Debasisa Mohanty,et al.  MODPROPEP: a program for knowledge-based modeling of protein–peptide complexes , 2007, Nucleic Acids Res..

[15]  Carsten Kutzner,et al.  GROMACS 4:  Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation. , 2008, Journal of chemical theory and computation.

[16]  O. Lund,et al.  NetMHCpan, a method for MHC class I binding prediction beyond humans , 2008, Immunogenetics.

[17]  Morten Nielsen,et al.  Accurate approximation method for prediction of class I MHC affinities for peptides of length 8, 10 and 11 using prediction tools trained on 9mers , 2008, Bioinform..

[18]  Pedro A Reche,et al.  Prediction of MHC-peptide binding: a systematic and comprehensive overview. , 2009, Current pharmaceutical design.

[19]  Thomas Lengauer,et al.  Predicting MHC class I epitopes in large datasets , 2010, BMC Bioinformatics.

[20]  Roland L. Dunbrack,et al.  proteins STRUCTURE O FUNCTION O BIOINFORMATICS Improved prediction of protein side-chain conformations with SCWRL4 , 2022 .

[21]  James Robinson,et al.  The IMGT/HLA database , 2008, Nucleic Acids Res..

[22]  Morten Nielsen,et al.  Major histocompatibility complex class I binding predictions as a tool in epitope discovery , 2010, Immunology.

[23]  Alessandro Sette,et al.  The Immune Epitope Database 2.0 , 2009, Nucleic Acids Res..

[24]  Chen Yanover,et al.  Large-scale characterization of peptide-MHC binding landscapes with structural simulations , 2011, Proceedings of the National Academy of Sciences.