Shift-Invariant Adaptive Double Threading: Learning MHC II - Peptide Binding

The major histocompatibility complex (MHC) plays important roles in the workings of the human immune system. Specificity of MHC binding to peptide fragments from cellular and pathogens' proteins has been found to correlate with disease outcome and pathogen or cancer evolution. In this paper we propose a novel approach to predicting binding configurations and energies for MHC class II molecules, whose epitopes are generally predicted less well than the MHC I epitopes due in part to larger variation in bound peptide length. We treat the relative position of the peptide as a hidden variable, and model the ensemble of different binding configurations, rather than use a separate alignment procedure to narrow it down to one. Thus, our predictor infers a distribution over peptide positions from the MHC II and peptide sequences, and computes the total binding affinity. The training procedure iterates the predictions with re-estimation of the parameters of the binding groove model. For a given relative peptide position, any MHC class I prediction model can be used. Here we choose the physics based model of Jojic et al. (2006). We show that the parameters of the binding model can be learned efficiently from the training data and then used to estimate binding energies for previously untested peptides. Our technique performs on par with previous approaches to MHC II epitope prediction. Furthermore, our model choice allows generalization to new MHC class II alleles, which were not a part of the training set.

[1]  Arne Elofsson,et al.  Prediction of MHC class I binding peptides, using SVMHC , 2002, BMC Bioinformatics.

[2]  Ellis L. Reinherz,et al.  Enhancement to the RANKPEP resource for the prediction of peptide binding to MHC molecules using profiles , 2004, Immunogenetics.

[3]  U. Şahin,et al.  Generation of tissue-specific and promiscuous HLA ligand databases using DNA microarrays and virtual HLA class II matrices , 1999, Nature Biotechnology.

[4]  Yang Dai,et al.  Prediction of MHC class II binding peptides based on an iterative learning model , 2005, Immunome research.

[5]  Philip E. Bourne,et al.  Curation of complex, context-dependent immunological data , 2006, BMC Bioinformatics.

[6]  Celia A Schiffer,et al.  Lack of synergy for inhibitors targeting a multi‐drug‐resistant HIV‐1 protease , 2002, Protein science : a publication of the Protein Society.

[7]  Ora Schueler-Furman,et al.  Learning MHC I - peptide binding , 2006, ISMB.

[8]  S Brunak,et al.  Sensitive quantitative predictions of peptide-MHC binding by a 'Query by Committee' artificial neural network approach. , 2003, Tissue antigens.

[9]  Claude Beazley,et al.  A Novel Predictive Technique for the MHC Class II Peptide-Binding Interaction , 2003, Molecular medicine.

[10]  Vladimir Brusic,et al.  MHCPEP, a database of MHC-binding peptides: update 1996 , 1997, Nucleic Acids Res..

[11]  Darren R. Flower,et al.  Predicting Class II MHC-Peptide binding: a kernel based approach using similarity scores , 2006, BMC Bioinformatics.

[12]  Vladimir Brusic,et al.  Prediction of MHC class II-binding peptides using an evolutionary algorithm and artificial neural network , 1998, Bioinform..

[13]  D. T. Jones,et al.  A new approach to protein fold recognition , 1992, Nature.

[14]  M F del Guercio,et al.  Several common HLA-DR types share largely overlapping peptide binding repertoires. , 1998, Journal of immunology.

[15]  Gajendra P. S. Raghava,et al.  MHCBN: a comprehensive database of MHC binding and non-binding peptides , 2003, Bioinform..

[16]  Brendan J. Frey,et al.  Topographic Transformation as a Discrete Latent Variable , 1999, NIPS.

[17]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[18]  Qing Zhang,et al.  The Molecular Biology Toolkit (MBT): a modular platform for developing molecular visualization applications , 2005, BMC Bioinformatics.

[19]  Wilfred W. Li,et al.  MEME: discovering and analyzing DNA and protein sequence motifs , 2006, Nucleic Acids Res..

[20]  Morten Nielsen,et al.  Prediction of MHC class II binding affinity using SMM-align, a novel stabilization matrix alignment method , 2007, BMC Bioinformatics.

[21]  O. Schueler‐Furman,et al.  Structure‐based prediction of binding peptides to MHC class I molecules: Application to a broad range of MHC alleles , 2000, Protein science : a publication of the Protein Society.

[22]  Søren Brunak,et al.  Improved prediction of MHC class I and class II epitopes using a novel Gibbs sampling approach , 2004, Bioinform..

[23]  Tomer Hertz,et al.  PepDist: A New Framework for Protein-Peptide Binding Prediction based on Learning Peptide Distance Functions , 2006, BMC Bioinformatics.

[24]  H. Rammensee,et al.  SYFPEITHI: database for MHC ligands and peptide motifs , 1999, Immunogenetics.

[25]  Jan Engberg,et al.  Visualization of Myelin Basic Protein (Mbp) T Cell Epitopes in Multiple Sclerosis Lesions Using a Monoclonal Antibody Specific for the Human Histocompatibility Leukocyte Antigen (Hla)-Dr2–Mbp 85–99 Complex , 2000, The Journal of experimental medicine.

[26]  Derin B Keskin,et al.  Peptide 15-mers of defined sequence that substitute for random amino acid copolymers in amelioration of experimental autoimmune encephalomyelitis. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[27]  J. Drijfhout,et al.  HLA-DR binding analysis of peptides from islet antigens in IDDM. , 1998, Diabetes.

[28]  Jianming Shi,et al.  Prediction of MHC class II binders using the ant colony search strategy , 2005, Artif. Intell. Medicine.

[29]  A. Sali,et al.  Statistical potentials for fold assessment , 2009 .

[30]  T. Elliott,et al.  Poor correspondence between predicted and experimental binding of peptides to class I MHC molecules. , 2000, Tissue antigens.