Predicting the binding affinity of MHC class II peptides.

MHC (Major Histocompatibility Complex) proteins are categorized under the heterodimeric integral membrane proteins. The MHC molecules are divided into 2 subclasses, class I and class II. Two classes differ from each other in size of their binding pockets. Predicting the affinity of these peptides is important for vaccine design. It is also vital for understanding the roles of immune system in various diseases. Due to the variability of the locations of the class II peptide binding cores, predicting the affinity of these peptides is difficult. In this paper, we proposed a new method for predicting the affinity of the MHC Class II binding peptides based on their sequences. Our method classifies peptides as binding and non-binding. Our prediction method is based on a 3-step algorithm. In the first step we identify the informative n-grams based on their frequencies for both classes. In the next step, the alphabet size is reduced. At the last step, by utilizing the informative n-grams, the class of a given sequence is predicted. We have tested our method on the MHC Bench IV-b data set [13], and compared with various other methods in the literature.

[1]  Gajendra P. S. Raghava,et al.  SVM based method for predicting HLA-DRB1*0401 binding peptides in an antigen sequence , 2004, Bioinform..

[2]  S Brunak,et al.  Sensitive quantitative predictions of peptide-MHC binding by a 'Query by Committee' artificial neural network approach. , 2003, Tissue antigens.

[3]  Jaime G. Carbonell,et al.  Comparative n-gram analysis of whole-genome protein sequences , 2002 .

[4]  Maria V. Tejada-Simon,et al.  Naturally Processed HLA Class II Peptides Reveal Highly Conserved Immunogenic Flanking Region Sequence Preferences That Reflect Antigen Processing Rather Than Peptide-MHC Interactions1 , 2001, The Journal of Immunology.

[5]  H. Rammensee,et al.  SYFPEITHI: database for MHC ligands and peptide motifs , 1999, Immunogenetics.

[6]  U. Şahin,et al.  Generation of tissue-specific and promiscuous HLA ligand databases using DNA microarrays and virtual HLA class II matrices , 1999, Nature Biotechnology.

[7]  D. Jewell,et al.  Use of eluted peptide sequence data to identify the binding characteristics of peptides to the insulin-dependent diabetes susceptibility allele HLA-DQ8 (DQ 3.2). , 1997, International immunology.

[8]  R. Germain,et al.  Antigen presentation by MHC class II molecules: invariant chain function, protein trafficking, and the molecular basis of diverse determinant capture. , 1997, Human immunology.

[9]  Francesco Sinigaglia,et al.  Precise prediction of major histocompatibility complex class II-peptide interaction based on peptide side chain scanning , 1994, The Journal of experimental medicine.

[10]  D. Zaller,et al.  Prediction of peptide affinity to HLA DRB1*0401. , 1994, International archives of allergy and immunology.

[11]  H. Kalbacher,et al.  Characterization of peptides bound to extracellular and intracellular HLA-DR1 molecules. , 1993, Human immunology.

[12]  Hans-Georg Rammensee,et al.  MHC ligands and peptide motifs: first listing , 2004, Immunogenetics.

[13]  Judith Klein-Seetharaman,et al.  A Sequence Alignment-Independent Method for Protein Classification , 2004, Applied bioinformatics.

[14]  O. Lund,et al.  Reliable prediction of T‐cell epitopes using neural networks with novel sequence representations , 2003, Protein science : a publication of the Protein Society.