A Novel Kernel-Based Approach for Predicting Binding Peptides for HLA Class II Molecules

Peptides that bind to Human Leukocyte Antigens (HLA) can be presented to T-cell receptor and trigger immune response. Identification of specific binding peptides is critical for immunology research and vaccine design. However, accurate prediction of peptides binding to HLA molecules is challenging. A variety of methods such as HMM and ANN have been applied to predict peptides that can bind to HLA class I molecules and therefore the number of candidate binders for experimental assay can be largely reduced. However, it is a more complex process to predict peptides that bind to HLA class II molecules. In this paper, we proposed a kernel-based method, integrating the BLOSUM matrix with string kernel to form a new kernel. The substitution score between amino acids in BLOSUM matrix is incorporated into computing the similarity between two binding peptides, which exhibits more biological meaning over traditional string kernels. The promising results of this approach show advantages than other methods.

[1]  William Arbuthnot Sir Lane,et al.  Specificity and promiscuity among naturally processed peptides bound to HLA-DR alleles , 1993, The Journal of experimental medicine.

[2]  Yang Dai,et al.  Prediction of MHC class II binding peptides based on an iterative learning model , 2005, Immunome research.

[3]  Yingdong Zhao,et al.  Application of support vector machines for T-cell epitopes prediction , 2003, Bioinform..

[4]  H Mamitsuka,et al.  Predicting peptides that bind to MHC molecules using supervised learning of hidden markov models , 1998, Proteins.

[5]  Gajendra P. S. Raghava,et al.  ProPred: prediction of HLA-DR binding sites , 2001, Bioinform..

[6]  J. Berzofsky,et al.  Two novel T cell epitope prediction algorithms based on MHC-binding motifs; comparison of predicted and published epitopes from Mycobacterium tuberculosis and HIV protein sequences. , 1995, Vaccine.

[7]  D Haussler,et al.  Knowledge-based analysis of microarray gene expression data by using support vector machines. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Vladimir Brusic,et al.  MHCPEP, a database of MHC-binding peptides: update 1996 , 1997, Nucleic Acids Res..

[9]  Gajendra P. S. Raghava,et al.  SVM based method for predicting HLA-DRB1*0401 binding peptides in an antigen sequence , 2004, Bioinform..

[10]  Vladimir Brusic,et al.  MHCPEP, a database of MHC-binding peptides: update 1996 , 1997, Nucleic Acids Res..

[11]  Tatsuya Akutsu,et al.  Protein homology detection using string alignment kernels , 2004, Bioinform..

[12]  Xing-Ming Zhao,et al.  A novel approach to extracting features from motif content and protein composition for protein sequence classification , 2005, Neural Networks.

[13]  Nello Cristianini,et al.  Classification using String Kernels , 2000 .

[14]  H. Kalbacher,et al.  Characterization of peptides bound to extracellular and intracellular HLA-DR1 molecules. , 1993, Human immunology.

[16]  J. Mercer Functions of Positive and Negative Type, and their Connection with the Theory of Integral Equations , 1909 .

[17]  B Honig,et al.  On the calculation of binding free energies using continuum methods: Application to MHC class I protein‐peptide interactions , 1997, Protein science : a publication of the Protein Society.

[18]  Jianming Shi,et al.  Prediction of MHC class II binders using the ant colony search strategy , 2005, Artif. Intell. Medicine.

[19]  Vladimir Brusic,et al.  Predictive Vaccinology: Optimisation of Predictions Using Support Vector Machine Classifiers , 2005, IDEAL.

[20]  Gajendra P.S. Raghava,et al.  Prediction of CTL epitopes using QM, SVM and ANN techniques. , 2004, Vaccine.

[21]  David A Winkler,et al.  Predictive Bayesian neural network models of MHC class II peptide binding. , 2005, Journal of molecular graphics & modelling.

[22]  Irini A. Doytchinova,et al.  Towards the in silico identification of class II restricted T-cell epitopes: a partial least squares iterative self-consistent algorithm for affinity prediction , 2003, Bioinform..

[23]  H. Rammensee,et al.  Allele-specific motifs revealed by sequencing of self-peptides eluted from MHC molecules , 1991, Nature.

[24]  R. R. Mallios,et al.  Class II MHC quantitative binding motifs derived from a large molecular database with a versatile iterative stepwise discriminant analysis meta- algorithm , 1999, Bioinform..

[25]  Arne Elofsson,et al.  Prediction of MHC class I binding peptides, using SVMHC , 2002, BMC Bioinformatics.

[26]  E. Huarte,et al.  Specific and general HLA-DR binding motifs: comparison of algorithms. , 2000, Human immunology.

[27]  R Elber,et al.  Knowledge-based structure prediction of MHC class I bound peptides: a study of 23 complexes. , 1998, Folding & design.

[28]  Vladimir Brusic,et al.  Data cleansing for computer models: a case study from immunology , 1999, ICONIP'99. ANZIIS'99 & ANNES'99 & ACNN'99. 6th International Conference on Neural Information Processing. Proceedings (Cat. No.99EX378).

[29]  J. Hammer,et al.  New methods to predict MHC-binding sequences within protein antigens. , 1995, Current opinion in immunology.

[30]  U. Şahin,et al.  Generation of tissue-specific and promiscuous HLA ligand databases using DNA microarrays and virtual HLA class II matrices , 1999, Nature Biotechnology.

[31]  A Sette,et al.  Naturally processed peptides longer than nine amino acid residues bind to the class I MHC molecule HLA-A2.1 with high affinity and in different conformations. , 1994, Journal of immunology.

[32]  Vladimir Brusic,et al.  Prediction of MHC class II-binding peptides using an evolutionary algorithm and artificial neural network , 1998, Bioinform..

[33]  Hans-Georg Rammensee,et al.  MHC ligands and peptide motifs: first listing , 2004, Immunogenetics.

[34]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[35]  Ji Wan,et al.  SVRMHC prediction server for MHC-binding peptides , 2006, BMC Bioinformatics.

[36]  Gajendra P. S. Raghava,et al.  MHCBN: a comprehensive database of MHC binding and non-binding peptides , 2003, Bioinform..

[37]  Frederic Maire,et al.  Intelligent Data Engineering and Automated Learning - IDEAL 2005, 6th International Conference, Brisbane, Australia, July 6-8, 2005, Proceedings , 2005, IDEAL.

[38]  Søren Brunak,et al.  Improved prediction of MHC class I and class II epitopes using a novel Gibbs sampling approach , 2004, Bioinform..

[39]  H. Rammensee,et al.  SYFPEITHI: database for MHC ligands and peptide motifs , 1999, Immunogenetics.

[40]  D. Zaller,et al.  Prediction of peptide affinity to HLA DRB1*0401. , 1994, International archives of allergy and immunology.

[41]  Z. Nagy,et al.  Precise prediction of major histocompatibility complex class II-peptide interaction based on peptide side chain scanning , 1994, The Journal of experimental medicine.