Computational Models for Identifying Promiscuous HLA-B7 Binders based on Information Theory and Support Vector Machine

Computational vaccinology is a developing discipline. To become a standard component in vaccine development, it requires accurate and broadly applicable models of wet-lab experiments. We developed prediction models based on a novel data representation of peptide/MHC interaction and support vector machines (SVM) for prediction of peptides that promiscuously bind to multiple human leukocyte antigen (HLA) alleles belonging to HLA-B7 supertype. 10-fold cross-validation results showed that the area under the receiver operating curve (Aroc) of SVM models is above 0.90. Blind testing results showed that the average Aroc of SVM models is 0.84. A learning approach based on information theory, termed Information Learning Approach, was used for feature selection. Several amino acid positions with high information content have been identified in input 9mer peptides and HLA alleles and were used as input features to SVM. They are position 1, 2, 4, 5, 7, 8, 9 in 9mer peptides and position 45 and 97 in HLA-B7 molecules. Prediction accuracy was improved after feature selection. These positions cover the anchor positions of HLA-B7 alleles, which have important biological roles for successful biding of relevant peptides.

[1]  Kiyoshi Miwa,et al.  Residue 116 determines the C-terminal anchor residue of HLA-B*3501 and -B*5101 binding peptides but does not explain the general affinity difference , 1998, Immunogenetics.

[2]  A Sette,et al.  The development of multi-epitope vaccines: epitope identification, vaccine design and clinical evaluation. , 2001, Biologicals : journal of the International Association of Biological Standardization.

[3]  J. Sidney,et al.  HLA supertypes and supermotifs: a functional perspective on HLA polymorphism. , 1998, Current opinion in immunology.

[4]  Chee Keong Kwoh,et al.  Dynamic algorithm for inferring qualitative models of Gene Regulatory Networks , 2006, Int. J. Data Min. Bioinform..

[5]  M F del Guercio,et al.  Definition of an HLA-A3-like supermotif demonstrates the overlapping peptide-binding repertoires of common HLA molecules. , 1996, Human immunology.

[6]  T. Williams Human leukocyte antigen gene polymorphism and the histocompatibility laboratory. , 2001, The Journal of molecular diagnostics : JMD.

[7]  Vladimir Brusic,et al.  Computational binding assays of antigenic peptides , 2006, Letters in Peptide Science.

[8]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[9]  J Alexander,et al.  Optimizing vaccine design for cellular processing, MHC binding and TCR recognition. , 2002, Tissue antigens.

[10]  A Sette,et al.  Practical, biochemical and evolutionary implications of the discovery of HLA class I supermotifs. , 1996, Immunology today.

[11]  Vladimir Brusic,et al.  Predictive Vaccinology: Optimisation of Predictions Using Support Vector Machine Classifiers , 2005, IDEAL.

[12]  Arne Elofsson,et al.  Prediction of MHC class I binding peptides, using SVMHC , 2002, BMC Bioinformatics.

[13]  Hong Zhang,et al.  EPIMHC: a curated database of MHC-binding peptides for customized computational vaccinology , 2005, Bioinform..

[14]  J. Sidney,et al.  Nine major HLA class I supertypes account for the vast preponderance of HLA-A and -B polymorphism , 1999, Immunogenetics.

[15]  Takamasa Ueno,et al.  Single T Cell Receptor-Mediated Recognition of an Identical HIV-Derived Peptide Presented by Multiple HLA Class I Molecules1 , 2002, The Journal of Immunology.

[16]  Irini A. Doytchinova,et al.  JenPep: a database of quantitative functional peptide data for immunology , 2002, Bioinform..

[17]  Bing Zhao,et al.  A novel MHCp binding prediction model. , 2003, Human immunology.

[18]  Vladimir Brusic,et al.  Prediction of promiscuous peptides that bind HLA class I molecules , 2002, Immunology and cell biology.

[19]  H. Rammensee,et al.  SYFPEITHI: database for MHC ligands and peptide motifs , 1999, Immunogenetics.

[20]  Yun. Zheng Information learning approach , 2007 .

[21]  L. Tussey,et al.  Mapping and binding analysis of peptides derived from the tumor-associated antigen survivin for eight HLA alleles. , 2005, Cancer immunity.

[22]  G. Chelvanayagam A roadmap for HLA-A, HLA-B, and HLA-C peptide binding specificities , 1996, Immunogenetics.

[23]  Vladimir Brusic,et al.  Neural Models for Predicting Viral Vaccine Targets , 2005, J. Bioinform. Comput. Biol..

[24]  S Brunak,et al.  Sensitive quantitative predictions of peptide-MHC binding by a 'Query by Committee' artificial neural network approach. , 2003, Tissue antigens.

[25]  Vladimir Brusic,et al.  MULTIPRED: a computational system for prediction of promiscuous HLA binding peptides , 2005, Nucleic Acids Res..

[26]  Kwoh Chee Keong,et al.  Dynamic algorithm for inferring qualitative models of gene regulatory networks , 2004 .

[27]  J A Swets,et al.  Measuring the accuracy of diagnostic systems. , 1988, Science.

[28]  C. Schönbach,et al.  Fine tuning of peptide binding to HLA-B*3501 molecules by nonanchor residues. , 1995, Journal of immunology.

[29]  S. Brunak,et al.  SARS CTL vaccine candidates; HLA supertype‐, genome‐wide scanning and biochemical validation , 2004, Tissue antigens.

[30]  Francine Jotereau,et al.  Identification of Five New HLA-B*3501-Restricted Epitopes Derived from Common Melanoma-Associated Antigens, Spontaneously Recognized by Tumor-Infiltrating Lymphocytes 1 , 2003, The Journal of Immunology.

[31]  Lucy Dorrell,et al.  Cytotoxic T lymphocytes recognize structurally diverse, clade‐specific and cross‐reactive peptides in human immunodeficiency virus type‐1 gag through HLA‐B53 , 2001, European journal of immunology.

[32]  Alessandro Sette,et al.  Development of a DNA Vaccine Designed to Induce Cytotoxic T Lymphocyte Responses to Multiple Conserved Epitopes in HIV-1 1 , 2003, The Journal of Immunology.

[33]  M F del Guercio,et al.  Specificity and degeneracy in peptide binding to HLA-B7-like class I molecules. , 1996, Journal of immunology.

[34]  L C Harrison,et al.  MHCPEP: a database of MHC-binding peptides. , 1994, Nucleic acids research.

[35]  M F del Guercio,et al.  Several HLA alleles share overlapping peptide specificities. , 1995, Journal of immunology.

[36]  S. Oka,et al.  Identification of multiple HIV-1 CTL epitopes presented by HLA-B*5101 molecules. , 1999, Human immunology.

[37]  Zheng Yun,et al.  Identifying simple discriminatory gene vectors with an information theory approach , 2005, 2005 IEEE Computational Systems Bioinformatics Conference (CSB'05).