Analysis and prediction of affinity of TAP binding peptides using cascade SVM

The generation of cytotoxic T lymphocyte (CTL) epitopes from an antigenic sequence involves number of intracellular processes, including production of peptide fragments by proteasome and transport of peptides to endoplasmic reticulum through transporter associated with antigen processing (TAP). In this study, 409 peptides that bind to human TAP transporter with varying affinity were analyzed to explore the selectivity and specificity of TAP transporter. The abundance of each amino acid from P1 to P9 positions in high‐, intermediate‐, and low‐affinity TAP binders were examined. The rules for predicting TAP binding regions in an antigenic sequence were derived from the above analysis. The quantitative matrix was generated on the basis of contribution of each position and residue in binding affinity. The correlation of r = 0.65 was obtained between experimentally determined and predicted binding affinity by using a quantitative matrix. Further a support vector machine (SVM)‐based method has been developed to model the TAP binding affinity of peptides. The correlation (r = 0.80) was obtained between the predicted and experimental measured values by using sequence‐based SVM. The reliability of prediction was further improved by cascade SVM that uses features of amino acids along with sequence. An extremely good correlation (r = 0.88) was obtained between measured and predicted values, when the cascade SVM‐based method was evaluated through jackknife testing. A Web service, TAPPred (http://www.imtech.res.in/raghava/tappred/ or http://bioinformatics.uams.edu/mirror/tappred/), has been developed based on this approach.

[1]  Irini A. Doytchinova,et al.  JenPep: a database of quantitative functional peptide data for immunology , 2002, Bioinform..

[2]  C DeLisi,et al.  T-cell antigenic sites tend to be amphipathic structures. , 1985, Proceedings of the National Academy of Sciences of the United States of America.

[3]  R. Doolittle,et al.  A simple method for displaying the hydropathic character of a protein. , 1982, Journal of molecular biology.

[4]  Bjoern Peters,et al.  Identifying MHC Class I Epitopes by Predicting the TAP Transport Efficiency of Epitope Precursors , 2003, The Journal of Immunology.

[5]  P. Karplus,et al.  Prediction of chain flexibility in proteins , 1985, Naturwissenschaften.

[6]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[7]  D. Flower,et al.  Toward the quantitative prediction of T-cell epitopes: coMFA and coMSIA studies of peptides with affinity for the class I MHC molecule HLA-A*0201. , 2001, Journal of medicinal chemistry.

[8]  H. Ploegh,et al.  Substrate specificity of allelic variants of the TAP peptide transporter. , 1994, Immunity.

[9]  Vladimir Brusic,et al.  A neural network model approach to the study of human TAP transporter , 1998, Silico Biol..

[10]  M. Levitt,et al.  Conformation of amino acid side-chains in proteins. , 1978, Journal of molecular biology.

[11]  P. Ponnuswamy,et al.  Hydrophobic character of amino acid residues in globular proteins , 1978, Nature.

[12]  T. Schumacher,et al.  Peptide translocation by variants of the transporter associated with antigen processing. , 1993, Science.

[13]  M. Levitt Conformational preferences of amino acids in globular proteins. , 1978, Biochemistry.

[14]  R. Hodges,et al.  New hydrophilicity scale derived from high-performance liquid chromatography peptide retention data: correlation of predicted surface residues with antigenicity and X-ray-derived accessible sites. , 1986, Biochemistry.

[15]  Julie McMurry,et al.  Immuno‐informatics: Mining genomes for vaccine components , 2002, Immunology and cell biology.

[16]  P. Kloetzel,et al.  A theoretical approach towards the identification of cleavage-determining amino acid motifs of the 20 S proteasome. , 1999, Journal of molecular biology.

[17]  Chris H. Q. Ding,et al.  Multi-class protein fold recognition using support vector machines and neural networks , 2001, Bioinform..

[18]  S. Tonegawa,et al.  Peptide length and sequence specificity of the mouse TAP1/TAP2 translocator , 1994, The Journal of experimental medicine.

[19]  K. Parker,et al.  Scheme for ranking potential HLA-A2 binding peptides based on independent binding of individual peptide side-chains. , 1994, Journal of immunology.

[20]  Robert Tampé,et al.  The transporter associated with antigen processing: function and implications in human diseases. , 2002, Physiological reviews.

[21]  Gajendra P. S. Raghava,et al.  MHCBN: a comprehensive database of MHC binding and non-binding peptides , 2003, Bioinform..

[22]  A Sette,et al.  The peptide-binding motif for the human transporter associated with antigen processing , 1995, The Journal of experimental medicine.

[23]  V Brusic,et al.  Relationship between peptide selectivities of human transporters associated with antigen processing and HLA class I molecules. , 1998, Journal of immunology.

[24]  Hans-Georg Rammensee,et al.  MHC ligands and peptide motifs: first listing , 2004, Immunogenetics.

[25]  P. Cresswell,et al.  Human transporters associated with antigen processing possess a promiscuous peptide-binding site. , 1994, Immunity.

[26]  J L Cornette,et al.  Prediction of immunodominant helper T cell antigenic sites from the primary sequence. , 1987, Journal of immunology.

[27]  R. Obst,et al.  Analysis of the fine specificity of rat, mouse and human TAP peptide transporters , 1995, European journal of immunology.

[28]  P. van Endert,et al.  Powering the peptide pump: TAP crosstalk with energetic nucleotides. , 2002, Trends in biochemical sciences.

[29]  R. Tampé,et al.  A sequential model for peptide binding and transport by the transporters associated with antigen processing. , 1994, Immunity.

[30]  Arne Elofsson,et al.  Prediction of MHC class I binding peptides, using SVMHC , 2002, BMC Bioinformatics.

[31]  Nello Cristianini,et al.  Advances in Kernel Methods - Support Vector Learning , 1999 .

[32]  A Sette,et al.  Two complementary methods for predicting peptides binding major histocompatibility complex molecules. , 1997, Journal of molecular biology.

[33]  K. R. Woods,et al.  Prediction of protein antigenic determinants from amino acid sequences. , 1981, Proceedings of the National Academy of Sciences of the United States of America.

[34]  S Uebel,et al.  Recognition principle of the TAP transporter disclosed by combinatorial peptide libraries. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[35]  C. Chothia Structural invariants in protein folding , 1975, Nature.

[36]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[37]  S. Tenzer,et al.  Using the World Wide Web for predicting CTL epitopes. , 2003, Current opinion in immunology.

[38]  D. Eisenberg Three-dimensional structure of membrane and surface proteins. , 1984, Annual review of biochemistry.

[39]  Gajendra P. S. Raghava,et al.  ProPred1: Prediction of Promiscuous MHC Class-I Binding Sites , 2003, Bioinform..

[40]  A Facchiano,et al.  Flexibility plot of proteins. , 1989, Protein engineering.

[41]  R. Tampé,et al.  Function of the transport complex TAP in cellular immune recognition. , 1999, Biochimica et biophysica acta.

[42]  Robert Tampé,et al.  The transporter associated with antigen processing TAP: structure and function , 1999, FEBS letters.

[43]  Hans-Georg Rammensee,et al.  MHC Ligands and Peptide Motifs , 1998, Molecular Biology Intelligence Unit.

[44]  H. Bull,et al.  Surface tension of amino acid solutions: a hydrophobicity scale of the amino acid residues. , 1974, Archives of biochemistry and biophysics.

[45]  S Uebel,et al.  Specificity of the proteasome and the TAP transporter. , 1999, Current opinion in immunology.

[46]  D Haussler,et al.  Knowledge-based analysis of microarray gene expression data by using support vector machines. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[47]  K C Chou,et al.  An analysis of protein folding type prediction by seed-propagated sampling and jackknife test , 1995, Journal of protein chemistry.

[48]  S. Stevanović,et al.  Combining computer algorithms with experimental approaches permits the rapid and accurate identification of T cell epitopes from defined antigens. , 2001, Journal of immunological methods.

[49]  G. Rose,et al.  Hydrophobicity of amino acid residues in globular proteins. , 1985, Science.

[50]  D. D. Jones,et al.  Amino acid properties and side-chain orientation in proteins: a cross correlation appraoch. , 1975, Journal of theoretical biology.

[51]  G. Hämmerling,et al.  Antigen processing and presentation‐towards the Millennium , 1999, Immunological reviews.