A set of new amino acid descriptors applied in prediction of MHC class I binding peptides.

A set of new amino acid descriptors, namely factor analysis scales of generalized amino acid information (FASGAI) involving hydrophobicity, alpha and turn propensities, bulky properties, compositional characteristics, local flexibility and electronic properties, was proposed to resolve the representation of peptide structures. FASGAI vectors were then used to represent the structures of 152 HLA-A(*)0201 restrictive T-cell epitopes with 9 amino acid residues. The features that are closely related to binding affinities were selected by genetic arithmetic, and the model based on partial least squares was developed to predict binding affinities. The model revealed promising predictive power, giving relatively high predictions for training and test samples. Further, the PreMHCbinding program at significantly lower computational complexity was exploited to predict MHC class I binding peptides. Quantitative structure-affinity relationship analyses demonstrated the bulky properties and hydrophobicity of the 3rd residue, bulky properties of the 2nd residue, hydrophobicity of the 9th residue that provided high positive contribution to the binding affinities, and that the hydrophobicity of the 4th residue and local flexibility of the 3rd residue were negative to binding affinities. The results showed that FASGAI vectors can be further utilized to represent the structures of other functional peptides; moreover, it has thus showed us further direction into the potential applications on relationship between structures and functions of proteins.

[1]  S. Kienle,et al.  Decrypting the structure of major histocompatibility complex class I- restricted cytotoxic T lymphocyte epitopes with complex peptide libraries , 1995, The Journal of experimental medicine.

[2]  H. Rammensee,et al.  SYFPEITHI: database for MHC ligands and peptide motifs , 1999, Immunogenetics.

[3]  Gajendra P. S. Raghava,et al.  SVM based method for predicting HLA-DRB1*0401 binding peptides in an antigen sequence , 2004, Bioinform..

[4]  H Mamitsuka,et al.  Predicting peptides that bind to MHC molecules using supervised learning of hidden markov models , 1998, Proteins.

[5]  L C Harrison,et al.  Fuzzy neural network-based prediction of the motif for MHC class II binding peptides. , 2001, Journal of bioscience and bioengineering.

[6]  H. Rammensee,et al.  Allele-specific motifs revealed by sequencing of self-peptides eluted from MHC molecules , 1991, Nature.

[7]  Esteve Xavier Rifà Ros,et al.  FIELD, A. (2005). Discovering Statistics Using SPSS. London: SAGE Publications , 2006 .

[8]  Shunzhou Wan,et al.  Large‐scale molecular dynamics simulations of HLA‐A*0201 complexed with a tumor‐specific antigenic peptide: Can the α3 and β2m domains be neglected? , 2004, J. Comput. Chem..

[9]  S. Wold,et al.  Peptide quantitative structure-activity relationships, a multivariate approach. , 1987, Journal of medicinal chemistry.

[10]  J A Koziol,et al.  Prediction of binding to MHC class I molecules. , 1995, Journal of immunological methods.

[11]  Vladimir Brusic,et al.  Prediction of MHC class II-binding peptides using an evolutionary algorithm and artificial neural network , 1998, Bioinform..

[12]  Vladimir Brusic,et al.  MHCPEP, a database of MHC-binding peptides: update 1996 , 1997, Nucleic Acids Res..

[13]  Shengshi Z. Li,et al.  A new set of amino acid descriptors and its application in peptide QSARs. , 2005, Biopolymers.

[14]  L. Kier,et al.  Amino acid side chain parameters for correlation studies in biology and pharmacology. , 2009, International journal of peptide and protein research.

[15]  P. Sneath Relations between chemical structure and biological activity in peptides. , 1966, Journal of theoretical biology.

[16]  Z. Nagy,et al.  Precise prediction of major histocompatibility complex class II-peptide interaction based on peptide side chain scanning , 1994, The Journal of experimental medicine.

[17]  Naoki Abe,et al.  Prediction of MHC Class I Binding Peptides by a Query Learning Algorithm Based on Hidden Markov Models , 2002, Journal of biological physics.

[18]  Torbjörn Lundstedt,et al.  PREPROCESSING PEPTIDE SEQUENCES FOR MULTIVARIATE SEQUENCE-PROPERTY ANALYSIS , 1998 .

[19]  V. Brusic,et al.  Neural network-based prediction of candidate T-cell epitopes , 1998, Nature Biotechnology.

[20]  H. Rammensee,et al.  Peptides naturally presented by MHC class I molecules. , 1993, Annual review of immunology.

[21]  C. B. Lucasius,et al.  Genetic algorithms for large-scale optimization in chemometrics: An application , 1991 .

[22]  Vladimir Brusic,et al.  Computational methods for prediction of T-cell epitopes--a framework for modelling, testing, and applications. , 2004, Methods.

[23]  Don C. Wiley,et al.  Crystal structure of the human class II MHC protein HLA-DR1 complexed with an influenza virus peptide , 1994, Nature.

[24]  Richard A. Johnson,et al.  Applied Multivariate Statistical Analysis , 1983 .

[25]  D. Flower,et al.  Toward the quantitative prediction of T-cell epitopes: coMFA and coMSIA studies of peptides with affinity for the class I MHC molecule HLA-A*0201. , 2001, Journal of medicinal chemistry.

[26]  Hans-Georg Rammensee,et al.  MHC ligands and peptide motifs: first listing , 2004, Immunogenetics.

[27]  D. Wiley,et al.  The antigenic identity of peptide-MHC complexes: A comparison of the conformations of five viral peptides presented by HLA-A2 , 1993, Cell.

[28]  D. Madden The three-dimensional structure of peptide-MHC complexes. , 1995, Annual review of immunology.

[29]  Andy P. Field,et al.  Discovering Statistics Using SPSS , 2000 .

[30]  Stefan Kienle,et al.  Tolerance to Amino Acid Variations in Peptides Binding to the Major Histocompatibility Complex Class I Protein H-2Kb(*) , 1995, The Journal of Biological Chemistry.

[31]  Hiroyuki Honda,et al.  Prediction of peptide binding to major histocompatibility complex class II molecules through use of boosted fuzzy classifier with SWEEP operator method. , 2006, Journal of bioscience and bioengineering.

[32]  S. Wold,et al.  PLS-regression: a basic tool of chemometrics , 2001 .

[33]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[34]  Paola Gramatica,et al.  The Importance of Being Earnest: Validation is the Absolute Essential for Successful Application and Interpretation of QSPR Models , 2003 .

[35]  G. Klebe,et al.  Molecular similarity indices in a comparative analysis (CoMSIA) of drug molecules to correlate and predict their biological activity. , 1994, Journal of medicinal chemistry.

[36]  R. Cramer,et al.  Comparative molecular field analysis (CoMFA). 1. Effect of shape on binding of steroids to carrier proteins. , 1988, Journal of the American Chemical Society.

[37]  Athanassios Stavrakoudis,et al.  T‐cell epitopes of the La/SSB autoantigen: Prediction based on the homology modeling of HLA‐DQ2/DQ7 with the insulin‐B peptide/HLA‐DQ8 complex , 2006, J. Comput. Chem..

[38]  J. Sidney,et al.  Prominent role of secondary anchor residues in peptide binding to HLA-A2.1 molecules , 1993, Cell.