DynaPred: A structure and sequence based method for the prediction of MHC class I binding peptide sequences and conformations

MOTIVATION The binding of endogenous antigenic peptides to MHC class I molecules is an important step during the immunologic response of a host against a pathogen. Thus, various sequence- and structure-based prediction methods have been proposed for this purpose. The sequence-based methods are computationally efficient, but are hampered by the need of sufficient experimental data and do not provide a structural interpretation of their results. The structural methods are data-independent, but are quite time-consuming and thus not suited for screening of whole genomes. Here, we present a new method, which performs sequence-based prediction by incorporating information obtained from molecular modeling. This allows us to perform large databases screening and to provide structural information of the results. RESULTS We developed a SVM-trained, quantitative matrix-based method for the prediction of MHC class I binding peptides, in which the features of the scoring matrix are energy terms retrieved from molecular dynamics simulations. At the same time we used the equilibrated structures obtained from the same simulations in a simple and efficient docking procedure. Our method consists of two steps: First, we predict potential binders from sequence data alone and second, we construct protein-peptide complexes for the predicted binders. So far, we tested our approach on the HLA-A0201 allele. We constructed two prediction models, using local, position-dependent (DynaPred(POS)) and global, position-independent (DynaPred) features. The former model outperformed the two sequence-based methods used in our evaluation; the latter shows a much higher generalizability towards other alleles than the position-dependent models. The constructed peptide structures can be refined within seconds to structures with an average backbone RMSD of 1.53 A from the corresponding experimental structures.

[1]  Kun Yu,et al.  Methods for Prediction of Peptide Binding to MHC Molecules: A Comparative Study , 2002, Molecular medicine.

[2]  Berk Hess,et al.  GROMACS 3.0: a package for molecular simulation and trajectory analysis , 2001 .

[3]  H. Margalit,et al.  Ranking potential binding peptides to MHC molecules by a computational threading approach. , 1995, Journal of molecular biology.

[4]  Vladimir Brusic,et al.  MHCPEP, a database of MHC-binding peptides: update 1996 , 1997, Nucleic Acids Res..

[5]  J. Åqvist,et al.  Ligand binding affinities from MD simulations. , 2002, Accounts of chemical research.

[6]  M. Segal,et al.  Relating Amino Acid Sequence to Phenotype: Analysis of Peptide‐Binding Data , 2000, Biometrics.

[7]  C. DeLisi,et al.  Computing the structure of bound peptides. Application to antigen recognition by class I major histocompatibility complex receptors. , 1993, Journal of molecular biology.

[8]  Bjoern Peters,et al.  Automated generation and evaluation of specific MHC binding predictive tools: ARB matrix applications , 2005, Immunogenetics.

[9]  R Elber,et al.  Knowledge-based structure prediction of MHC class I bound peptides: a study of 23 complexes. , 1998, Folding & design.

[10]  K. Parker,et al.  Scheme for ranking potential HLA-A2 binding peptides based on independent binding of individual peptide side-chains. , 1994, Journal of immunology.

[11]  M J Sternberg,et al.  Empirical scale of side-chain conformational entropy in protein folding. , 1993, Journal of molecular biology.

[12]  Nicole Frahma,et al.  HIV Molecular Immunology 2005 , 2006 .

[13]  D A Agard,et al.  Binding mode prediction for a flexible ligand in a flexible pocket using multi-conformation simulated annealing pseudo crystallographic refinement. , 2001, Journal of molecular biology.

[14]  C. DeLisi,et al.  Free energy mapping of class I MHC molecules and structural determination of bound peptides , 1996, Protein science : a publication of the Protein Society.

[15]  A Sette,et al.  Two complementary methods for predicting peptides binding major histocompatibility complex molecules. , 1997, Journal of molecular biology.

[16]  M F del Guercio,et al.  Several common HLA-DR types share largely overlapping peptide binding repertoires. , 1998, Journal of immunology.

[17]  Takuya Takahashi,et al.  Prediction of T-cell epitope. , 2007, Journal of pharmacological sciences.

[18]  Thomas Lengauer,et al.  ROCR: visualizing classifier performance in R , 2005, Bioinform..

[19]  Yingdong Zhao,et al.  Application of support vector machines for T-cell epitopes prediction , 2003, Bioinform..

[20]  Didier Rognan,et al.  Recovery of known T-cell epitopes by computational scanning of a viral genome , 2002, J. Comput. Aided Mol. Des..

[21]  Adrian A Canutescu,et al.  Access the most recent version at doi: 10.1110/ps.03154503 References , 2003 .

[22]  R. Abagyan,et al.  Ab initio prediction of peptide‐MHC binding geometry for diverse class I MHC allotypes , 2006, Proteins.

[23]  Shoba Ranganathan,et al.  Modeling the structure of bound peptide ligands to major histocompatibility complex , 2004, Protein science : a publication of the Protein Society.

[24]  C. Chothia,et al.  Hydrophobic bonding and accessible surface area in proteins , 1974, Nature.

[25]  John Sidney,et al.  Examining the independent binding assumption for binding of peptide epitopes to MHC-I molecules , 2003, Bioinform..

[26]  J. Hammer,et al.  New methods to predict MHC-binding sequences within protein antigens. , 1995, Current opinion in immunology.

[27]  D. Flower,et al.  Additive method for the prediction of protein-peptide binding affinity. Application to the MHC class I molecule HLA-A*0201. , 2002, Journal of proteome research.

[28]  L C Harrison,et al.  MHCPEP: a database of MHC-binding peptides. , 1994, Nucleic acids research.

[29]  Irini A. Doytchinova,et al.  Towards the chemometric dissection of peptide – HLA-A*0201 binding affinity: comparison of local and global QSAR models , 2005, J. Comput. Aided Mol. Des..

[30]  I Lasters,et al.  Computation of the binding of fully flexible peptides to proteins with flexible side chains , 1997, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[31]  E. Reinherz,et al.  Prediction of MHC class I binding peptides using profile motifs. , 2002, Human immunology.

[32]  Vladimir Brusic,et al.  Prediction of MHC class II-binding peptides using an evolutionary algorithm and artificial neural network , 1998, Bioinform..

[33]  J. Skolnick,et al.  Application of an artificial neural network to predict specific class I MHC binding peptide sequences , 1998, Nature Biotechnology.

[34]  Ting Wang,et al.  Comparative binding energy (COMBINE) analysis of OppA-peptide complexes to relate structure to binding thermodynamics. , 2002, Journal of medicinal chemistry.

[35]  H Mamitsuka,et al.  Predicting peptides that bind to MHC molecules using supervised learning of hidden markov models , 1998, Proteins.

[36]  L. Zhihua,et al.  Toward the quantitative prediction of T-cell epitopes: QSAR studies on peptides having affinity with the class I MHC molecular HLA-A*0201. , 2004, Journal of computational biology : a journal of computational molecular cell biology.

[37]  Pingping Guan,et al.  Quantitative structure-activity relationships and the prediction of MHC supermotifs. , 2004, Methods.

[38]  Zheng Rong Yang,et al.  Prediction of T-Cell Epitopes Using Biosupport Vector Machines , 2005, J. Chem. Inf. Model..

[39]  Arne Elofsson,et al.  Prediction of MHC class I binding peptides, using SVMHC , 2002, BMC Bioinformatics.

[40]  Gajendra P.S. Raghava,et al.  Prediction of CTL epitopes using QM, SVM and ANN techniques. , 2004, Vaccine.

[41]  Eugene I Shakhnovich,et al.  Structural mining: self-consistent design on flexible protein-peptide docking and transferable binding affinity potential. , 2004, Journal of the American Chemical Society.

[42]  Søren Brunak,et al.  Improved prediction of MHC class I and class II epitopes using a novel Gibbs sampling approach , 2004, Bioinform..

[43]  H. Rammensee,et al.  SYFPEITHI: database for MHC ligands and peptide motifs , 1999, Immunogenetics.

[44]  H. Grey,et al.  Prediction of major histocompatibility complex binding regions of protein antigens by sequence pattern analysis. , 1989, Proceedings of the National Academy of Sciences of the United States of America.