Predictive Bayesian neural network models of MHC class II peptide binding.

We used Bayesian regularized neural networks to model data on the MHC class II-binding affinity of peptides. Training data consisted of sequences and binding data for nonamer (nine amino acid) peptides. Independent test data consisted of sequences and binding data for peptides of length </=25. We assumed that MHC class II-binding activity of peptides depends only on the highest ranked embedded nonamer and that reverse sequences of active nonamers are inactive. We also internally validated the models by using 30% of the training data in an internal test set. We obtained robust models, with near identical statistics for multiple training runs. We determined how predictive our models were using statistical tests and area under the Receiver Operating Characteristic (ROC) graphs (A(ROC)). Most models gave training A(ROC) values close to 1.0 and test set A(ROC) values >0.8. We also used both amino acid indicator variables (bin20) and property-based descriptors to generate models for MHC class II-binding of peptides. The property-based descriptors were more parsimonious than the indicator variable descriptors, making them applicable to larger peptides, and their design makes them able to generalize to unknown peptides outside of the training space. None of the external test data sets contained any of the nonamer sequences in the training sets. Consequently, the models attempted to predict the activity of truly unknown peptides not encountered in the training sets. Our models were well able to tackle the difficult problem of correctly predicting the MHC class II-binding activities of a majority of the test set peptides. Exceptions to the assumption that nonamer motif activities were invariant to the peptide in which they were embedded, together with the limited coverage of the test data, and the fuzziness of the classification procedure, are likely explanations for some misclassifications.

[1]  D. Flower,et al.  Additive method for the prediction of protein-peptide binding affinity. Application to the MHC class I molecule HLA-A*0201. , 2002, Journal of proteome research.

[2]  S Buus,et al.  Description and prediction of peptide-MHC binding: the 'human MHC project'. , 1999, Current opinion in immunology.

[3]  Gajendra P. S. Raghava,et al.  SVM based method for predicting HLA-DRB1*0401 binding peptides in an antigen sequence , 2004, Bioinform..

[4]  Paul A. Smith,et al.  Comparison of Linear and Nonlinear Classification Algorithms for the Prediction of Drug and Chemical Metabolism by Human UDP-Glucuronosyltransferase Isoforms , 2003, J. Chem. Inf. Comput. Sci..

[5]  Vladimir Brusic,et al.  Prediction of MHC class II-binding peptides using an evolutionary algorithm and artificial neural network , 1998, Bioinform..

[6]  D. Rognan,et al.  Customized versus universal scoring functions: application to class I MHC-peptide binding free energy predictions. , 2001, Bioorganic & medicinal chemistry letters.

[7]  Frank R Burden,et al.  Broad-based quantitative structure-activity relationship modeling of potency and selectivity of farnesyltransferase inhibitors using a Bayesian regularized neural network. , 2004, Journal of medicinal chemistry.

[8]  Frank R. Burden,et al.  Robust QSAR Models from Novel Descriptors and Bayesian Regularised Neural Networks , 2000 .

[9]  J Zeleznikow,et al.  Efficient discovery of immune response targets by cyclical refinement of QSAR models of peptide binding. , 2001, Journal of molecular graphics & modelling.

[10]  F. Burden,et al.  A quantitative structure--activity relationships model for the acute toxicity of substituted benzenes to Tetrahymena pyriformis using Bayesian-regularized neural networks. , 2000, Chemical research in toxicology.

[11]  A. Tropsha,et al.  Beware of q2! , 2002, Journal of molecular graphics & modelling.

[12]  D. Flower,et al.  Toward the quantitative prediction of T-cell epitopes: coMFA and coMSIA studies of peptides with affinity for the class I MHC molecule HLA-A*0201. , 2001, Journal of medicinal chemistry.

[13]  F. Burden,et al.  Robust QSAR models using Bayesian regularized neural networks. , 1999, Journal of medicinal chemistry.

[14]  Vladimir Brusic,et al.  MHCPEP, a database of MHC-binding peptides: update 1996 , 1997, Nucleic Acids Res..

[15]  M. Wauben,et al.  Major histocompatibility complex class II binding characteristics of peptoid-peptide hybrids. , 2002, Bioorganic & medicinal chemistry.

[16]  J A Swets,et al.  Measuring the accuracy of diagnostic systems. , 1988, Science.

[17]  A Sette,et al.  Two complementary methods for predicting peptides binding major histocompatibility complex molecules. , 1997, Journal of molecular biology.

[18]  David A Winkler,et al.  Modelling blood-brain barrier partitioning using Bayesian neural nets. , 2004, Journal of molecular graphics & modelling.