Prediction of Discontinuous B-Cell Epitopes Using Logistic Regression and Structural Information

Computational prediction of discontinuous B-cell epitopes remains challenging, but it is an important task in vaccine design. In this study, we developed a novel computational method to predict discontinuous epitope residues by combining the logistic regression model with two important structural features, B-factor and relative accessible surface area (RASA). We conducted five-fold cross-validation on a representative dataset composed of antigen structures bound with antibodies and independent testing on Epitome database, respectively. Experimental results indicate that besides the well-known RASA feature, B-factor can also be used to identify discontinuous epitopes. Furthermore, these two features are complementary and their combination can remarkably improve the prediction performance. Comparison with existing approaches shows that our method can achieve better performance in terms of average AUC value and sensitivity for predicting discontinuous B-cell epitopes.

[1]  M.H.V. Van Regenmortel Synthetic peptides versus natural antigens in immunoassays. , 1993 .

[2]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[3]  B. Rost,et al.  Conservation and prediction of solvent accessibility in protein families , 1994, Proteins.

[4]  Chi Zhang,et al.  Prediction of antigenic epitopes on protein surfaces by consensus scoring , 2009, BMC Bioinformatics.

[5]  Urmila Kulkarni-Kale,et al.  CEP: a conformational epitope prediction server , 2005, Nucleic Acids Res..

[6]  P. Tongaonkar,et al.  A semi‐empirical method for prediction of antigenic determinants on protein antigens , 1990, FEBS letters.

[7]  Pierre Baldi,et al.  PEPITO: improved discontinuous B-cell epitope prediction using multiple distance thresholds and half sphere exposure , 2008, Bioinform..

[8]  Wei Li,et al.  ElliPro: a new structure-based tool for the prediction of antibody epitopes , 2008, BMC Bioinformatics.

[9]  P Argos,et al.  Correlation between side chain mobility and conformation in protein structures. , 1997, Protein engineering.

[10]  E Westhof,et al.  Correlation between the location of antigenic sites and the prediction of turns in proteins. , 1993, Immunology letters.

[11]  P. Karplus,et al.  Prediction of chain flexibility in proteins , 1985, Naturwissenschaften.

[12]  Sudipto Saha,et al.  Prediction of continuous B‐cell epitopes in an antigen using recurrent neural network , 2006, Proteins.

[13]  J M Thornton,et al.  Protein-protein interactions: a review of protein dimer structures. , 1995, Progress in biophysics and molecular biology.

[14]  O. Lund,et al.  Prediction of residues in discontinuous B‐cell epitopes using protein 3D structures , 2006, Protein science : a publication of the Protein Society.

[15]  G. Rose,et al.  Antigenic determinants in proteins coincide with surface regions accessible to large probes (antibody domains). , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[16]  T. Bhat,et al.  Bound water molecules and conformational stabilization help mediate an antigen-antibody association. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Di Wu,et al.  SEPPA: a computational server for spatial epitope prediction of protein antigens , 2009, Nucleic Acids Res..

[18]  M H Van Regenmortel Synthetic peptides versus natural antigens in immunoassays. , 1993, Annales de biologie clinique.

[19]  Avner Schlessinger,et al.  Epitome: database of structure-inferred antigenic epitopes , 2005, Nucleic Acids Res..

[20]  K. Chou,et al.  Prediction of linear B-cell epitopes using amino acid pair antigenicity scale , 2007, Amino Acids.

[21]  Nimrod D. Rubinstein,et al.  A machine-learning approach for predicting B-cell epitopes. , 2009, Molecular immunology.

[22]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[23]  Seungwoo Hwang,et al.  Using evolutionary and structural information to predict DNA‐binding sites on DNA‐binding proteins , 2006, Proteins.

[24]  R. Hodges,et al.  New hydrophilicity scale derived from high-performance liquid chromatography peptide retention data: correlation of predicted surface residues with antigenicity and X-ray-derived accessible sites. , 1986, Biochemistry.

[25]  E. Emini,et al.  Induction of hepatitis A virus-neutralizing antibody by a virus-specific synthetic peptide , 1985, Journal of virology.