PEPITO: improved discontinuous B-cell epitope prediction using multiple distance thresholds and half sphere exposure

MOTIVATION Accurate prediction of B-cell epitopes is an important goal of computational immunology. Up to 90% of B-cell epitopes are discontinuous in nature, yet most predictors focus on linear epitopes. Even when the tertiary structure of the antigen is available, the accurate prediction of B-cell epitopes remains challenging. RESULTS Our predictor, PEPITO, uses a combination of amino-acid propensity scores and half sphere exposure values at multiple distances to achieve state-of-the-art performance. PEPITO achieves an area under the curve (AUC) of 75.4 on the Discotope dataset. Additionally, we benchmark PEPITO as well as the Discotope predictor on the more recent Epitome dataset, achieving AUCs of 68.3 and 66.0, respectively. AVAILABILITY PEPITO is available as part of the SCRATCH suite of protein structure predictors via www.igb.uci.edu. CONTACT pfbaldi@ics.uci.edu SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

[1]  Avner Schlessinger,et al.  Towards a consensus on datasets and evaluation metrics for developing B‐cell epitope prediction tools , 2007, Journal of molecular recognition : JMR.

[2]  Arno Lukas,et al.  Identification of discontinuous antigenic determinants on proteins based on shape complementarities , 2007, Journal of molecular recognition : JMR.

[3]  Urmila Kulkarni-Kale,et al.  CEP: a conformational epitope prediction server , 2005, Nucleic Acids Res..

[4]  M. V. Regenmortel,et al.  Mapping Epitope Structure and Activity: From One-Dimensional Prediction to Four-Dimensional Description of Antigenic Specificity , 1996 .

[5]  O. Lund,et al.  Prediction of residues in discontinuous B‐cell epitopes using protein 3D structures , 2006, Protein science : a publication of the Protein Society.

[6]  Avner Schlessinger,et al.  Epitome: database of structure-inferred antigenic epitopes , 2005, Nucleic Acids Res..

[7]  K. R. Woods,et al.  Prediction of protein antigenic determinants from amino acid sequences. , 1981, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Sudipto Saha,et al.  Prediction of continuous B‐cell epitopes in an antigen using recurrent neural network , 2006, Proteins.

[9]  D. Flower,et al.  Benchmarking B cell epitope prediction: Underperformance of existing methods , 2005, Protein science : a publication of the Protein Society.

[10]  Marc H V Van Regenmortel,et al.  Immunoinformatics may lead to a reappraisal of the nature of B cell epitopes and of the feasibility of synthetic peptide vaccines , 2006, Journal of molecular recognition : JMR.

[11]  Jean-Luc Pellequer,et al.  BEPITOPE: predicting the location of continuous epitopes and patterns in proteins , 2003, Journal of molecular recognition : JMR.

[12]  A. Alix,et al.  Predictive estimation of protein linear epitopes by using the program PEOPLE. , 1999, Vaccine.

[13]  Burkhard Rost,et al.  UniqueProt: creating representative protein sequence sets , 2003, Nucleic Acids Res..

[14]  Van Regenmortel MHV Mapping Epitope Structure and Activity: From One-Dimensional Prediction to Four-Dimensional Description of Antigenic Specificity , 1996, Methods.

[15]  Morten Nielsen,et al.  Improved method for predicting linear B-cell epitopes , 2006, Immunome research.

[16]  T. Hamelryck An amino acid has two sides: A new 2D measure provides a different view of solvent exposure , 2005, Proteins.

[17]  Pierre Baldi,et al.  SCRATCH: a protein structure and structural feature prediction server , 2005, Nucleic Acids Res..