Prediction of conformational epitopes with the use of a knowledge-based energy function and geometrically related neighboring residue characteristics

BackgroundA conformational epitope (CE) in an antigentic protein is composed of amino acid residues that are spatially near each other on the antigen's surface but are separated in sequence; CEs bind their complementary paratopes in B-cell receptors and/or antibodies. CE predication is used during vaccine design and in immuno-biological experiments. Here, we develop a novel system, CE-KEG, which predicts CEs based on knowledge-based energy and geometrical neighboring residue contents. The workflow applied grid-based mathematical morphological algorithms to efficiently detect the surface atoms of the antigens. After extracting surface residues, we ranked CE candidate residues first according to their local average energy distributions. Then, the frequencies at which geometrically related neighboring residue combinations in the potential CEs occurred were incorporated into our workflow, and the weighted combinations of the average energies and neighboring residue frequencies were used to assess the sensitivity, accuracy, and efficiency of our prediction workflow.ResultsWe prepared a database containing 247 antigen structures and a second database containing the 163 non-redundant antigen structures in the first database to test our workflow. Our predictive workflow performed better than did algorithms found in the literature in terms of accuracy and efficiency. For the non-redundant dataset tested, our workflow achieved an average of 47.8% sensitivity, 84.3% specificity, and 80.7% accuracy according to a 10-fold cross-validation mechanism, and the performance was evaluated under providing top three predicted CE candidates for each antigen.ConclusionsOur method combines an energy profile for surface residues with the frequency that each geometrically related amino acid residue pair occurs to identify possible CEs in antigens. This combination of these features facilitates improved identification for immuno-biological studies and synthetic vaccine design. CE-KEG is available at http://cekeg.cs.ntou.edu.tw.

[1]  O. Lund,et al.  Prediction of residues in discontinuous B‐cell epitopes using protein 3D structures , 2006, Protein science : a publication of the Protein Society.

[2]  Avner Schlessinger,et al.  Towards a consensus on datasets and evaluation metrics for developing B‐cell epitope prediction tools , 2007, Journal of molecular recognition : JMR.

[3]  Xinglong Yu,et al.  An introduction to epitope prediction methods and software , 2009, Reviews in medical virology.

[4]  Vasant G Honavar,et al.  Predicting linear B‐cell epitopes using string kernels , 2008, Journal of molecular recognition : JMR.

[5]  M. V. Regenmortel,et al.  Mapping Epitope Structure and Activity: From One-Dimensional Prediction to Four-Dimensional Description of Antigenic Specificity , 1996 .

[6]  Manfred J. Sippl,et al.  Thirty years of environmental health research--and growing. , 1996, Nucleic Acids Res..

[7]  F M Richards,et al.  Areas, volumes, packing and protein structure. , 1977, Annual review of biophysics and bioengineering.

[8]  M. Cadene,et al.  X-ray structure of a voltage-dependent K+ channel , 2003, Nature.

[9]  R. Bruccoleri,et al.  On the attribution of binding energy in antigen-antibody complexes McPC 603, D1.3, and HyHEL-5. , 1989, Biochemistry.

[10]  M. L. Connolly Solvent-accessible surfaces of proteins and nucleic acids. , 1983, Science.

[11]  Angela Chow,et al.  Longitudinal Analysis of the Human Antibody Response to Chikungunya Virus Infection: Implications for Serodiagnosis and Vaccine Development , 2012, Journal of Virology.

[12]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[13]  Di Wu,et al.  SEPPA: a computational server for spatial epitope prediction of protein antigens , 2009, Nucleic Acids Res..

[14]  Tun-Wen Pai,et al.  The family 21 carbohydrate-binding module of glucoamylase from Rhizopus oryzae consists of two sites playing distinct roles in ligand binding. , 2006, The Biochemical journal.

[15]  K. Chou,et al.  Prediction of linear B-cell epitopes using amino acid pair antigenicity scale , 2007, Amino Acids.

[16]  J. Skolnick,et al.  A distance‐dependent atomic knowledge‐based potential for improved protein structure selection , 2001, Proteins.

[17]  B. Lee,et al.  The interpretation of protein structures: estimation of static accessibility. , 1971, Journal of molecular biology.

[18]  Jean-Luc Pellequer,et al.  BEPITOPE: predicting the location of continuous epitopes and patterns in proteins , 2003, Journal of molecular recognition : JMR.

[19]  Morten Nielsen,et al.  Improved method for predicting linear B-cell epitopes , 2006, Immunome research.

[20]  Wei Li,et al.  ElliPro: a new structure-based tool for the prediction of antibody epitopes , 2008, BMC Bioinformatics.

[21]  R. Poljak,et al.  The structural basis of antigen-antibody recognition. , 1987, Annual review of biophysics and biophysical chemistry.

[22]  Itay Mayrose,et al.  Stepwise prediction of conformational discontinuous B‐cell epitopes using the Mapitope algorithm , 2007, Proteins.

[23]  Sudipto Saha,et al.  Prediction of continuous B‐cell epitopes in an antigen using recurrent neural network , 2006, Proteins.

[24]  Yuxin Li,et al.  Pep-3D-Search: a method for B-cell epitope prediction based on mimotope analysis , 2008, BMC Bioinformatics.

[25]  Gajendra P. S. Raghava,et al.  BcePred: Prediction of Continuous B-Cell Epitopes in Antigenic Sequences Using Physico-chemical Properties , 2004, ICARIS.

[26]  Jonathan M Gershoni,et al.  The use of epitope arrays in immunodiagnosis of infectious disease: hepatitis C virus, a case study. , 2013, Analytical biochemistry.

[27]  Van Regenmortel MHV Mapping Epitope Structure and Activity: From One-Dimensional Prediction to Four-Dimensional Description of Antigenic Specificity , 1996, Methods.

[28]  Tun-Wen Pai,et al.  Prediction of B-cell Linear Epitopes with a Combination of Support Vector Machine Classification and Amino Acid Propensity Identification , 2011, Journal of biomedicine & biotechnology.

[29]  D. A. Dougherty,et al.  Cation-π Interactions in Chemistry and Biology: A New View of Benzene, Phe, Tyr, and Trp , 1996, Science.

[30]  Violaine Moreau,et al.  Discontinuous epitope prediction based on mimotope analysis , 2006, Bioinform..

[31]  Urmila Kulkarni-Kale,et al.  CEP: a conformational epitope prediction server , 2005, Nucleic Acids Res..

[32]  Violaine Moreau,et al.  PEPOP: Computational design of immunogenic peptides , 2008, BMC Bioinformatics.

[33]  M. V. Van Regenmortel,et al.  Antigenicity and immunogenicity of synthetic peptides. , 2001, Biologicals : journal of the International Association of Biological Standardization.

[34]  Tun-Wen Pai,et al.  Estimation and extraction of B‐cell linear epitopes predicted by mathematical morphology approaches , 2008, Journal of molecular recognition : JMR.

[35]  Neil S. Greenspan,et al.  Defining epitopes: It's not as easy as it seems , 1999, Nature Biotechnology.

[36]  Andrew C. R. Martin,et al.  SACS-Self-maintaining database of antibody crystal structure information , 2002, Bioinform..

[37]  Pierre Baldi,et al.  PEPITO: improved discontinuous B-cell epitope prediction using multiple distance thresholds and half sphere exposure , 2008, Bioinform..