B-cell epitope prediction through a graph model

Background
Prediction of B-cell epitopes from antigens is useful for understanding the immune basis of antibody-antigen recognition, and is helpful in vaccine design and drug development. Tremendous efforts have been devoted to this long-studied problem; however, existing methods have at least two common limitations. One is that they favor prediction of epitopes with protrusive conformations but perform poorly on planar epitopes. The other is that they predict all of the antigenic residues of an antigen as belonging to one single epitope, even when multiple non-overlapping epitopes of the antigen exist.

Results
In this paper, we propose to divide an antigen surface graph into subgraphs by using a Markov Clustering algorithm, and then we construct a classifier to distinguish these subgraphs as epitope or non-epitope subgraphs. This classifier is then used to predict epitopes for a test antigen. On a large data set comprising 92 antigen-antibody PDB complexes, our method significantly outperforms the state-of-the-art epitope prediction methods, achieving a 24.7% higher average F-score than the best existing models. In particular, our method can successfully identify epitopes whose non-planarity is too small to be addressed by the other models. Our method can also detect multiple epitopes whenever they exist.

Conclusions
Various protrusive and planar patches at the surface of antigens can be distinguished by using graphical models combined with unsupervised clustering and supervised learning. The difficult problem of identifying multiple epitopes from an antigen can be made easier by using our subgraph approach. The outstanding residue combinations found in the supervised learning will be useful for forming new hypotheses in future studies.
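
The clustering step named in the Results can be illustrated with a minimal sketch of generic Markov Clustering (expansion followed by inflation on a column-stochastic matrix). This is not the authors' implementation: the abstract does not say how the antigen surface graph is built, which parameters are used, or how the subgraph classifier is trained, so the function name `markov_cluster`, the parameters `expansion`, `inflation`, `max_iter` and `tol`, and the toy six-node graph are all assumptions made for illustration.

```python
import numpy as np


def markov_cluster(adjacency, expansion=2, inflation=2.0, max_iter=100, tol=1e-6):
    """Toy Markov Clustering on a dense, symmetric adjacency matrix.

    Hypothetical helper for illustration only; parameter values are assumptions.
    Returns a sorted list of clusters, each a sorted list of node indices.
    """
    m = np.asarray(adjacency, dtype=float)
    m = m + np.eye(m.shape[0])               # add self-loops (common MCL practice)
    m = m / m.sum(axis=0, keepdims=True)     # normalize columns to sum to 1
    for _ in range(max_iter):
        prev = m
        m = np.linalg.matrix_power(m, expansion)   # expansion: spread random-walk flow
        m = np.power(m, inflation)                 # inflation: favor strong flow
        m = m / m.sum(axis=0, keepdims=True)       # renormalize columns
        if np.allclose(m, prev, atol=tol):         # stop once the matrix is stable
            break
    # Rows with a positive diagonal entry are attractors; the nonzero
    # columns of an attractor row are the members of its cluster.
    clusters = set()
    for i in np.flatnonzero(m.diagonal() > tol):
        clusters.add(frozenset(np.flatnonzero(m[i] > tol).tolist()))
    return sorted(sorted(c) for c in clusters)


# Toy "surface graph": two triangles of residues joined by a single edge (2-3).
adj = np.zeros((6, 6))
for a, b in [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]:
    adj[a, b] = adj[b, a] = 1.0

clusters = markov_cluster(adj)
print(clusters)
```

In a pipeline like the one described, each resulting cluster would correspond to one surface subgraph that is then passed to a classifier as a candidate epitope or non-epitope patch; a larger `inflation` value generally yields smaller, more numerous clusters.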
