A multi-resolution graph convolution network for contiguous epitope prediction

Computational methods for predicting binding interfaces between antigens and antibodies (epitopes and paratopes) are faster and cheaper than traditional experimental structure determination methods. A sufficiently reliable computational predictor that could scale to large sets of available antibody sequence data could thus inform and expedite many biomedical pursuits, such as better understanding immune responses to vaccination and natural infection and developing better drugs and vaccines. However, current state-of-the-art predictors produce discontiguous predictions, e.g., predicting the epitope in many different spots on an antigen, even though in reality they typically comprise a single localized region. We seek to produce contiguous predicted epitopes, accounting for long-range spatial relationships between residues. We therefore build a novel Graph Convolution Network (GCN) that performs graph convolutions at multiple resolutions so as to represent and constrain long-range spatial dependencies. In evaluation on a standard epitope prediction benchmark, we see a significant boost with the multi-resolution approach compared to a previous state-of-the-art GCN predictor, with half of the test cases increasing in AUC-PR by an average of 0.15 and the other half decreasing by only 0.05. We further introduce a clustering algorithm that takes advantage of the contiguity yielded by our model, grouping the raw predictions into a small set of discrete potential epitopes. We show that within the top 3 clusters, 73% of test cases contain a cluster covering most of the actual epitope, demonstrating the utility of contiguous predictions for guiding experimental methods by yielding a small set of reasonable hypotheses for further investigation.

[1]  Christopher D. Manning,et al.  Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.

[2]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[3]  Vasant Honavar,et al.  Predicting flexible length linear B-cell epitopes. , 2008, Computational systems bioinformatics. Computational Systems Bioinformatics Conference.

[4]  Dmitrij Frishman,et al.  STRIDE: a web server for secondary structure assignment from known atomic coordinates of proteins , 2004, Nucleic Acids Res..

[5]  Chenchen Liu,et al.  How convolutional neural networks see the world - A survey of convolutional neural network visualization methods , 2018, Math. Found. Comput..

[6]  Jiye Shi,et al.  ABodyBuilder: Automated antibody structure prediction with data–driven accuracy estimation , 2016, mAbs.

[7]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[8]  V. Giudicelli,et al.  IMGT unique numbering for immunoglobulin and T cell receptor variable domains and Ig superfamily V-like domains. , 2003, Developmental and comparative immunology.

[9]  M. Bronstein,et al.  Deciphering interaction fingerprints from protein molecular surfaces using geometric deep learning , 2019, Nature Methods.

[10]  Morten Nielsen,et al.  Reliable B Cell Epitope Predictions: Impacts of Method Development and Improved Benchmarking , 2012, PLoS Comput. Biol..

[11]  Christopher Joseph Pal,et al.  The Importance of Skip Connections in Biomedical Image Segmentation , 2016, LABELS/DLMIA@MICCAI.

[12]  Jeffrey J. Gray,et al.  Large-scale sequence and structural comparisons of human naive and antigen-experienced antibody repertoires , 2016, Proceedings of the National Academy of Sciences.

[13]  Benjamin D Brooks,et al.  High-throughput epitope binning of therapeutic monoclonal antibodies: why you need to bin the fridge. , 2014, Drug discovery today.

[14]  Jean-Christophe Nebel,et al.  Progress and challenges in predicting protein interfaces , 2015, Briefings Bioinform..

[15]  Jiye Shi,et al.  Improving B-cell epitope prediction and its application to global antibody-antigen docking , 2014, Bioinform..

[16]  Chris Bailey-Kellogg,et al.  Learning Context-aware Structural Representations to Predict Antigen and Antibody Binding Interfaces , 2019, bioRxiv.

[17]  Sudipto Saha,et al.  Prediction of continuous B‐cell epitopes in an antigen using recurrent neural network , 2006, Proteins.

[18]  Stephen P. Boyd,et al.  A Differential Equation for Modeling Nesterov's Accelerated Gradient Method: Theory and Insights , 2014, J. Mach. Learn. Res..

[19]  Chris Bailey-Kellogg,et al.  Protein interaction interface region prediction by geometric deep learning , 2021, Bioinform..

[20]  Vasant Honavar,et al.  Recent advances in B-cell epitope prediction methods , 2010, Immunome research.

[21]  N. Ando,et al.  X-rays in the Cryo-Electron Microscopy Era: Structural Biology's Dynamic Future. , 2018, Biochemistry.

[22]  Seunghoon Hong,et al.  Learning Deconvolution Network for Semantic Segmentation , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[23]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[24]  Yanay Ofran,et al.  Antibody specific epitope prediction-emergence of a new paradigm. , 2015, Current opinion in virology.

[25]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[26]  S. Scheres,et al.  How cryo-EM is revolutionizing structural biology. , 2015, Trends in biochemical sciences.

[27]  Bo Yao,et al.  Conformational B-Cell Epitope Prediction on Antigen Protein Structures: A Review of Current Algorithms and Comparison with Common Binding Site Prediction Methods , 2013, PloS one.

[28]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Antonio Lanzavecchia,et al.  Antibody-guided vaccine design: identification of protective epitopes. , 2016, Current opinion in immunology.

[30]  Y. Louzoun,et al.  Rep‐Seq: uncovering the immunological repertoire through next‐generation sequencing , 2012, Immunology.

[31]  Inbal Sela-Culang,et al.  The Structural Basis of Antibody-Antigen Recognition , 2013, Front. Immunol..

[32]  Sergey Ioffe,et al.  Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning , 2016, AAAI.

[33]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Alex Fout,et al.  Protein Interface Prediction using Graph Convolutional Networks , 2017, NIPS.

[35]  Hsin-Jung Li,et al.  Development of therapeutic antibodies for the treatment of diseases , 2020, Journal of Biomedical Science.

[36]  Petar Velickovic,et al.  Attentive cross-modal paratope prediction , 2018, J. Comput. Biol..