Deep Learning of High-Order Interactions for Protein Interface Prediction

Protein interactions are important in a broad range of biological processes. Traditionally, computational methods have been developed to automatically predict protein interface from hand-crafted features. Recent approaches employ deep neural networks and predict the interaction of each amino acid pair independently. However, these methods do not incorporate the important sequential information from amino acid chains and the high-order pairwise interactions. Intuitively, the prediction of an amino acid pair should depend on both their features and the information of other amino acid pairs. In this work, we propose to formulate the protein interface prediction as a 2D dense prediction problem. In addition, we propose a novel deep model to incorporate the sequential information and high-order pairwise interactions to perform interface predictions. We represent proteins as graphs and employ graph neural networks to learn node features. Then we propose the sequential modeling method to incorporate the sequential information and reorder the feature matrix. Next, we incorporate high-order pairwise interactions to generate a 3D tensor containing different pairwise interactions. Finally, we employ convolutional neural networks to perform 2D dense predictions. Experimental results on multiple benchmarks demonstrate that our proposed method can consistently improve the protein interface prediction performance.

[1]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[2]  Zhiping Weng,et al.  Accelerating Protein Docking in ZDOCK Using an Advanced 3D Convolution Library , 2011, PloS one.

[3]  Huan-Xiang Zhou,et al.  Interaction-site prediction for protein complexes: a critical assessment , 2007, Bioinform..

[4]  Thomas C. Northey,et al.  IntPred: a structure-based predictor of protein–protein interaction sites , 2017, Bioinform..

[5]  Alex Fout,et al.  Protein Interface Prediction using Graph Convolutional Networks , 2017, NIPS.

[6]  Xiaolong Wang,et al.  Prediction of protein binding sites in protein structures using hidden Markov support vector machine , 2009, BMC Bioinformatics.

[7]  Jure Leskovec,et al.  Hierarchical Graph Representation Learning with Differentiable Pooling , 2018, NeurIPS.

[8]  Alfonso Valencia,et al.  Progress and challenges in predicting protein-protein interaction sites , 2008, Briefings Bioinform..

[9]  Jure Leskovec,et al.  Inductive Representation Learning on Large Graphs , 2017, NIPS.

[10]  Feihong Wu,et al.  Comparing Kernels for Predicting Protein Binding Sites from Amino Acid Sequence , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[11]  Kurt Gebruers,et al.  Identification of structural determinants for inhibition strength and specificity of wheat xylanase inhibitors TAXI‐IA and TAXI‐IIA , 2009, The FEBS journal.

[12]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[13]  Shuigeng Zhou,et al.  Prediction of protein-protein interaction sites using an ensemble method , 2009, BMC Bioinformatics.

[14]  Shuiwang Ji,et al.  Global Pixel Transformers for Virtual Staining of Microscopy Images , 2019, IEEE Transactions on Medical Imaging.

[15]  H R Drew,et al.  Structure of a B-DNA dodecamer: conformation and dynamics. , 1981, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Martin Zacharias,et al.  In silico prediction of binding sites on proteins. , 2010, Current medicinal chemistry.

[17]  Raphael A. G. Chaleil,et al.  Updates to the Integrated Protein-Protein Interaction Benchmarks: Docking Benchmark Version 5 and Affinity Benchmark Version 2. , 2015, Journal of molecular biology.

[18]  Z. Weng,et al.  Protein–protein docking benchmark 2.0: An update , 2005, Proteins.

[19]  Donald F. Towsley,et al.  Diffusion-Convolutional Neural Networks , 2015, NIPS.

[20]  Ozlem Keskin,et al.  Prediction of protein–protein interactions: unifying evolution and structure at protein interfaces , 2011, Physical biology.

[21]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[22]  Shuiwang Ji,et al.  StructPool: Structured Graph Pooling via Conditional Random Fields , 2020, ICLR.

[23]  Hao Yuan,et al.  Learning Local and Global Multi-context Representations for Document Classification , 2019, 2019 IEEE International Conference on Data Mining (ICDM).

[24]  Rishi Bedi,et al.  End-to-End Learning on 3D Protein Structure for Interface Prediction , 2019, NeurIPS.

[25]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[27]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[28]  Kristian Vlahovicek,et al.  Prediction of Protein–Protein Interaction Sites in Sequences and 3D Structures by Random Forests , 2009, PLoS Comput. Biol..

[29]  Iasonas Kokkinos,et al.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Vasant Honavar,et al.  Predicting protein-protein interface residues using local surface structural similarity , 2012, BMC Bioinformatics.

[31]  O. Schueler‐Furman,et al.  Progress in protein–protein docking: Atomic resolution predictions in the CAPRI experiment using RosettaDock with an improved treatment of side‐chain flexibility , 2005, Proteins.

[32]  L. Pauling,et al.  The structure of proteins; two hydrogen-bonded helical configurations of the polypeptide chain. , 1951, Proceedings of the National Academy of Sciences of the United States of America.

[33]  Joan Segura,et al.  A holistic in silico approach to predict functional sites in protein structures , 2012, Bioinform..

[34]  Z. Weng,et al.  Protein–protein docking benchmark version 3.0 , 2008, Proteins.

[35]  Alexandre Tkatchenko,et al.  Quantum-chemical insights from deep tensor neural networks , 2016, Nature Communications.

[36]  A. Ben-Hur,et al.  PAIRpred: Partner‐specific prediction of interacting residues from sequence and structure , 2014, Proteins.

[37]  K. Mizuguchi,et al.  Partner-Aware Prediction of Interacting Residues in Protein-Protein Complexes from Sequence Data , 2011, PloS one.

[38]  David R. Westhead,et al.  Improved prediction of protein-protein binding sites using a support vector machines approach. , 2005, Bioinformatics.

[39]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[40]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[41]  José María Carazo,et al.  BIPSPI: a method for the prediction of partner-specific protein–protein interfaces , 2018, Bioinform..

[42]  Zhiping Weng,et al.  Protein–protein docking benchmark version 4.0 , 2010, Proteins.

[43]  Alán Aspuru-Guzik,et al.  Convolutional Networks on Graphs for Learning Molecular Fingerprints , 2015, NIPS.