EquiBind: Geometric Deep Learning for Drug Binding Structure Prediction

Predicting how a drug-like molecule binds to a specific protein target is a core problem in drug discovery. An extremely fast computational binding method would enable key applications such as fast virtual screening or drug engineering. Existing methods are computationally expensive as they rely on heavy candidate sampling coupled with scoring, ranking, and fine-tuning steps. We challenge this paradigm with EquiBind, an SE(3)-equivariant geometric deep learning model performing direct-shot prediction of both i) the receptor binding location (blind docking) and ii) the ligand’s bound pose and orientation. EquiBind achieves significant speed-ups and better quality compared to traditional and recent baselines. Further, we show extra improvements when coupling it with existing fine-tuning techniques at the cost of increased running time. Finally, we propose a novel and fast fine-tuning model that adjusts torsion angles of a ligand’s rotatable bonds based on closed-form global minima of the von Mises angular distance to a given input atomic point cloud, avoiding previous expensive differential evolution strategies for energy minimization.
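
To illustrate why a von Mises angular distance admits a closed-form global minimum, here is a minimal sketch for a single rotatable bond. It is not the paper's implementation. It assumes the objective is the sum over matched dihedral angles of 1 - cos(current_i + delta - target_i), where delta is the rotation applied around the bond. The function names `optimal_torsion_shift` and `von_mises_distance` are hypothetical. Under that assumption the sum of cosines collapses to a single cosine R cos(delta - phi), so the minimizer is the circular mean of the angle differences and no iterative search is needed.

```python
import math


def von_mises_distance(current, target, delta=0.0):
    """Sum of 1 - cos(current_i + delta - target_i) over matched dihedrals (radians)."""
    return sum(1.0 - math.cos(c + delta - t) for c, t in zip(current, target))


def optimal_torsion_shift(current, target):
    """Closed-form delta in [-pi, pi] that minimizes von_mises_distance.

    With d_i = target_i - current_i, the objective equals
    n - (C cos(delta) + S sin(delta)), where C = sum cos(d_i) and
    S = sum sin(d_i). That is n - R cos(delta - phi) with phi = atan2(S, C),
    so the global minimum is at delta = phi.
    If C and S are both zero, every delta is equally good and 0.0 is returned.
    """
    sin_sum = sum(math.sin(t - c) for c, t in zip(current, target))
    cos_sum = sum(math.cos(t - c) for c, t in zip(current, target))
    return math.atan2(sin_sum, cos_sum)


if __name__ == "__main__":
    # Hypothetical dihedral angles (radians) that all depend on one bond.
    current = [0.1, 1.3, -2.0]
    target = [0.9, 1.8, -1.0]
    delta = optimal_torsion_shift(current, target)
    print("shift:", delta)
    print("distance before:", von_mises_distance(current, target))
    print("distance after: ", von_mises_distance(current, target, delta))
```

Because the minimizer is a single `atan2` call per rotatable bond, the cost is linear in the number of matched dihedrals, in contrast to the sampling-based differential evolution the abstract mentions.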
