Curvature-based Transformer for Molecular Property Prediction

The prediction of molecular properties is one of the most important and challenging tasks in the field of artificial intelligence-based drug design. Among the current mainstream methods, the most commonly used feature representation for training DNN models is based on SMILES and molecular graphs, although these methods are concise and effective, they also limit the ability to capture spatial information. In this work, we propose Curvature-based Transformer to improve the ability of Graph Transformer neural network models to extract structural information on molecular graph data by introducing Discretization of Ricci Curvature. To embed the curvature in the model, we add the curvature information of the graph as positional Encoding to the node features during the attention-score calculation. This method can introduce curvature information from graph data without changing the original network architecture, and it has the potential to be extended to other models. We performed experiments on chemical molecular datasets including PCQM4M-LST, MoleculeNet and compared with models such as Uni-Mol, Graphormer, and the results show that this method can achieve the state-of-the-art results. It is proved that the discretized Ricci curvature also reflects the structural and functional relationship while describing the local geometry of the graph molecular data.

[1]  Jinho Chang,et al.  Bidirectional generation of structure and properties through a single molecular foundation model , 2022, Nature communications.

[2]  Yue Zhang,et al.  Application of Computational Biology and Artificial Intelligence in Drug Design , 2022, International journal of molecular sciences.

[3]  Yaopeng Li,et al.  Directed message passing neural network (D-MPNN) with graph edge attention (GEA) for property prediction of biofuel-relevant species , 2022, Energy and AI.

[4]  Yi Liao,et al.  Equiformer: Equivariant Graph Attention Transformer for 3D Atomistic Graphs , 2022, ICLR.

[5]  N. Kotov,et al.  Unifying structural descriptors for biological and bioinspired nanoscale complexes , 2022, Nature Computational Science.

[6]  Azalia Mirhoseini,et al.  Representing Long-Range Context for Graph Neural Networks with Global Attention , 2022, NeurIPS.

[7]  Francesco Di Giovanni,et al.  Understanding over-squashing and bottlenecks on graphs via curvature , 2021, ICLR.

[8]  Dragomir R. Radev,et al.  Molformer: Motif-Based Transformer on 3D Heterogeneous Molecular Graphs , 2021, AAAI.

[9]  Hua Wu,et al.  Geometry-enhanced molecular representation learning for property prediction , 2021, Nature Machine Intelligence.

[10]  Julien Mairal,et al.  GraphiT: Encoding Graph Structure in Transformers , 2021, ArXiv.

[11]  Di He,et al.  Do Transformers Really Perform Bad for Graph Representation? , 2021, ArXiv.

[12]  Dominique Beaini,et al.  Rethinking Graph Transformers with Spectral Attention , 2021, NeurIPS.

[13]  Jure Leskovec,et al.  OGB-LSC: A Large-Scale Challenge for Machine Learning on Graphs , 2021, NeurIPS Datasets and Benchmarks.

[14]  Kelin Xia,et al.  Ollivier Persistent Ricci Curvature-Based Machine Learning for the Protein-Ligand Binding Affinity Prediction , 2021, J. Chem. Inf. Model..

[15]  Xavier Bresson,et al.  A Generalization of Transformer Networks to Graphs , 2020, ArXiv.

[16]  J. Leskovec,et al.  Direct Multi-hop Attention based Graph Neural Network , 2020, ArXiv.

[17]  Wenbing Huang,et al.  GROVER: Self-supervised Message Passing Transformer on Large-scale Molecular Data , 2020, ArXiv.

[18]  Chao Chen,et al.  Curvature Graph Network , 2020, ICLR.

[19]  Tie-Yan Liu,et al.  On Layer Normalization in the Transformer Architecture , 2020, ICML.

[20]  Hongjian Li,et al.  Machine‐learning scoring functions for structure‐based drug lead optimization , 2020, WIREs Computational Molecular Science.

[21]  Jing Jiang,et al.  Interpretable Rumor Detection in Microblogs by Attending to User Interactions , 2020, AAAI.

[22]  Jiawei Zhang,et al.  Graph-Bert: Only Attention is Needed for Learning Graph Representations , 2020, ArXiv.

[23]  Deng Cai,et al.  Graph Transformer for Graph-to-Sequence Learning , 2019, AAAI.

[24]  E. Jonckheere,et al.  Ollivier-Ricci Curvature-Based Method to Community Detection in Complex Networks , 2019, Scientific Reports.

[25]  Regina Barzilay,et al.  Analyzing Learned Molecular Representations for Property Prediction , 2019, J. Chem. Inf. Model..

[26]  Frederic Sala,et al.  Learning Mixed-Curvature Representations in Product Spaces , 2018, ICLR.

[27]  Jie Gao,et al.  Network Alignment by Discrete Ollivier-Ricci Flow , 2018, GD.

[28]  Richard Socher,et al.  Multi-Hop Knowledge Graph Reasoning with Reward Shaping , 2018, EMNLP.

[29]  Yingyu Liang,et al.  N-Gram Graph: Simple Unsupervised Representation for Graphs, with Applications to Molecules , 2018, NeurIPS.

[30]  Guo-Wei Wei,et al.  Quantitative Toxicity Prediction Using Topology Based Multitask Deep Neural Networks , 2017, J. Chem. Inf. Model..

[31]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[32]  Samuel S. Schoenholz,et al.  Neural Message Passing for Quantum Chemistry , 2017, ICML.

[33]  Vijay S. Pande,et al.  MoleculeNet: a benchmark for molecular machine learning , 2017, Chemical science.

[34]  Kaitlyn M. Gayvert,et al.  A Data-Driven Approach to Predicting Successes and Failures of Clinical Trials. , 2016, Cell chemical biology.

[35]  Vijay S. Pande,et al.  Computational Modeling of β-Secretase 1 (BACE-1) Inhibitors Using Ligand Based Approaches , 2016, J. Chem. Inf. Model..

[36]  Vijay S. Pande,et al.  Molecular graph convolutions: moving beyond fingerprints , 2016, Journal of Computer-Aided Molecular Design.

[37]  J. Jost,et al.  Forman curvature for complex networks , 2016, 1603.00386.

[38]  Jie Gao,et al.  Ricci curvature of the Internet topology , 2015, 2015 IEEE Conference on Computer Communications (INFOCOM).

[39]  Luis Pinheiro,et al.  A Bayesian Approach to in Silico Blood-Brain Barrier Penetration Modeling , 2012, J. Chem. Inf. Model..

[40]  S. Yau,et al.  Ricci curvature of graphs , 2011 .

[41]  Silvere Bonnabel,et al.  Stochastic Gradient Descent on Riemannian Manifolds , 2011, IEEE Transactions on Automatic Control.

[42]  C. Villani Optimal Transport: Old and New , 2008 .

[43]  Y. Ollivier Ricci curvature of Markov chains on metric spaces , 2007, math/0701886.

[44]  Robin Forman,et al.  Bochner's Method for Cell Complexes and Combinatorial Ricci Curvature , 2003, Discret. Comput. Geom..

[45]  David Weininger,et al.  SMILES. 2. Algorithm for generation of unique SMILES notation , 1989, J. Chem. Inf. Comput. Sci..

[46]  Guolin Ke,et al.  Uni-Mol: A Universal 3D Molecular Representation Learning Framework , 2023, ICLR.

[47]  Mohammed J. Zaki,et al.  Edge-augmented Graph Transformers: Global Self-attention is Enough for Graphs , 2021, ArXiv.