FraGAT: a fragment-oriented multi-scale graph attention model for molecular property prediction

Abstract Motivation Molecular property prediction is a hot topic in recent years. Existing graph-based models ignore the hierarchical structures of molecules. According to the knowledge of chemistry and pharmacy, the functional groups of molecules are closely related to its physio-chemical properties and binding affinities. So, it should be helpful to represent molecular graphs by fragments that contain functional groups for molecular property prediction. Results In this article, to boost the performance of molecule property prediction, we first propose a definition of molecule graph fragments that may be or contain functional groups, which are relevant to molecular properties, then develop a fragment-oriented multi-scale graph attention network for molecular property prediction, which is called FraGAT. Experiments on several widely used benchmarks are conducted to evaluate FraGAT. Experimental results show that FraGAT achieves state-of-the-art predictive performance in most cases. Furthermore, our case studies show that when the fragments used to represent the molecule graphs contain functional groups, the model can make better predictions. This conforms to our expectation and demonstrates the interpretability of the proposed model. Availability and implementation The code and data underlying this work are available in GitHub, at https://github.com/ZiqiaoZhang/FraGAT. Supplementary information Supplementary data are available at Bioinformatics online.

[1]  Gianni De Fabritiis,et al.  KDEEP: Protein-Ligand Absolute Binding Affinity Prediction via 3D-Convolutional Neural Networks , 2018, J. Chem. Inf. Model..

[2]  Yoshua Bengio,et al.  On the Properties of Neural Machine Translation: Encoder–Decoder Approaches , 2014, SSST@EMNLP.

[3]  Sarah L. Williams,et al.  Allosteric Inhibition of SHP2: Identification of a Potent, Selective, and Orally Efficacious Phosphatase Inhibitor. , 2016, Journal of medicinal chemistry.

[4]  Regina Barzilay,et al.  Convolutional Embedding of Attributed Molecular Graphs for Physical Property Prediction , 2017, J. Chem. Inf. Model..

[5]  Samuel S. Schoenholz,et al.  Neural Message Passing for Quantum Chemistry , 2017, ICML.

[6]  Jonathan D Hirst,et al.  Machine learning in virtual screening. , 2009, Combinatorial chemistry & high throughput screening.

[7]  Yingyu Liang,et al.  N-Gram Graph: Simple Unsupervised Representation for Graphs, with Applications to Molecules , 2018, NeurIPS.

[8]  Pietro Liò,et al.  Graph Attention Networks , 2017, ICLR.

[9]  Vijay S. Pande,et al.  Molecular graph convolutions: moving beyond fingerprints , 2016, Journal of Computer-Aided Molecular Design.

[10]  Xianghui Liu,et al.  SVM Model for Virtual Screening of Lck Inhibitors , 2009, J. Chem. Inf. Model..

[11]  Henning Sirringhaus,et al.  Fragment Graphical Variational AutoEncoding for Screening Molecules with Small Data , 2019, ArXiv.

[12]  Xiaomin Luo,et al.  Artificial intelligence in drug design , 2018, Science China Life Sciences.

[13]  Shuigeng Zhou,et al.  TOP: A Deep Mixture Representation Learning Method for Boosting Molecular Toxicity Prediction. , 2020, Methods.

[14]  A. Hopfinger,et al.  Methods for applying the quantitative structure-activity relationship paradigm. , 2004, Methods in molecular biology.

[15]  Alán Aspuru-Guzik,et al.  Convolutional Networks on Graphs for Learning Molecular Fingerprints , 2015, NIPS.

[16]  Regina Barzilay,et al.  Junction Tree Variational Autoencoder for Molecular Graph Generation , 2018, ICML.

[17]  P. Ertl,et al.  The Most Common Functional Groups in Bioactive Molecules and How Their Popularity has Evolved Over Time. , 2020, Journal of medicinal chemistry.

[18]  Yuedong Yang,et al.  Communicative Representation Learning on Attributed Molecular Graphs , 2020, IJCAI.

[19]  Sarah L. Williams,et al.  Identification of TNO155, an Allosteric SHP2 Inhibitor for the Treatment of Cancer. , 2020, Journal of medicinal chemistry.

[20]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[21]  Joseph Gomes,et al.  MoleculeNet: a benchmark for molecular machine learning† †Electronic supplementary information (ESI) available. See DOI: 10.1039/c7sc02664a , 2017, Chemical science.

[22]  G. Schneider,et al.  Rethinking drug design in the artificial intelligence era , 2019, Nature Reviews Drug Discovery.

[23]  Olgun Guvench Computational functional group mapping for drug discovery. , 2016, Drug discovery today.

[24]  M. D. Hill,et al.  Applications of Fluorine in Medicinal Chemistry. , 2015, Journal of medicinal chemistry.

[25]  Xiaocheng Li,et al.  Graph Convolution: A High-Order and Adaptive Approach , 2017, 1706.09916.

[26]  Xiaomin Luo,et al.  Pushing the boundaries of molecular representation for drug discovery with graph attention mechanism. , 2020, Journal of medicinal chemistry.