Predicting Unknown Interactions Between Known Drugs and Targets via Matrix Completion

Drug-target interactions map patterns, associations and relationships between drugs and target proteins. Identifying interactions between drug and target is critical in drug discovery, but biochemically validating these interactions are both laborious and expensive. In this paper, we propose a novel interaction profiles based method to predict potential drug-target interactions by using matrix completion. Our method first arranges the drug-target interactions in a matrix, whose entries include interaction pairs, non-interaction pairs and undetermined pairs, and finds its approximation matrix which contains the predicted values at undetermined positions. Then our method learns an approximation matrix by minimizing the distance between the drug-target interaction matrix and its approximation subject that the values in the observed positions equal to the known interactions at the corresponding positions. As a consequence, our method can directly predict new potential interactions according to the high values at the undetermined positions. We evaluated our method by comparing against five counterpart methods on “gold standard” datasets. Our method outperforms the counterparts, and achieves high AUC and \(F_1\)-score on enzyme, ion channel, GPCR, nuclear receptor and integrated datasets, respectively. We showed the intelligibility of our method by validating some predicted interactions in both DrugBank and KEGG databases.

[1]  Yoshihiro Yamanishi,et al.  Supervised prediction of drug–target interactions using bipartite local models , 2009, Bioinform..

[2]  Hiroki Kobayashi,et al.  Integrating Statistical Predictions and Experimental Verifications for Enhancing Protein-Chemical Interaction Predictions in Virtual Screening , 2009, PLoS Comput. Biol..

[3]  J. Irwin,et al.  Lead discovery using molecular docking. , 2002, Current opinion in chemical biology.

[4]  Elena Marchiori,et al.  Gaussian interaction profile kernels for predicting drug-target interaction , 2011, Bioinform..

[5]  T. Ashburn,et al.  Drug repositioning: identifying and developing new uses for existing drugs , 2004, Nature Reviews Drug Discovery.

[6]  David S. Wishart,et al.  DrugBank: a knowledgebase for drugs, drug actions and drug targets , 2007, Nucleic Acids Res..

[7]  Hiroshi Mamitsuka,et al.  A probabilistic model for mining implicit 'chemical compound-gene' relations from literature , 2005, ECCB/JBI.

[8]  Emmanuel J. Candès,et al.  A Singular Value Thresholding Algorithm for Matrix Completion , 2008, SIAM J. Optim..

[9]  Stuart L. Schreiber,et al.  Dissecting glucose signalling with diversity-oriented synthesis and small-molecule microarrays , 2002, Nature.

[10]  Antje Chang,et al.  BRENDA , the enzyme database : updates and major new developments , 2003 .

[11]  H. Yabuuchi,et al.  Analysis of multiple compound–protein interactions reveals novel bioactive molecules , 2011, Molecular systems biology.

[12]  Michael J. Keiser,et al.  Relating protein pharmacology by ligand chemistry , 2007, Nature Biotechnology.

[13]  J. Ballesteros,et al.  G protein-coupled receptor drug discovery: implications from the crystal structure of rhodopsin. , 2001, Current opinion in drug discovery & development.

[14]  Yoshihiro Yamanishi,et al.  Drug-target interaction prediction from chemical, genomic and pharmacological data in an integrated framework , 2010, Bioinform..

[15]  Daniel R. Caffrey,et al.  Structure-based maximal affinity model predicts small-molecule druggability , 2007, Nature Biotechnology.

[16]  Chuang Liu,et al.  Prediction of Drug-Target Interactions and Drug Repositioning via Network-Based Inference , 2012, PLoS Comput. Biol..

[17]  Mehmet Gönen,et al.  Predicting drug-target interactions from chemical and genomic kernels using Bayesian matrix factorization , 2012, Bioinform..

[18]  Chang Liu,et al.  Predicting Drug–Target Interactions Using Probabilistic Matrix Factorization , 2013, J. Chem. Inf. Model..

[19]  Xiaobo Zhou,et al.  Semi-supervised drug-protein interaction prediction from heterogeneous biological spaces , 2010, BMC Systems Biology.

[20]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[21]  Hao Ding,et al.  Similarity-based machine learning methods for predicting drug-target interactions: a brief review , 2014, Briefings Bioinform..

[22]  Kiyoko F. Aoki-Kinoshita,et al.  From genomics to chemical genomics: new developments in KEGG , 2005, Nucleic Acids Res..

[23]  Yasubumi Sakakibara,et al.  Statistical prediction of protein-chemical interactions based on chemical structure and mass spectrometry data , 2007, Bioinform..

[24]  Damian Szklarczyk,et al.  STITCH 3: zooming in on protein–chemical interactions , 2011, Nucleic Acids Res..

[25]  D. Bojanic,et al.  Keynote review: in vitro safety pharmacology profiling: an essential tool for successful drug development. , 2005, Drug discovery today.

[26]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[27]  YamanishiYoshihiro,et al.  Prediction of drug–target interaction networks from the integration of chemical and genomic spaces , 2008 .

[28]  Yuhao Wang,et al.  Predicting drug-target interactions using restricted Boltzmann machines , 2013, Bioinform..

[29]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[30]  Andrew L. Hopkins,et al.  Drug discovery: Predicting promiscuity , 2009, Nature.

[31]  J. Bajorath,et al.  Compound promiscuity: what can we learn from current data? , 2013, Drug discovery today.

[32]  Robert B. Russell,et al.  SuperTarget and Matador: resources for exploring drug-target relationships , 2007, Nucleic Acids Res..

[33]  Thomas Lengauer,et al.  A fast flexible docking method using an incremental construction algorithm. , 1996, Journal of molecular biology.

[34]  Charles J. Manly,et al.  The impact of informatics and computational chemistry on synthesis and screening. , 2001, Drug discovery today.

[35]  J. Irwin,et al.  Docking and chemoinformatic screens for new ligands and targets. , 2009, Current opinion in biotechnology.

[36]  D. Butina,et al.  Predicting ADME properties in silico: methods and models. , 2002, Drug discovery today.

[37]  Ruth Nussinov,et al.  Principles of docking: An overview of search algorithms and a guide to scoring functions , 2002, Proteins.

[38]  Dimitri P. Bertsekas,et al.  Nonlinear Programming , 1997 .

[39]  Sanjay Joshua Swamidass,et al.  Mining small-molecule screens to repurpose drugs , 2011, Briefings Bioinform..

[40]  Jens Sadowski,et al.  Comparison of Support Vector Machine and Artificial Neural Network Systems for Drug/Nondrug Classification , 2003, J. Chem. Inf. Comput. Sci..

[41]  Jean-Philippe Vert,et al.  Protein-ligand interaction prediction: an improved chemogenomics approach , 2008, Bioinform..

[42]  Gerhard Hessler,et al.  Drug Design Strategies for Targeting G‐Protein‐Coupled Receptors , 2002, Chembiochem : a European journal of chemical biology.

[43]  T. Klabunde Chemogenomic approaches to drug discovery: similar receptors bind similar ligands , 2007, British journal of pharmacology.

[44]  S. Haggarty,et al.  Multidimensional chemical genetic analysis of diversity-oriented synthesis-derived deacetylase inhibitors using cell-based assays. , 2003, Chemistry & biology.

[45]  Yoshihiro Yamanishi,et al.  Prediction of drug–target interaction networks from the integration of chemical and genomic spaces , 2008, ISMB.

[46]  Laetitia Martin-Chanas,et al.  Identify drug repurposing candidates by mining the Protein Data Bank , 2011, Briefings Bioinform..

[47]  Yibo Wu,et al.  GOSemSim: an R package for measuring semantic similarity among GO terms and gene products , 2010, Bioinform..

[48]  G. Golub,et al.  Inexact and preconditioned Uzawa algorithms for saddle point problems , 1994 .

[49]  Pierre Acklin,et al.  Similarity Metrics for Ligands Reflecting the Similarity of the Target Proteins , 2003, J. Chem. Inf. Comput. Sci..

[50]  Joel Dudley,et al.  Exploiting drug-disease relationships for computational drug repositioning , 2011, Briefings Bioinform..

[51]  Hao Ding,et al.  Collaborative matrix factorization with multiple similarities for predicting drug-target interactions , 2013, KDD.

[52]  Brian K. Shoichet,et al.  Molecular docking using shape descriptors , 1992 .