论文信息 - Predicting drug-target interactions from chemical and genomic kernels using Bayesian matrix factorization

Predicting drug-target interactions from chemical and genomic kernels using Bayesian matrix factorization

MOTIVATION Identifying interactions between drug compounds and target proteins has a great practical importance in the drug discovery process for known diseases. Existing databases contain very few experimentally validated drug-target interactions and formulating successful computational methods for predicting interactions remains challenging. RESULTS In this study, we consider four different drug-target interaction networks from humans involving enzymes, ion channels, G-protein-coupled receptors and nuclear receptors. We then propose a novel Bayesian formulation that combines dimensionality reduction, matrix factorization and binary classification for predicting drug-target interaction networks using only chemical similarity between drug compounds and genomic similarity between target proteins. The novelty of our approach comes from the joint Bayesian formulation of projecting drug compounds and target proteins into a unified subspace using the similarities and estimating the interaction network in that subspace. We propose using a variational approximation in order to obtain an efficient inference scheme and give its detailed derivations. Finally, we demonstrate the performance of our proposed method in three different scenarios: (i) exploratory data analysis using low-dimensional projections, (ii) predicting interactions for the out-of-sample drug compounds and (iii) predicting unknown interactions of the given network. AVAILABILITY Software and Supplementary Material are available at http://users.ics.aalto.fi/gonen/kbmf2k. CONTACT mehmet.gonen@aalto.fi SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

Mehmet Gönen | M. Gönen

[1] Anthony Widjaja,et al. Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2003, IEEE Transactions on Neural Networks.

[2] M S Waterman,et al. Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[3] Neil D. Lawrence,et al. Semi-supervised Learning via Gaussian Processes , 2004, NIPS.

[4] Jens Sadowski,et al. Comparison of Support Vector Machine and Artificial Neural Network Systems for Drug/Nondrug Classification , 2003, J. Chem. Inf. Comput. Sci..

[5] Robert B. Russell,et al. SuperTarget and Matador: resources for exploring drug-target relationships , 2007, Nucleic Acids Res..

[6] Yoshihiro Yamanishi,et al. Supervised prediction of drug–target interactions using bipartite local models , 2009, Bioinform..

[7] Hiroshi Mamitsuka,et al. A probabilistic model for mining implicit 'chemical compound-gene' relations from literature , 2005, ECCB/JBI.

[8] Adrian F. M. Smith,et al. Sampling-Based Approaches to Calculating Marginal Densities , 1990 .

[9] David S. Wishart,et al. DrugBank 3.0: a comprehensive resource for ‘Omics’ research on drugs , 2010, Nucleic Acids Res..

[10] Bernhard Schölkopf,et al. Kernel Methods in Computational Biology , 2005 .

[11] John P. Overington,et al. ChEMBL: a large-scale bioactivity database for drug discovery , 2011, Nucleic Acids Res..