Computational Study of Drugs by Integrating Omics Data with Kernel Methods

With the rapid development of genomic and chemogenomic techniques, many omics data sources for drugs have been publicly available. These data sources illustrate drug’s biological function in the living cell from different levels and different aspects. One straightforward idea is to learn understandable rules via computational models and algorithms to mine and integrate these data sources. Here, we review our recent efforts on developing kernel‐based methods to integrate drug related omics data sources. Three promising applications of our framework are shown to predict drug targets, assign drug’s ATC‐code annotation, and reveal drug repositioning. We demonstrate that data integration does provide more information and improve the accuracy by recovering more experimentally observed target proteins, ATC‐codes, and drug repositioning. Importantly, data integration can indicate novel predictions which are supported by database search and functional annotation analysis and worthy of further experimental validation. In conclusion, kernel methods can efficiently integrate heterogeneous data sources to computationally study drugs, and will promote the further research in drug discovery in a low‐cost way.

[1]  Yoshihiro Yamanishi,et al.  Drug-target interaction prediction from chemical, genomic and pharmacological data in an integrated framework , 2010, Bioinform..

[2]  William Stafford Noble,et al.  Kernel methods for predicting protein-protein interactions , 2005, ISMB.

[3]  Chunhua Zhang,et al.  Kernel-based data fusion improves the drug-protein interaction prediction , 2011, Comput. Biol. Chem..

[4]  Hunter B. Fraser,et al.  Using protein complexes to predict phenotypic effects of gene mutation , 2007, Genome Biology.

[5]  Kiyoko F. Aoki-Kinoshita,et al.  From genomics to chemical genomics: new developments in KEGG , 2005, Nucleic Acids Res..

[6]  Joel Dudley,et al.  Exploiting drug-disease relationships for computational drug repositioning , 2011, Briefings Bioinform..

[7]  David S. Wishart,et al.  DrugBank: a knowledgebase for drugs, drug actions and drug targets , 2007, Nucleic Acids Res..

[8]  Xiaobo Zhou,et al.  Semi-supervised drug-protein interaction prediction from heterogeneous biological spaces , 2010, BMC Systems Biology.

[9]  Yong Wang,et al.  Network predicting drug's anatomical therapeutic chemical code , 2013, Bioinform..

[10]  Alex Smola,et al.  Kernel methods in machine learning , 2007, math/0701907.

[11]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[12]  Patrick Aloy,et al.  Recycling side-effects into clinical markers for drug repositioning , 2012, Genome Medicine.

[13]  M. Kanehisa,et al.  Development of a chemical structure comparison method for integrated analysis of chemical and genomic information in the metabolic pathways. , 2003, Journal of the American Chemical Society.

[14]  Gary D Bader,et al.  BIND--The Biomolecular Interaction Network Database. , 2001, Nucleic acids research.

[15]  R. Sharan,et al.  PREDICT: a method for inferring novel drug indications with application to personalized medicine , 2011, Molecular systems biology.

[16]  Michael Gribskov,et al.  Use of Receiver Operating Characteristic (ROC) Analysis to Evaluate Sequence Matching , 1996, Comput. Chem..

[17]  Yoshihiro Yamanishi,et al.  Predicting drug side-effect profiles: a chemical fragment-based approach , 2011, BMC Bioinformatics.

[18]  Yong Wang,et al.  Computationally Probing Drug-Protein Interactions Via Support Vector Machine , 2010 .

[19]  Shiwen Zhao,et al.  Network-Based Relating Pharmacological and Genomic Spaces for Drug Target Identification , 2010, PloS one.

[20]  G. Vriend,et al.  A text-mining analysis of the human phenome , 2006, European Journal of Human Genetics.

[21]  P. Bork,et al.  Large‐scale prediction of drug–target relationships , 2008, FEBS letters.

[22]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[23]  Yu-Dong Cai,et al.  Prediction of protein function in the absence of significant sequence similarity. , 2004, Current medicinal chemistry.

[24]  Michael Q. Zhang,et al.  Network-based global inference of human disease genes , 2008, Molecular systems biology.

[25]  Sandhya Rani,et al.  Human Protein Reference Database—2009 update , 2008, Nucleic Acids Res..

[26]  P. Bork,et al.  Drug Target Identification Using Side-Effect Similarity , 2008, Science.

[27]  Luonan Chen,et al.  Network-based drug repositioning. , 2013, Molecular bioSystems.

[28]  Bernhard Schölkopf,et al.  Kernel Methods in Computational Biology , 2005 .

[29]  J. Whisstock,et al.  Prediction of protein function from protein sequence and structure , 2003, Quarterly Reviews of Biophysics.

[30]  Antje Chang,et al.  BRENDA , the enzyme database : updates and major new developments , 2003 .

[31]  Robert B. Russell,et al.  SuperTarget and Matador: resources for exploring drug-target relationships , 2007, Nucleic Acids Res..

[32]  Igor Jurisica,et al.  Online Predicted Human Interaction Database , 2005, Bioinform..

[33]  Kriston L. McGary,et al.  Open Access Method , 2007 .

[34]  Stefan Günther,et al.  SuperPred: drug classification and target prediction , 2008, Nucleic Acids Res..

[35]  Yoshihiro Yamanishi,et al.  Prediction of drug–target interaction networks from the integration of chemical and genomic spaces , 2008, ISMB.

[36]  Pankaj Agarwal,et al.  Systematic Drug Repositioning Based on Clinical Side-Effects , 2011, PloS one.