Multi-aspect candidates for repositioning: data fusion methods using heterogeneous information sources.

Drug repositioning, an innovative therapeutic application of an old drug, has received much attention as a particularly costeffective strategy in drug R&D Recent work has indicated that repositioning can be promoted by utilizing a wide range of information sources, including medicinal chemical, target, mechanism, main and side-effect-related information, and also bibliometric and taxonomical fingerprints, signatures and knowledge bases. This article describes the adaptation of a conceptually novel, more efficient approach for the identification of new possible therapeutic applications of approved drugs and drug candidates, based on a kernel-based data fusion method. This strategy includes (1) the potentially multiple representation of information sources, (2) the automated weighting and statistically optimal combination of information sources, and (3) the automated weighting of parts of the query compounds. The performance was systematically evaluated by using Anatomical Therapeutic Chemical Classification System classes in a cross-validation framework. The results confirmed that kernel-based data fusion can integrate heterogeneous information sources significantly better than standard rank-based fusion can, and this method provides a unique solution for repositioning; it can also be utilized for de novo drug discovery. The advantages of kernel-based data fusion are illustrated with examples and open problems that are particularly relevant for pharmaceutical applications.

[1]  A. Chiang,et al.  Systematic Evaluation of Drug–Disease Relationships to Identify Leads for Novel Drug Uses , 2009, Clinical pharmacology and therapeutics.

[2]  P. Bork,et al.  A side effect resource to capture phenotypic effects of drugs , 2010, Molecular systems biology.

[3]  Bernard F. Buxton,et al.  Drug Design by Machine Learning: Support Vector Machines for Pharmaceutical Data Analysis , 2001, Comput. Chem..

[4]  Tijl De Bie,et al.  Kernel-based data fusion for gene prioritization , 2007, ISMB/ECCB.

[5]  P Mátyus,et al.  Application of neural networks in structure–activity relationships , 1999, Medicinal research reviews.

[6]  G. V. Paolini,et al.  Global mapping of pharmacological space , 2006, Nature Biotechnology.

[7]  Russell Greiner,et al.  Learning and Classifying Under Hard Budgets , 2005, ECML.

[8]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[9]  Jürgen Bajorath,et al.  Molecular similarity analysis in virtual screening: foundations, limitations and novel approaches. , 2007, Drug discovery today.

[10]  Hugo Kubinyi,et al.  Similarity and Dissimilarity: A Medicinal Chemist’s View , 2002 .

[11]  Cheng Zhu,et al.  Drug repositioning for orphan diseases , 2011, Briefings Bioinform..

[12]  T. Ashburn,et al.  Drug repositioning: identifying and developing new uses for existing drugs , 2004, Nature Reviews Drug Discovery.

[13]  Robert P. W. Duin,et al.  Support vector domain description , 1999, Pattern Recognit. Lett..

[14]  Mårten Fryknäs,et al.  In Vitro Drug Sensitivity-Gene Expression Correlations Involve a Tissue of Origin Dependency , 2007, J. Chem. Inf. Model..

[15]  Sanjay Joshua Swamidass,et al.  Mining small-molecule screens to repurpose drugs , 2011, Briefings Bioinform..

[16]  Peter Willett,et al.  Similarity Searching in Files of Three-Dimensional Chemical Structures: Evaluation of the EVA Descriptor and Combination of Rankings Using Data Fusion , 1997, J. Chem. Inf. Comput. Sci..

[17]  Wei Zhao,et al.  A statistical framework to evaluate virtual screening , 2009, BMC Bioinformatics.

[18]  David J. Hand,et al.  Construction and Assessment of Classification Rules , 1997 .

[19]  Miklos Feher,et al.  Consensus scoring for protein-ligand interactions. , 2006, Drug discovery today.

[20]  Valerie J. Gillet,et al.  Analysis of Data Fusion Methods in Virtual Screening: Theoretical Model , 2006, J. Chem. Inf. Model..

[21]  Chris L Waller,et al.  Strategies to support drug discovery through integration of systems and data. , 2007, Drug discovery today.

[22]  Paul A Clemons,et al.  The Connectivity Map: Using Gene-Expression Signatures to Connect Small Molecules, Genes, and Disease , 2006, Science.

[23]  Bart De Moor,et al.  A guide to web tools to prioritize candidate genes , 2011, Briefings Bioinform..

[24]  Jason Weston,et al.  Learning Gene Functional Classifications from Multiple Data Types , 2002, J. Comput. Biol..

[25]  Nello Cristianini,et al.  Kernel-Based Data Fusion and Its Application to Protein Function Prediction in Yeast , 2003, Pacific Symposium on Biocomputing.

[26]  Jérôme Hert,et al.  New Methods for Ligand-Based Virtual Screening: Use of Data Fusion and Machine Learning to Enhance the Effectiveness of Similarity Searching , 2006, J. Chem. Inf. Model..

[27]  P. Willett,et al.  Combination of molecular similarity measures using data fusion , 2000 .

[28]  Michael J. Keiser,et al.  Relating protein pharmacology by ligand chemistry , 2007, Nature Biotechnology.

[29]  Philip E. Bourne,et al.  PROMISCUOUS: a database for network-based drug-repositioning , 2010, Nucleic Acids Res..

[30]  Gunnar Rätsch,et al.  Active Learning in the Drug Discovery Process , 2001, NIPS.

[31]  Pierre Baldi,et al.  A CROC stronger than ROC: measuring, visualizing and optimizing early retrieval , 2010, Bioinform..

[32]  M. Cataldi,et al.  Imatinib-Mesylate Blocks Recombinant T-Type Calcium Channels Expressed in Human Embryonic Kidney-293 Cells by a Protein Tyrosine Kinase-Independent Mechanism , 2004, Journal of Pharmacology and Experimental Therapeutics.

[33]  David S. Wishart,et al.  DrugBank: a knowledgebase for drugs, drug actions and drug targets , 2007, Nucleic Acids Res..

[34]  Guanghui Hu,et al.  Human Disease-Drug Network Based on Genomic Expression Profiles , 2009, PloS one.

[35]  Naomie Salim,et al.  Combination of Fingerprint-Based Similarity Coefficients Using Data Fusion , 2003, J. Chem. Inf. Comput. Sci..

[36]  Robert P. W. Duin,et al.  Support Vector Data Description , 2004, Machine Learning.

[37]  S. Patterson,et al.  Exploiting Drug Repositioning for Discovery of a Novel HIV Combination Therapy , 2010, Journal of Virology.

[38]  Vladimir Vapnik,et al.  An overview of statistical learning theory , 1999, IEEE Trans. Neural Networks.

[39]  Joel Dudley,et al.  Exploiting drug-disease relationships for computational drug repositioning , 2011, Briefings Bioinform..

[40]  A. Barabasi,et al.  The human disease network , 2007, Proceedings of the National Academy of Sciences.

[41]  Steven J. M. Jones,et al.  Drug repositioning for personalized medicine , 2012, Genome Medicine.

[42]  S. Seto,et al.  Actions of thalidomide in producing vascular relaxations. , 2010, European journal of pharmacology.

[43]  Shivani Agarwal,et al.  Ranking Chemical Structures for Drug Discovery: A New Machine Learning Approach , 2010, J. Chem. Inf. Model..

[44]  Michael J. Keiser,et al.  Predicting new molecular targets for known drugs , 2009, Nature.

[45]  George Papadatos,et al.  Evaluation of machine-learning methods for ligand-based virtual screening , 2007, J. Comput. Aided Mol. Des..

[46]  Don R. Hush,et al.  Network constraints and multi-objective optimization for one-class classification , 1996, Neural Networks.

[47]  R. Sharan,et al.  PREDICT: a method for inferring novel drug indications with application to personalized medicine , 2011, Molecular systems biology.

[48]  Eric K. Neumann,et al.  Inferring novel disease indications for known drugs by semantically linking drug action and disease mechanism relationships , 2009, BMC Bioinformatics.

[49]  Michael K. Gilson,et al.  Virtual Screening of Molecular Databases Using a Support Vector Machine , 2005, J. Chem. Inf. Model..

[50]  Peter Willett,et al.  Promoting Access to White Rose Research Papers Enhancing the Effectiveness of Ligand-based Virtual Screening Using Data Fusion , 2022 .

[51]  Steven J. M. Jones,et al.  A Computational Approach to Finding Novel Targets for Existing Drugs , 2011, PLoS Comput. Biol..

[52]  Bart De Moor,et al.  L n -norm Multiple Kernel Learning and Least Squares Support Vector Machines , 2011 .

[53]  A. Hopkins Network pharmacology: the next paradigm in drug discovery. , 2008, Nature chemical biology.

[54]  P. Bork,et al.  Drug Target Identification Using Side-Effect Similarity , 2008, Science.

[55]  Peter Willett,et al.  Analysis of Data Fusion Methods in Virtual Screening: Similarity and Group Fusion , 2006, J. Chem. Inf. Model..

[56]  Fredrik Svensson,et al.  Virtual Screening Data Fusion Using Both Structure- and Ligand-Based Methods , 2012, J. Chem. Inf. Model..

[57]  R. Tagliaferri,et al.  Discovery of drug mode of action and drug repositioning from transcriptional responses , 2010, Proceedings of the National Academy of Sciences.

[58]  Alexander A. Morgan,et al.  Discovery and Preclinical Validation of Drug Indications Using Compendia of Public Gene Expression Data , 2011, Science Translational Medicine.

[59]  John M. Barnard,et al.  Chemical Similarity Searching , 1998, J. Chem. Inf. Comput. Sci..

[60]  C. Teixeira,et al.  Vardenafil, but not sildenafil or tadalafil, has calcium‐channel blocking activity in rabbit isolated pulmonary artery and human washed platelets , 2008, British journal of pharmacology.

[61]  Chunhua Zhang,et al.  Kernel-based data fusion improves the drug-protein interaction prediction , 2011, Comput. Biol. Chem..

[62]  Peter Willett,et al.  Combination Rules for Group Fusion in Similarity‐Based Virtual Screening , 2010, Molecular informatics.

[63]  Marvin Johnson,et al.  Concepts and applications of molecular similarity , 1990 .

[64]  Nello Cristianini,et al.  A statistical framework for genomic data fusion , 2004, Bioinform..

[65]  Robert B. Russell,et al.  SuperTarget and Matador: resources for exploring drug-target relationships , 2007, Nucleic Acids Res..

[66]  Bernhard Schölkopf,et al.  Estimating the Support of a High-Dimensional Distribution , 2001, Neural Computation.

[67]  Christopher I. Bayly,et al.  Evaluating Virtual Screening Methods: Good and Bad Metrics for the "Early Recognition" Problem , 2007, J. Chem. Inf. Model..

[68]  Shein-Chung Chow,et al.  Adaptive design methods in clinical trials – a review , 2008, Orphanet journal of rare diseases.

[69]  Andreas Zell,et al.  Estimation of the applicability domain of kernel-based machine learning models for virtual screening , 2010, J. Cheminformatics.

[70]  Gunnar Rätsch,et al.  Active Learning with Support Vector Machines in the Drug Discovery Process , 2003, J. Chem. Inf. Comput. Sci..

[71]  P. Randhawa,et al.  Viral drug sensitivity testing using quantitative PCR: effect of tyrosine kinase inhibitors on polyomavirus BK replication. , 2010, American journal of clinical pathology.