In Silico target fishing: addressing a “Big Data” problem by ligand-based similarity rankings with data fusion

BackgroundLigand-based in silico target fishing can be used to identify the potential interacting target of bioactive ligands, which is useful for understanding the polypharmacology and safety profile of existing drugs. The underlying principle of the approach is that known bioactive ligands can be used as reference to predict the targets for a new compound.ResultsWe tested a pipeline enabling large-scale target fishing and drug repositioning, based on simple fingerprint similarity rankings with data fusion. A large library containing 533 drug relevant targets with 179,807 active ligands was compiled, where each target was defined by its ligand set. For a given query molecule, its target profile is generated by similarity searching against the ligand sets assigned to each target, for which individual searches utilizing multiple reference structures are then fused into a single ranking list representing the potential target interaction profile of the query compound. The proposed approach was validated by 10-fold cross validation and two external tests using data from DrugBank and Therapeutic Target Database (TTD). The use of the approach was further demonstrated with some examples concerning the drug repositioning and drug side-effects prediction. The promising results suggest that the proposed method is useful for not only finding promiscuous drugs for their new usages, but also predicting some important toxic liabilities.ConclusionsWith the rapid increasing volume and diversity of data concerning drug related targets and their ligands, the simple ligand-based target fishing approach would play an important role in assisting future drug design and discovery.

[1]  Kai Huang,et al.  PharmMapper server: a web server for potential drug target identification using pharmacophore mapping approach , 2010, Nucleic Acids Res..

[2]  Jian Zhang,et al.  Peptide deformylase is a potential target for anti‐Helicobacter pylori drugs: Reverse docking, enzymatic assay, and X‐ray crystallography validation , 2006, Protein science : a publication of the Protein Society.

[3]  J. A. Grant,et al.  A shape-based 3-D scaffold hopping method and its application to a bacterial protein-protein interaction. , 2005, Journal of medicinal chemistry.

[4]  Anna Vulpetti,et al.  Predicting Polypharmacology by Binding Site Similarity: From Kinases to the Protein Universe , 2010, J. Chem. Inf. Model..

[5]  G. V. Paolini,et al.  Global mapping of pharmacological space , 2006, Nature Biotechnology.

[6]  Jérôme Hert,et al.  New Methods for Ligand-Based Virtual Screening: Use of Data Fusion and Machine Learning to Enhance the Effectiveness of Similarity Searching , 2006, J. Chem. Inf. Model..

[7]  Hong Liu,et al.  Computational Screening for Active Compounds Targeting Protein Sequences: Methodology and Experimental Validation , 2011, J. Chem. Inf. Model..

[8]  Jean-Philippe Vert,et al.  Protein-ligand interaction prediction: an improved chemogenomics approach , 2008, Bioinform..

[9]  D. Roden Drug-induced prolongation of the QT interval. , 2004, The New England journal of medicine.

[10]  Darren R. Flower,et al.  On the Properties of Bit String-Based Measures of Chemical Similarity , 1998, J. Chem. Inf. Comput. Sci..

[11]  Massih-Reza Amini,et al.  A boosting algorithm for learning bipartite ranking functions with partially labeled data , 2008, SIGIR '08.

[12]  M. Sanguinetti,et al.  hERG potassium channels and cardiac arrhythmia , 2006, Nature.

[13]  Anders Wallqvist,et al.  Exploring Polypharmacology Using a ROCS-Based Target Fishing Approach , 2012, J. Chem. Inf. Model..

[14]  David Rogers,et al.  Extended-Connectivity Fingerprints , 2010, J. Chem. Inf. Model..

[15]  Panagiotis Korantzopoulos,et al.  Drug-induced prolongation of the QT interval. , 2004, The New England journal of medicine.

[16]  Lin He,et al.  DRAR-CPI: a server for identifying drug repositioning potential and adverse drug reactions via the chemical–protein interactome , 2011, Nucleic Acids Res..

[17]  John M. Barnard,et al.  Chemical Similarity Searching , 1998, J. Chem. Inf. Comput. Sci..

[18]  Michael J. Keiser,et al.  Relating protein pharmacology by ligand chemistry , 2007, Nature Biotechnology.

[19]  J. Kornhuber,et al.  Memantine in moderate-to-severe Alzheimer's disease. , 2003, The New England journal of medicine.

[20]  Peter Willett,et al.  Enhancing the Effectiveness of Virtual Screening by Fusing Nearest Neighbor Lists: A Comparison of Similarity Coefficients , 2004, J. Chem. Inf. Model..

[21]  David S. Wishart,et al.  DrugBank 3.0: a comprehensive resource for ‘Omics’ research on drugs , 2010, Nucleic Acids Res..

[22]  Xin Wen,et al.  BindingDB: a web-accessible database of experimentally determined protein–ligand binding affinities , 2006, Nucleic Acids Res..

[23]  Xiaomin Luo,et al.  PDTD: a web-accessible protein database for drug target identification , 2008, BMC Bioinformatics.

[24]  R. J. Eden,et al.  Preclinical pharmacology of ropinirole (SK&F 101468-A) a novel dopamine D2 agonist , 1991, Pharmacology Biochemistry and Behavior.

[25]  K. Shokat,et al.  Targeting the cancer kinome through polypharmacology , 2010, Nature Reviews Cancer.

[26]  J. Isaacsohn,et al.  Colesevelam hydrochloride (cholestagel): a new, potent bile acid sequestrant associated with a low incidence of gastrointestinal side effects. , 1999, Archives of internal medicine.

[27]  Andrew Simon Bell,et al.  Sildenafil (VIAGRATM), a potent and selective inhibitor of type 5 cGMP phosphodiesterase with utility for the treatment of male erectile dysfunction , 1996 .

[28]  Michael J. Keiser,et al.  Off-target networks derived from ligand set similarity. , 2009, Methods in molecular biology.

[29]  C. Ung,et al.  Can an in silico drug-target search method be used to probe potential mechanisms of medicinal plant ingredients? , 2003, Natural product reports.

[30]  D. Tompson,et al.  Steady-state pharmacokinetic properties of a 24-hour prolonged-release formulation of ropinirole: results of two randomized studies in patients with Parkinson's disease. , 2007, Clinical therapeutics.

[31]  John P. Overington,et al.  ChEMBL: a large-scale bioactivity database for drug discovery , 2011, Nucleic Acids Res..

[32]  Andreas Bender,et al.  From in silico target prediction to multi-target drug design: current databases, methods and applications. , 2011, Journal of proteomics.

[33]  P. Willett,et al.  Combination of molecular similarity measures using data fusion , 2000 .

[34]  Jérôme Hert,et al.  Comparison of Fingerprint-Based Methods for Virtual Screening Using Multiple Bioactive Reference Structures , 2004, J. Chem. Inf. Model..

[35]  S. Lukas,et al.  Buprenorphine treatment of refractory depression. , 1995, Journal of clinical psychopharmacology.

[36]  George Papadatos,et al.  The ChEMBL bioactivity database: an update , 2013, Nucleic Acids Res..

[37]  Yanli Wang,et al.  Identifying Compound-Target Associations by Combining Bioactivity Profile Similarity Search and Public Databases Mining , 2011, J. Chem. Inf. Model..

[38]  Yang Song,et al.  Therapeutic target database update 2012: a resource for facilitating target-oriented drug discovery , 2011, Nucleic Acids Res..

[39]  T. Ashburn,et al.  Drug repositioning: identifying and developing new uses for existing drugs , 2004, Nature Reviews Drug Discovery.

[40]  Dong Wang,et al.  The relationship between rational drug design and drug side effects , 2012, Briefings Bioinform..

[41]  R. Iyengar,et al.  Systems approaches to polypharmacology and drug discovery. , 2010, Current opinion in drug discovery & development.

[42]  Didier Rognan,et al.  Structure‐Based Approaches to Target Fishing and Ligand Profiling , 2010, Molecular informatics.

[43]  Michael J. Keiser,et al.  Predicting new molecular targets for known drugs , 2009, Nature.

[44]  Honglin Li,et al.  [6]-Gingerol suppresses colon cancer growth by targeting leukotriene A4 hydrolase. , 2009, Cancer research.