Computational/in silico methods in drug target and lead prediction

Drug-like compounds are frequently denied approval because of unexpected side effects and cross-reactivity observed during clinical trials. These unexpected outcomes, which significantly increase the attrition rate, center on the selected drug targets. Such targets may be disease candidate proteins or genes, biological pathways, disease-associated microRNAs, disease-related biomarkers, abnormal molecular phenotypes, crucial nodes of biological networks, or molecular functions. The failures are generally linked to several factors, including incomplete knowledge of the drug targets and unpredicted pharmacokinetic behavior upon target interaction or off-target effects. Identifying and validating drug targets of interest, especially for polygenic diseases, is the fundamental stage on which downstream processes depend, and it remains a major bottleneck in drug development. Various computational methods have therefore been developed to complement experimental approaches in drug discovery. Here, we present an overview of computational methods and tools applied to predicting or validating drug targets and drug-like molecules. We outline their advantages and compare them to identify those most likely to yield optimal results. We also explore major sources of drug failure, considering the challenges and opportunities involved. This review may guide researchers in selecting the most efficient approach or technique during the computational drug discovery process.
