A Drug Decision Support System for Developing A Successful Drug Candidate Using Machine Learning Techniques.

BACKGROUND Computer-aided data mining methods and sophisticated softwares are used to examine candidate molecules during the drug discovery process. Data mining and machine learning are effective tools in leveraging the drug datasets. OBJECTIVE Developed model in this study can be used as a simple filter in drug modelling to eliminate potentially inappropriate molecules in the early stages. In this work we developed a Drug Decision Support System (DDSS), in which these tools are used to induce classification models, association rules and subgraph patterns. DDSS helps drug designers to develop a successful drug candidate. METHODS Molecular descriptors that are effective in classification models were identified for determination of a number of rules in drug molecules. They are derived using ADRIANA.Code program and Lipinski's rule of five. Closed frequent molecular structures in the form of subgraph fragments were also obtained with Gaston algorithm. Gaston algorithm, included in ParMol Package (Parallel Molecular Mining) was used to find common molecular fragments in the drug datasets. WEKA machine learning tool version 3.6.11 and MATLAB software package (MATLAB & SIMULINK, R2015a) were used as tools for this study. RESULTS We observed that TPSA, XlogP Natoms, HDon_O and TPSA are the most distinctive features in the pool of the molecular descriptors. Cardiac therapy, anti-epileptics and anti-parkinson drugs with approved and withdrawn drugs are identified and related databases are screened to obtain datasets for experimentation. CONCLUSION The experimental evaluation shows that the system is promising at determination of potential drug molecules to classify drug molecules correctly according to the types of diseases.

[1]  Dimitrios I. Fotiadis,et al.  Automated Diagnosis of Coronary Artery Disease Based on Data Mining and Fuzzy Modeling , 2008, IEEE Transactions on Information Technology in Biomedicine.

[2]  Doheon Lee,et al.  Combining Neuroinformatics Databases for Multi-Level Analysis of Brain Disorders , 2012 .

[3]  Guang Ping Cao,et al.  Classification of HDAC8 Inhibitors and Non-Inhibitors Using Support Vector Machines , 2012 .

[4]  J. Panteleev,et al.  Recent applications of machine learning in medicinal chemistry. , 2018, Bioorganic & medicinal chemistry letters.

[5]  Bernard F. Buxton,et al.  Drug Design by Machine Learning: Support Vector Machines for Pharmaceutical Data Analysis , 2001, Comput. Chem..

[6]  Cheng Luo,et al.  In silico ADME/T modelling for rational drug design , 2015, Quarterly Reviews of Biophysics.

[7]  John B. O. Mitchell Machine learning methods in chemoinformatics , 2014, Wiley interdisciplinary reviews. Computational molecular science.

[8]  Adriano D Andricopulo,et al.  ADMET modeling approaches in drug discovery. , 2019, Drug discovery today.

[9]  G. Pizzolato,et al.  Trace amine metabolism in Parkinson's disease: Low circulating levels of octopamine in early disease stages , 2010, Neuroscience Letters.

[10]  Nigel Greene,et al.  Finding the rules for successful drug optimisation. , 2014, Drug discovery today.

[11]  Padmakumari K. N. Anooj,et al.  Clinical decision support system: risk level prediction of heart disease using weighted fuzzy rules and decision tree rules , 2011, Central European Journal of Computer Science.

[12]  David Vidal,et al.  Large-Scale Predictive Drug Safety: From Structural Alerts to Biological Mechanisms. , 2015, Chemical research in toxicology.

[13]  M. Khan Cardiac Drug Therapy , 1992 .

[14]  Michael T. M. Emmerich,et al.  Improving the drug discovery process by using multiple classifier systems , 2019, Expert Syst. Appl..

[15]  Russ B Altman,et al.  Machine learning in chemoinformatics and drug discovery. , 2018, Drug discovery today.

[16]  F. Lombardo,et al.  Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings , 1997 .

[17]  Shu-Hsien Liao,et al.  Data mining techniques and applications - A decade review from 2000 to 2011 , 2012, Expert Syst. Appl..

[18]  R. Evens Drug and Biological Development , 2007 .

[19]  Ying Liu,et al.  A Comparative Study on Feature Selection Methods for Drug Discovery , 2004, J. Chem. Inf. Model..

[20]  S. Shakir,et al.  An investigation into drug products withdrawn from the EU market between 2002 and 2011 for safety reasons and the evidence used to support the decision-making , 2014, BMJ Open.

[21]  Barrie Wilkinson,et al.  Drug discovery beyond the 'rule-of-five'. , 2007, Current opinion in biotechnology.

[22]  Iskander Yusof,et al.  Considering the impact drug-like properties have on the chance of success. , 2013, Drug discovery today.

[23]  E. Ansorena,et al.  Drug development in Parkinson's disease: from emerging molecules to innovative drug delivery systems. , 2013, Maturitas.

[24]  Christopher J. C. Burges,et al.  A Tutorial on Support Vector Machines for Pattern Recognition , 1998, Data Mining and Knowledge Discovery.

[25]  Joost N. Kok,et al.  A quickstart in frequent structure mining can make a difference , 2004, KDD.

[26]  J. Chen,et al.  Predicting adverse side effects of drugs , 2011, BMC Genomics.

[27]  S. Siva Sathya,et al.  Evolutionary algorithms for de novo drug design - A survey , 2015, Appl. Soft Comput..

[28]  Cheng Luo,et al.  Computational methods for drug design and discovery: focus on China , 2013, Trends in Pharmacological Sciences.

[29]  George Papadatos,et al.  Evaluation of machine-learning methods for ligand-based virtual screening , 2007, J. Comput. Aided Mol. Des..

[30]  J. Montastruc,et al.  The nature of the scientific evidence leading to drug withdrawals for pharmacovigilance reasons in France , 2006, Pharmacoepidemiology and drug safety.

[31]  Sean Ekins,et al.  Application of data mining approaches to drug delivery. , 2006, Advanced drug delivery reviews.

[32]  Serge N. Demidenko,et al.  Multivariate alternating decision trees , 2016, Pattern Recognit..

[33]  Xin Chen,et al.  Effect of Molecular Descriptor Feature Selection in Support Vector Machine Classification of Pharmacokinetic and Toxicological Properties of Chemical Agents , 2004, J. Chem. Inf. Model..

[34]  C. Lipinski Drug-like properties and the causes of poor solubility and poor permeability. , 2000, Journal of pharmacological and toxicological methods.

[35]  J. Aronson,et al.  Delays in the post-marketing withdrawal of drugs to which deaths have been attributed: a systematic investigation and analysis , 2015, BMC Medicine.

[36]  M. Peppelenbosch,et al.  Biological effects of propionic acid in humans; metabolism, potential applications and underlying mechanisms. , 2010, Biochimica et biophysica acta.

[37]  Anti-epileptic drugs: a guide for the non-neurologist. , 2010, Clinical medicine.

[38]  J. Aronson,et al.  Post-marketing withdrawal of 462 medicinal products because of adverse drug reactions: a systematic review of the world literature , 2016, BMC Medicine.

[39]  J. Drews Drug discovery: a historical perspective. , 2000, Science.

[40]  Ichigaku Takigawa,et al.  Graph mining: procedure, application to drug discovery and recent advances. , 2013, Drug discovery today.

[41]  Pietro Liò,et al.  DrugClust: A machine learning approach for drugs side effects prediction , 2017, Comput. Biol. Chem..

[42]  A. Fliri,et al.  Analysis of drug-induced effect patterns to link structure and side effects of medicines , 2005, Nature chemical biology.

[43]  Jürgen Bajorath,et al.  Selected Concepts and Investigations in Compound Classification, Molecular Descriptor Analysis, and Virtual Screening , 2001, J. Chem. Inf. Comput. Sci..

[44]  Tingjun Hou,et al.  ADME Evaluation in Drug Discovery, 8. The Prediction of Human Intestinal Absorption by a Support Vector Machine , 2007, J. Chem. Inf. Model..

[45]  C. Codina,et al.  The use of evidence in pharmacovigilance , 2001, European Journal of Clinical Pharmacology.

[46]  Thomas Blaschke,et al.  The rise of deep learning in drug discovery. , 2018, Drug discovery today.

[47]  T. Meinl,et al.  The ParMol Package for Frequent Subgraph Mining , 2007, Electron. Commun. Eur. Assoc. Softw. Sci. Technol..

[48]  Yoshihiro Yamanishi,et al.  Predicting drug side-effect profiles: a chemical fragment-based approach , 2011, BMC Bioinformatics.

[49]  S. Shakir,et al.  An Assessment of the Publicly Disseminated Evidence of Safety Used in Decisions to Withdraw Medicinal Products from the UK and US Markets , 2006, Drug safety.

[50]  Jon Atli Benediktsson,et al.  Automatic selection of molecular descriptors using random forest: Application to drug discovery , 2017, Expert Syst. Appl..

[51]  M. Bialer,et al.  Pharmacokinetic analysis and anticonvulsant activity of glycine and glycinamide derivatives , 1999, Epilepsy Research.

[52]  A. Agustí,et al.  Obstacles and solutions for spontaneous reporting of adverse drug reactions in the hospital. , 2005, British journal of clinical pharmacology.

[53]  Thomas Sander,et al.  Toxicity-Indicating Structural Patterns , 2006, J. Chem. Inf. Model..

[54]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[55]  Hua Zou,et al.  Predicting potential side effects of drugs by recommender methods and ensemble learning , 2016, Neurocomputing.

[56]  Robert Preissner,et al.  WITHDRAWN—a resource for withdrawn and discontinued drugs , 2015, Nucleic Acids Res..

[57]  Hanna Geppert,et al.  Current Trends in Ligand-Based Virtual Screening: Molecular Representations, Data Mining Methods, New Application Areas, and Performance Evaluation , 2010, J. Chem. Inf. Model..

[58]  A Lavecchia,et al.  Virtual screening strategies in drug discovery: a critical review. , 2013, Current medicinal chemistry.

[59]  Gokmen Zararsiz,et al.  Drug/nondrug classification using Support Vector Machines with various feature selection strategies , 2014, Comput. Methods Programs Biomed..

[60]  David McLean,et al.  Logistic Model Tree Extraction From Artificial Neural Networks , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[61]  Lu Zhang,et al.  From machine learning to deep learning: progress in machine intelligence for rational drug discovery. , 2017, Drug discovery today.

[62]  A Srinivas Reddy,et al.  Virtual screening in drug discovery -- a computational perspective. , 2007, Current protein & peptide science.

[63]  Antonio Lavecchia,et al.  Machine-learning approaches in drug discovery: methods and applications. , 2015, Drug discovery today.

[64]  Pickett,et al.  Computational methods for the prediction of 'drug-likeness' , 2000, Drug discovery today.

[65]  Gregory W. Kauffman,et al.  QSAR and k-Nearest Neighbor Classification Analysis of Selective Cyclooxygenase-2 Inhibitors Using Topologically-Based Numerical Descriptors , 2001, J. Chem. Inf. Comput. Sci..