Virtual screening and repositioning of inconclusive molecules of beta-lactamase Bioassays - A data mining approach

This study focuses on the best possible way forward in utilizing inconclusive molecules of PubChem bioassays AID 1332, AID 434987 and AID 434955, which are related to beta-lactamase inhibitors of Mycobacterium tuberculosis (Mtb). The inadequacy in the experimental methods that were observed during the invitro screening resulted in an inconclusive dataset. This could be due to certain moieties present within the molecules. In order to reconsider such molecules, insilico methods can be suggested in place of invitro methods For instance, datamining and medicinal chemistry methods: have been adopted to prioritise the inconclusive dataset into active or inactive molecules. These include the Random Forest algorithm for dataminning, Lilly MedChem rules for virtually screening out the promiscuity, and Self Organizing Maps (SOM) for clustering the active molecules and enlisting them for repositioning through the use of artificial neural networks. These repositioned molecules could then be prioritized for downstream drug discovery analysis.

[1]  Sean Ekins,et al.  Analysis and hit filtering of a very large library of compounds screened against Mycobacterium tuberculosis. , 2010, Molecular bioSystems.

[2]  Barry A. Bunin,et al.  Machine Learning Models and Pathway Genome Data Base for Trypanosoma cruzi Drug Discovery , 2015, PLoS neglected tropical diseases.

[3]  Tudor I. Oprea,et al.  Ligand-Based Virtual Screening by Novelty Detection with Self-Organizing Maps , 2007, J. Chem. Inf. Model..

[4]  Vinod Scaria,et al.  Predictive models for anti-tubercular molecules using machine learning on high-throughput biological screening datasets , 2011, BMC Research Notes.

[5]  Antony J. Williams,et al.  Machines first, humans second: on the importance of algorithmic interpretation of open chemistry data , 2015, Journal of Cheminformatics.

[6]  P Schneider,et al.  Self-organizing maps in drug discovery: compound library design, scaffold-hopping, repurposing. , 2009, Current medicinal chemistry.

[7]  Sean Ekins,et al.  Computational Prediction and Validation of an Expert's Evaluation of Chemical Probes , 2014, J. Chem. Inf. Model..

[8]  Sean Ekins,et al.  Combining Cheminformatics Methods and Pathway Analysis to Identify Molecules with Whole-Cell Activity Against Mycobacterium Tuberculosis , 2012, Pharmaceutical Research.

[9]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[10]  A Srinivas Reddy,et al.  Virtual screening in drug discovery -- a computational perspective. , 2007, Current protein & peptide science.

[11]  Amanda C. Schierz Virtual screening of bioassay data , 2009, J. Cheminformatics.

[12]  Sean Ekins,et al.  Combining Metabolite-Based Pharmacophores with Bayesian Machine Learning Models for Mycobacterium tuberculosis Drug Discovery , 2015, PloS one.

[13]  Sean Ekins,et al.  Computational models for tuberculosis drug discovery. , 2013, Methods in molecular biology.

[14]  P. Tonge,et al.  Novel trisubstituted benzimidazoles, targeting Mtb FtsZ, as a new class of antitubercular agents. , 2011, Journal of medicinal chemistry.

[15]  Alex M. Clark,et al.  Open Source Bayesian Models. 1. Application to ADME/Tox and Drug Discovery Datasets , 2015, J. Chem. Inf. Model..

[16]  Sean Ekins,et al.  Combining Computational Methods for Hit to Lead Optimization in Mycobacterium Tuberculosis Drug Discovery , 2013, Pharmaceutical Research.

[17]  Kozo Takayama,et al.  Artificial Neural Network as a Novel Method to Optimize Pharmaceutical Formulations , 2004, Pharmaceutical Research.

[18]  Ian A. Watson,et al.  Rules for identifying potentially reactive or promiscuous compounds. , 2012, Journal of medicinal chemistry.

[19]  K. Mdluli,et al.  Novel targets for tuberculosis drug discovery. , 2006, Current opinion in pharmacology.

[20]  Johann Gasteiger,et al.  Neural networks and genetic algorithms in drug design , 2001 .

[21]  Sean Ekins,et al.  In silico repositioning of approved drugs for rare and neglected diseases. , 2011, Drug discovery today.

[22]  Jun Feng,et al.  PowerMV: A Software Environment for Molecular Viewing, Descriptor Generation, Data Analysis and Hit Evaluation , 2005, J. Chem. Inf. Model..