Iterative Stochastic Elimination for Solving Complex Combinatorial Problems in Drug Discovery

Iterative Stochastic Elimination (ISE) is a novel algorithm that was originally developed to solve extremely complex problems in protein structure and interactions, and has since been applied to diverse topics that share a few general “ingredients”: they are extremely complex, of combinatorial nature, may be presented as large sets of variables that can each have many alternative values, there is some interdependence of the variables on each other, and there is a scoring function that can evaluate each choice of the problems “configuration”; this is the set of single values of each of the variables that constitute its full presentation. Those are picked randomly in a large sample, the analysis of which allows decisions to be made for rejecting some values for each of the variables; thus resulting in a smaller set of potential combinations. This continues in iterations until the number of combinations allows all the remaining options to be computed exhaustively and to order them by their scores. ISE has been mainly applied to problems that are relevant to drug design and discovery. We demonstrate, among others, the use of ISE to determine the properties of molecular ensembles and to pick the best molecules (“focused libraries”) for hitting a specific target. Future ideas for using ISE are discussed, as well as mentioning its contributions to the construction of two start-up companies.

[1]  Alexander Golbraikh,et al.  Quantitative structure-property relationship modeling of remote liposome loading of drugs. , 2012, Journal of controlled release : official journal of the Controlled Release Society.

[2]  Tudor I. Oprea,et al.  Model-Free Drug-Likeness from Fragments , 2010, J. Chem. Inf. Model..

[3]  Sean Ekins,et al.  Computational Models to Assign Biopharmaceutics Drug Disposition Classification from Molecular Structure , 2007, Pharmaceutical Research.

[4]  A. Goldblum,et al.  Molecular properties from conformational ensembles. 1. Dipole moments of molecules with multiple internal rotations. , 2011, The journal of physical chemistry. A.

[5]  David S. Goodsell,et al.  Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function , 1998 .

[6]  S. Rasmussen,et al.  Structure of a nanobody-stabilized active state of the β2 adrenoceptor , 2010, Nature.

[7]  M. Karplus,et al.  A combined quantum mechanical and molecular mechanical potential for molecular dynamics simulations , 1990 .

[8]  Matthew P. Repasky,et al.  Glide: a new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy. , 2004, Journal of medicinal chemistry.

[9]  Ryan G. Coleman,et al.  ZINC: A Free Tool to Discover Chemistry for Biology , 2012, J. Chem. Inf. Model..

[10]  Richard D. Taylor,et al.  Modeling water molecules in protein-ligand docking using GOLD. , 2005, Journal of medicinal chemistry.

[11]  Amiram Goldblum,et al.  Predicting Oral Druglikeness by Iterative Stochastic Elimination , 2010, J. Chem. Inf. Model..

[12]  Roland L. Dunbrack,et al.  proteins STRUCTURE O FUNCTION O BIOINFORMATICS Improved prediction of protein side-chain conformations with SCWRL4 , 2022 .

[13]  R. Nussinov,et al.  The role of dynamic conformational ensembles in biomolecular recognition. , 2009, Nature chemical biology.

[14]  Rolf H. Möhring,et al.  Resource-constrained project scheduling: Notation, classification, models, and methods , 1999, Eur. J. Oper. Res..

[15]  Samuel H. Wilson,et al.  Crystal structures of human DNA polymerase beta complexed with gapped and nicked DNA: evidence for an induced fit mechanism. , 1997, Biochemistry.

[16]  Thomas Lengauer,et al.  A fast flexible docking method using an incremental construction algorithm. , 1996, Journal of molecular biology.

[17]  Tudor I. Oprea,et al.  Is There a Difference between Leads and Drugs? A Historical Perspective , 2001, J. Chem. Inf. Comput. Sci..

[18]  Vinod Nair,et al.  Biopharmaceutic classification system: a scientific framework for pharmacokinetic optimization in drug research. , 2004, Current drug metabolism.

[19]  J Hoflack,et al.  Modeling of G-protein-coupled receptors: application to dopamine, adrenaline, serotonin, acetylcholine, and mammalian opsin receptors. , 1992, Journal of medicinal chemistry.

[20]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[21]  Matthew P. Repasky,et al.  Extra precision glide: docking and scoring incorporating a model of hydrophobic enclosure for protein-ligand complexes. , 2006, Journal of medicinal chemistry.

[22]  R. Stevens,et al.  High-Resolution Crystal Structure of an Engineered Human β2-Adrenergic G Protein–Coupled Receptor , 2007, Science.

[23]  Hege S. Beard,et al.  Glide: a new approach for rapid, accurate docking and scoring. 2. Enrichment factors in database screening. , 2004, Journal of medicinal chemistry.

[24]  John P. Overington,et al.  ChEMBL: a large-scale bioactivity database for drug discovery , 2011, Nucleic Acids Res..

[25]  Richard D. Taylor,et al.  Improved protein–ligand docking using GOLD , 2003, Proteins.

[26]  Amiram Goldblum,et al.  High quality binding modes in docking ligands to proteins , 2008, Proteins.

[27]  Amiram Goldblum,et al.  A stochastic algorithm for global optimization and for best populations: A test case of side chains in proteins , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[28]  Koenraad Van Leemput,et al.  Automated model-based tissue classification of MR images of the brain , 1999, IEEE Transactions on Medical Imaging.

[29]  F. Allen The Cambridge Structural Database: a quarter of a million crystal structures and rising. , 2002, Acta crystallographica. Section B, Structural science.

[30]  W Patrick Walters,et al.  A detailed comparison of current docking and scoring methods on systems of pharmaceutical relevance , 2004, Proteins.

[31]  Tudor I. Oprea,et al.  Property distribution of drug-related chemical databases* , 2000, J. Comput. Aided Mol. Des..

[32]  Scott Boyer,et al.  Development, interpretation and temporal evaluation of a global QSAR of hERG electrophysiology screening data , 2007, J. Comput. Aided Mol. Des..

[34]  A. Rayan,et al.  Exploring the conformational space of cyclic peptides by a stochastic search method. , 2004, Journal of molecular graphics & modelling.

[35]  Amiram Goldblum,et al.  Computer-aided design of liposomal drugs: In silico prediction and experimental validation of drug candidates for liposomal remote loading. , 2014, Journal of controlled release : official journal of the Controlled Release Society.

[36]  M. Levitt,et al.  Theoretical studies of enzymic reactions: dielectric, electrostatic and steric stabilization of the carbonium ion in the reaction of lysozyme. , 1976, Journal of molecular biology.

[37]  Kuo-Chen Chou,et al.  Recent advances in predicting protein classification and their applications to drug development. , 2013, Current topics in medicinal chemistry.