OpenGrowth: An Automated and Rational Algorithm for Finding New Protein Ligands.

We present a new open-source software, called OpenGrowth, which aims to create de novo ligands by connecting small organic fragments in the active site of proteins. Molecule growth is biased to produce structures that statistically resemble drugs in an input training database. Consequently, the produced molecules have superior synthetic accessibility and pharmacokinetic properties compared with randomly grown molecules. The growth process can take into account the flexibility of the target protein and can be started from a seed to mimic R-group strategy or fragment-based drug discovery. Primary applications of the software on the HIV-1 protease allowed us to quickly identify new inhibitors with a predicted Kd as low as 18 nM. We also present a graphical user interface that allows a user to select easily the fragments to include in the growth process. OpenGrowth is released under the GNU GPL license and is available free of charge on the authors' website and at http://opengrowth.sourceforge.net/ .

[1]  Berk Hess,et al.  P-LINCS:  A Parallel Linear Constraint Solver for Molecular Simulation. , 2008, Journal of chemical theory and computation.

[2]  Gerrit Groenhof,et al.  GROMACS: Fast, flexible, and free , 2005, J. Comput. Chem..

[3]  E. Shakhnovich,et al.  SMall Molecule Growth 2001 (SMoG2001): an improved knowledge-based scoring function for protein-ligand interactions. , 2002, Journal of medicinal chemistry.

[4]  Sergio H. Rotstein,et al.  GroupBuild: a fragment-based method for de novo drug design , 1994 .

[5]  Holger Claussen,et al.  Second-generation de novo design: a view from a medicinal chemist perspective , 2009, J. Comput. Aided Mol. Des..

[6]  Alán Aspuru-Guzik,et al.  The Harvard Clean Energy Project: Large-Scale Computational Screening and Design of Organic Photovoltaics on the World Community Grid , 2011 .

[7]  GREGORY N. TEW,et al.  De novo design of antimicrobial polymers, foldamers, and small molecules: from discovery to practical applications. , 2010, Accounts of chemical research.

[8]  H. M. Vinkers,et al.  SYNOPSIS: SYNthesize and OPtimize System in Silico. , 2003, Journal of medicinal chemistry.

[9]  Youyong Li,et al.  Assessing the performance of MM/PBSA and MM/GBSA methods. 4. Accuracies of MM/PBSA and MM/GBSA methodologies evaluated by various simulation protocols using PDBbind data set. , 2014, Physical chemistry chemical physics : PCCP.

[10]  D. E. Clark,et al.  Outstanding challenges in protein–ligand docking and structure‐based virtual screening , 2011 .

[11]  Peter A. Kollman,et al.  AMBER, a package of computer programs for applying molecular mechanics, normal mode analysis, molecular dynamics and free energy calculations to simulate the structural and energetic properties of molecules , 1995 .

[12]  J. Bajorath,et al.  Docking and scoring in virtual screening for drug discovery: methods and applications , 2004, Nature Reviews Drug Discovery.

[13]  Markus Hartenfeller,et al.  De novo drug design. , 2010, Methods in molecular biology.

[14]  S. Teague Implications of protein flexibility for drug discovery , 2003, Nature Reviews Drug Discovery.

[15]  Michael M. Hann,et al.  RECAP — Retrosynthetic Combinatorial Analysis Procedure: A Powerful New Technique for Identifying Privileged Molecular Fragments with Useful Applications in Combinatorial Chemistry. , 1998 .

[16]  Claudio N. Cavasotto,et al.  Protein flexibility in ligand docking and virtual screening to protein kinases. , 2004, Journal of molecular biology.

[17]  J. Tomasi,et al.  The IEF version of the PCM solvation method: an overview of a new method addressed to study molecular solutes at the QM ab initio level , 1999 .

[18]  Conrad C. Huang,et al.  UCSF Chimera—A visualization system for exploratory research and analysis , 2004, J. Comput. Chem..

[19]  Julian Tirado-Rives,et al.  Computer-aided design of non-nucleoside inhibitors of HIV-1 reverse transcriptase. , 2006, Bioorganic & medicinal chemistry letters.

[20]  L. M. Amzel,et al.  Unique Thermodynamic Response of Tipranavir to Human Immunodeficiency Virus Type 1 Protease Drug Resistance Mutations , 2007, Journal of Virology.

[21]  Jana Václavíková,et al.  Enzymatic and structural analysis of the I47A mutation contributing to the reduced susceptibility to HIV protease inhibitor lopinavir , 2008, Protein science : a publication of the Protein Society.

[22]  Teruki Honma,et al.  Recent advances in de novo design strategy for practical lead identification , 2003, Medicinal research reviews.

[23]  Berk Hess,et al.  GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers , 2015 .

[24]  Luhua Lai,et al.  RASSE: A New Method for Structure-Based Drug Design , 1996, J. Chem. Inf. Comput. Sci..

[25]  Luhua Lai,et al.  LigBuilder: A Multi-Purpose Program for Structure-Based Drug Design , 2000 .

[26]  G. Whitesides,et al.  Combinatorial computational method gives new picomolar ligands for a known enzyme , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[27]  Hongwei Huang,et al.  E-Novo: An Automated Workflow for Efficient Structure-Based Lead Optimization , 2009, J. Chem. Inf. Model..

[28]  Wolfgang Wenzel,et al.  Flexible side chain models improve enrichment rates in in silico screening. , 2008, Journal of medicinal chemistry.

[29]  J. Mestan,et al.  Aza-peptide analogs as potent human immunodeficiency virus type-1 protease inhibitors with oral bioavailability. , 1996, Journal of medicinal chemistry.

[30]  Junmei Wang,et al.  Development and testing of a general amber force field , 2004, J. Comput. Chem..

[31]  D. Truhlar,et al.  The M06 suite of density functionals for main group thermochemistry, thermochemical kinetics, noncovalent interactions, excited states, and transition elements: two new functionals and systematic testing of four M06-class functionals and 12 other functionals , 2008 .

[32]  Peter M. Kasson,et al.  GROMACS 4.5: a high-throughput and highly parallel open source molecular simulation toolkit , 2013, Bioinform..

[33]  Ruben Abagyan,et al.  Four-dimensional docking: a fast and accurate account of discrete receptor flexibility in ligand docking. , 2009, Journal of medicinal chemistry.

[34]  Tingjun Hou,et al.  Molecular dynamics and free energy studies on the wild-type and double mutant HIV-1 protease complexed with amprenavir and two amprenavir-related inhibitors: mechanism for binding and drug resistance. , 2007, Journal of medicinal chemistry.

[35]  Thomas Bäck,et al.  The Molecule Evoluator. An Interactive Evolutionary Algorithm for the Design of Drug-Like Molecules , 2006, J. Chem. Inf. Model..

[36]  D. Koshland Application of a Theory of Enzyme Specificity to Protein Synthesis. , 1958, Proceedings of the National Academy of Sciences of the United States of America.

[37]  W. L. Jorgensen Quantum and statistical mechanical studies of liquids. 10. Transferable intermolecular potential functions for water, alcohols, and ethers. Application to liquid water , 2002 .

[38]  Carsten Kutzner,et al.  GROMACS 4:  Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation. , 2008, Journal of chemical theory and computation.

[39]  R. Abagyan,et al.  Flexible ligand docking to multiple receptor conformations: a practical alternative. , 2008, Current opinion in structural biology.

[40]  L. Hartmann,et al.  Selective estrogen-receptor modulators -- mechanisms of action and application to clinical practice. , 2003, The New England journal of medicine.

[41]  Holger Gohlke,et al.  The Amber biomolecular simulation programs , 2005, J. Comput. Chem..

[42]  William L Jorgensen,et al.  Efficient drug lead discovery and optimization. , 2009, Accounts of chemical research.

[43]  Youyong Li,et al.  Assessing the performance of MM/PBSA and MM/GBSA methods. 3. The impact of force fields and ligand charge models. , 2013, The journal of physical chemistry. B.

[44]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[45]  D. Selwood,et al.  Two- and Three-dimensional Rings in Drugs , 2014, Chemical biology & drug design.

[46]  Michel F Sanner,et al.  FLIPDock: Docking flexible ligands into flexible receptors , 2007, Proteins.

[47]  Gisbert Schneider,et al.  Flux (2): Comparison of Molecular Mutation and Crossover Operators for Ligand-Based de Novo Design , 2007, J. Chem. Inf. Model..

[48]  Arun K. Ghosh,et al.  P2′ Benzene Carboxylic Acid Moiety Is Associated with Decrease in Cellular Uptake: Evaluation of Novel Nonpeptidic HIV-1 Protease Inhibitors Containing P2 bis-Tetrahydrofuran Moiety , 2013, Antimicrobial Agents and Chemotherapy.

[49]  Gisbert Schneider De novo Molecular Design: Schneider/De novo Molecular Design , 2013 .

[50]  W. L. Jorgensen,et al.  Comparison of simple potential functions for simulating liquid water , 1983 .

[51]  Gisbert Schneider,et al.  Flux (1): A Virtual Synthesis Scheme for Fragment-Based de Novo Design , 2006, J. Chem. Inf. Model..

[52]  H. Carlson Protein flexibility and drug design: how to hit a moving target. , 2002, Current opinion in chemical biology.

[53]  J F Davies,et al.  Viracept (nelfinavir mesylate, AG1343): a potent, orally bioavailable inhibitor of HIV-1 protease. , 1997, Journal of medicinal chemistry.

[54]  Cheng Zhu,et al.  Drug repositioning for orphan diseases , 2011, Briefings Bioinform..

[55]  T. H. Dunning Gaussian basis sets for use in correlated molecular calculations. I. The atoms boron through neon and hydrogen , 1989 .

[56]  Woody Sherman,et al.  Protein and ligand preparation: parameters, protocols, and influence on virtual screening enrichments , 2013, Journal of Computer-Aided Molecular Design.

[57]  Michael C. Hutter Separating Drugs from Nondrugs: A Statistical Approach Using Atom Pair Distributions , 2007, J. Chem. Inf. Model..

[58]  T. Dunning,et al.  Electron affinities of the first‐row atoms revisited. Systematic basis sets and wave functions , 1992 .

[59]  Markus Hartenfeller,et al.  DOGS: Reaction-Driven de novo Design of Bioactive Compounds , 2012, PLoS Comput. Biol..

[60]  Regine Bohacek,et al.  Multiple Highly Diverse Structures Complementary to Enzyme Binding Sites: Results of Extensive Application of a de Novo Design Method Incorporating Combinatorial Growth , 1994 .

[61]  V. Stoll,et al.  X-ray crystallographic structure of ABT-378 (lopinavir) bound to HIV-1 protease. , 2002, Bioorganic & medicinal chemistry.

[62]  S. Kim,et al.  "Soft docking": matching of molecular surface cubes. , 1991, Journal of molecular biology.

[63]  H. Berendsen,et al.  Molecular dynamics with coupling to an external bath , 1984 .

[64]  J. Irwin,et al.  ZINC ? A Free Database of Commercially Available Compounds for Virtual Screening. , 2005 .

[65]  Mark A. Murcko,et al.  GenStar: A method for de novo drug design , 1993, J. Comput. Aided Mol. Des..

[66]  Ryan G. Coleman,et al.  ZINC: A Free Tool to Discover Chemistry for Biology , 2012, J. Chem. Inf. Model..

[67]  Eugene I. Shakhnovich,et al.  SMOG : DE NOVO DESIGN METHOD BASED ON SIMPLE, FAST, AND ACCURATE FREE ENERGY ESTIMATES. 2. CASE STUDIES IN MOLECULAR DESIGN , 1997 .

[68]  S. Nosé A unified formulation of the constant temperature molecular dynamics methods , 1984 .

[69]  Günther Wess,et al.  Medicinal Chemistry: Challenges and Opportunities. , 2001, Angewandte Chemie.

[70]  Maureen M Goodenow,et al.  Analysis of HIV-1 CRF_01 A/E protease inhibitor resistance: structural determinants for maintaining sensitivity and developing resistance to atazanavir. , 2006, Biochemistry.

[71]  Norman Wolmark,et al.  Effects of tamoxifen vs raloxifene on the risk of developing invasive breast cancer and other disease outcomes: the NSABP Study of Tamoxifen and Raloxifene (STAR) P-2 trial. , 2006, JAMA.

[72]  F. Clavel,et al.  HIV Drug Resistance , 2000, The New England journal of medicine.

[73]  Asher Mullard,et al.  Drug repurposing programmes get lift off , 2012, Nature Reviews Drug Discovery.

[74]  M. Parrinello,et al.  Isothermal-isobaric molecular dynamics using stochastic velocity rescaling. , 2009, The Journal of chemical physics.

[75]  Peter S Kutchukian,et al.  De novo design: balancing novelty and confined chemical space , 2010, Expert opinion on drug discovery.

[76]  M. Parrinello,et al.  Polymorphic transitions in single crystals: A new molecular dynamics method , 1981 .

[77]  G. Schneider,et al.  Enabling future drug discovery by de novo design , 2011 .

[78]  Akiko Itai,et al.  Effective handling of induced‐fit motion in flexible docking , 2006, Proteins.

[79]  Johann Gasteiger,et al.  Structure and reaction based evaluation of synthetic accessibility , 2007, J. Comput. Aided Mol. Des..

[80]  Matthias Rarey,et al.  In Pursuit of Fully Flexible Protein‐Ligand Docking: Modeling the Bilateral Mechanism of Binding , 2010, Molecular informatics.

[81]  Luhua Lai,et al.  LigBuilder 2: A Practical de Novo Drug Design Approach , 2011, J. Chem. Inf. Model..

[82]  Guosheng Wu,et al.  Structure-Based Design Technology Contour and Its Application to the Design of Renin Inhibitors , 2012, J. Chem. Inf. Model..

[83]  Hoover,et al.  Canonical dynamics: Equilibrium phase-space distributions. , 1985, Physical review. A, General physics.

[84]  Brian K. Shoichet,et al.  The incorporation of protein flexibility and conformational energy penalties in docking screens to improve ligand discovery , 2014, Nature chemistry.

[85]  J Andrew McCammon,et al.  AutoGrow 3.0: an improved algorithm for chemically tractable, semi-automated protein inhibitor design. , 2013, Journal of molecular graphics & modelling.

[86]  David L Mobley,et al.  Perspective: Alchemical free energy calculations for drug discovery. , 2012, The Journal of chemical physics.

[87]  David S. Goodsell,et al.  A semiempirical free energy force field with charge‐based desolvation , 2007, J. Comput. Chem..

[88]  Chris Morley,et al.  Open Babel: An open chemical toolbox , 2011, J. Cheminformatics.

[89]  Sheng-Yong Yang,et al.  PhDD: a new pharmacophore-based de novo design method of drug-like molecules combined with assessment of synthetic accessibility. , 2010, Journal of molecular graphics & modelling.

[90]  G. Bemis,et al.  BREED: Generating novel inhibitors through hybridization of known ligands. Application to CDK2, p38, and HIV protease. , 2004, Journal of medicinal chemistry.

[91]  Ross C. Walker,et al.  An overview of the Amber biomolecular simulation package , 2013 .

[92]  T. Darden,et al.  Particle mesh Ewald: An N⋅log(N) method for Ewald sums in large systems , 1993 .

[93]  Woody Sherman,et al.  Computational approaches for fragment-based and de novo design. , 2010, Current topics in medicinal chemistry.

[94]  George Papadatos,et al.  The ChEMBL bioactivity database: an update , 2013, Nucleic Acids Res..

[95]  Mark L. Pearson,et al.  Complete nucleotide sequence of the AIDS virus, HTLV-III , 1985, Nature.

[96]  G. Bold,et al.  Novel pseudosymmetric inhibitors of HIV-1 protease , 1993 .

[97]  David Lou,et al.  FOG: Fragment Optimized Growth Algorithm for the de Novo Generation of Molecules Occupying Druglike Chemical Space , 2009, J. Chem. Inf. Model..

[98]  M. Hatada,et al.  Novel binding mode of highly potent HIV-proteinase inhibitors incorporating the (R)-hydroxyethylamine isostere. , 1991, Journal of medicinal chemistry.

[99]  Kam-Chuen Jim,et al.  An electronic environment and contact direction sensitive scoring function for predicting affinities of protein-ligand complexes in Contour(®). , 2014, Journal of molecular graphics & modelling.

[100]  Matthias Rarey,et al.  FlexNovo: Structure‐Based Searching in Large Fragment Spaces , 2006, ChemMedChem.

[101]  Wim F Vranken,et al.  ACPYPE - AnteChamber PYthon Parser interfacE , 2012, BMC Research Notes.

[102]  Rommie E. Amaro,et al.  AutoGrow: A Novel Algorithm for Protein Inhibitor Design , 2009, Chemical biology & drug design.

[103]  Shantenu Jha,et al.  Computing Clinically Relevant Binding Free Energies of HIV-1 Protease Inhibitors , 2014, Journal of chemical theory and computation.

[104]  Holger Gohlke,et al.  MMPBSA.py: An Efficient Program for End-State Free Energy Calculations. , 2012, Journal of chemical theory and computation.

[105]  Wei Zhang,et al.  A point‐charge force field for molecular mechanics simulations of proteins based on condensed‐phase quantum mechanical calculations , 2003, J. Comput. Chem..

[106]  Hans-Joachim Böhm,et al.  The computer program LUDI: A new method for the de novo design of enzyme inhibitors , 1992, J. Comput. Aided Mol. Des..

[107]  S. Vasavanonda,et al.  ABT-538 is a potent inhibitor of human immunodeficiency virus protease and has high oral bioavailability in humans. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[108]  R. Friesner,et al.  Novel procedure for modeling ligand/receptor induced fit effects. , 2006, Journal of medicinal chemistry.

[109]  Tingjun Hou,et al.  Assessing the Performance of the MM/PBSA and MM/GBSA Methods. 1. The Accuracy of Binding Free Energy Calculations Based on Molecular Dynamics Simulations , 2011, J. Chem. Inf. Model..

[110]  A. Roitberg,et al.  pH-REMD simulations indicate that the catalytic aspartates of HIV-1 protease exist primarily in a monoprotonated state. , 2014, The journal of physical chemistry. B.

[111]  Natalia Novac,et al.  Challenges and opportunities of drug repositioning. , 2013, Trends in pharmacological sciences.

[112]  Akiko Itai,et al.  Automatic creation of drug candidate structures based on receptor structure. Starting point for artificial lead generation , 1991 .

[113]  Berk Hess,et al.  LINCS: A linear constraint solver for molecular simulations , 1997 .

[114]  Richard D. Taylor,et al.  Rings in drugs. , 2014, Journal of medicinal chemistry.

[115]  Ulf Norinder,et al.  Prediction of ADMET Properties , 2006, ChemMedChem.

[116]  D. Goodsell,et al.  Automated docking to multiple target structures: Incorporation of protein mobility and structural water heterogeneity in AutoDock , 2002, Proteins.

[117]  J. Mestan,et al.  New aza-dipeptide analogues as potent and orally absorbed HIV-1 protease inhibitors: candidates for clinical development. , 1998, Journal of medicinal chemistry.

[118]  M. Murcko,et al.  Crystal Structure of HIV-1 Protease in Complex with Vx-478, a Potent and Orally Bioavailable Inhibitor of the Enzyme , 1995 .

[119]  L. Kuo,et al.  Crystal structure at 1.9-A resolution of human immunodeficiency virus (HIV) II protease complexed with L-735,524, an orally bioavailable inhibitor of the HIV proteases. , 1996, The Journal of biological chemistry.

[120]  P. Kollman,et al.  Automatic atom type and bond type perception in molecular mechanical calculations. , 2006, Journal of molecular graphics & modelling.

[121]  Claudio N. Cavasotto,et al.  Homology modeling in drug discovery: current trends and applications. , 2009, Drug discovery today.

[122]  G. Schneider,et al.  Voyages to the (un)known: adaptive design of bioactive compounds. , 2009, Trends in biotechnology.

[123]  Giulio Rastelli,et al.  Fast and accurate predictions of binding free energies using MM‐PBSA and MM‐GBSA , 2009, J. Comput. Chem..

[124]  Gisbert Schneider,et al.  Computer-based de novo design of drug-like molecules , 2005, Nature Reviews Drug Discovery.

[125]  William L Jorgensen,et al.  From docking false-positive to active anti-HIV agent. , 2007, Journal of medicinal chemistry.

[126]  S. Siva Sathya,et al.  Evolutionary algorithms for de novo drug design - A survey , 2015, Appl. Soft Comput..