EvoMD: An Algorithm for Evolutionary Molecular Design

Traditionally, Computer-Aided Molecular Design (CAMD) uses heuristic search and mathematical programming to tackle the molecular design problem. But these techniques do not handle large and nonlinear search space very well. To overcome these drawbacks, graph-based evolutionary algorithms (EAs) have been proposed to evolve molecular design by mimicking chemical reactions on the exchange of chemical bonds and components between molecules. For these EAs to perform their tasks, known molecular components, which can serve as building blocks for the molecules to be designed, and known chemical rules, which govern chemical combination between different components, have to be introduced before the evolutionary process can take place. To automate molecular design without these constraints, this paper proposes an EA called Evolutionary Algorithm for Molecular Design (EvoMD). EvoMD encodes molecular designs in graphs. It uses a novel crossover operator which does not require known chemistry rules known in advanced and it uses a set of novel mutation operators. EvoMD uses atomics-based and fragment-based approaches to handle different size of molecule, and the value of the fitness function it uses is made to depend on the property descriptors of the design encoded in a molecular graph. It has been tested with different data sets and has been shown to be very promising.

[1]  W. Spears,et al.  On the Virtues of Parameterized Uniform Crossover , 1995 .

[2]  M. L. MartinezScienti On the Use of Direct Search Methods for the Molecular Conformation Problem , 1994 .

[3]  Yi-Zeng Liang,et al.  The Matrix Expression, Topological Index and Atomic Attribute of Molecular Topological Structure , 2021, Journal of Data Science.

[4]  Gennady M Verkhivker,et al.  Molecular recognition of the inhibitor AG-1343 by HIV-1 protease: conformationally flexible docking by evolutionary programming. , 1995, Chemistry & biology.

[5]  T. D. Plantenga,et al.  Novel Applications of Optimization to Molecule Design , 1997 .

[6]  Peter Willett,et al.  Chemical structure systems : computational techniques for representation, searching and processing of structural information , 1991 .

[7]  Juan C. Meza,et al.  Do intelligent configuration search techniques outperform random search for large molecules , 1992 .

[8]  W. Guida,et al.  The art and practice of structure‐based drug design: A molecular modeling perspective , 1996, Medicinal research reviews.

[9]  Tad Hurst,et al.  Flexible 3D searching: The directed tweak technique , 1994, J. Chem. Inf. Comput. Sci..

[10]  Barry Robson,et al.  PRO_LIGAND: An approach to de novo molecular design. 1. Application to the design of organic molecules , 1995, J. Comput. Aided Mol. Des..

[11]  Mahindra T. Makhija,et al.  De novo design and synthesis of HIV-1 integrase inhibitors. , 2004, Bioorganic & medicinal chemistry.

[12]  Petra Schneider,et al.  De novo design of molecular architectures by evolutionary assembly of drug-derived building blocks , 2000, J. Comput. Aided Mol. Des..

[13]  Robert C. Glen,et al.  A genetic algorithm for the automated generation of molecules within constraints , 1995, J. Comput. Aided Mol. Des..

[14]  J. Devillers Genetic algorithms in molecular modeling , 1996 .

[15]  David E. Clark Evolutionary Algorithms in Computer-Aided Molecular Design: A Review of Current Applications and a Look to the Future , 1999 .

[16]  A. Globus,et al.  Automatic molecular design using evolutionary techniques , 1999 .

[17]  R. Lavery,et al.  A new approach to the rapid determination of protein side chain conformations. , 1991, Journal of biomolecular structure & dynamics.

[18]  Gerta Rücker,et al.  Exploring the Limits of Graph Invariant- and Spectrum-Based Discrimination of (Sub)structures , 2002, J. Chem. Inf. Comput. Sci..

[19]  Irwin D. Kuntz,et al.  A genetic algorithm for structure-based de novo design , 2001, J. Comput. Aided Mol. Des..

[20]  G. Labesse,et al.  LEA3D: a computer-aided ligand design for structure-based drug design. , 2005, Journal of medicinal chemistry.

[21]  Edwin R. Hancock,et al.  Eigenspaces for Graphs , 2002, Int. J. Image Graph..

[22]  Johann Gasteiger,et al.  A Graph-Based Genetic Algorithm and Its Application to the Multiobjective Evolution of Median Molecules , 2004, J. Chem. Inf. Model..

[23]  John Bradshaw Chemical structure systems, Computational Techniques for Representation, Searching and Processing of Structural Information, ed. by Janet E. Ash, Wendy A. Warr and Peter Willett, Ellis Horwood, Chichester, 1991, ISBN 0‐13‐12669‐3, 351 pp., £39.50 , 1993 .

[24]  Tingjun Hou,et al.  Conformational analysis of peptides using Monte Carlo simulations combined with the genetic algorithm , 1999 .

[25]  Venkat Venkatasubramanian,et al.  Evolutionary Design of Molecules with Desired Properties Using the Genetic Algorithm , 1995, J. Chem. Inf. Comput. Sci..

[26]  C. B. Lucasius,et al.  Conformational analysis of a dinucleotide photodimer with the aid of the genetic algorithm , 1992, Biopolymers.

[27]  I. Gutman,et al.  Mathematical Concepts in Organic Chemistry , 1986 .

[28]  Luhua Lai,et al.  LigBuilder: A Multi-Purpose Program for Structure-Based Drug Design , 2000 .

[29]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[30]  J. Devillers,et al.  Designing Biodegradable Molecules from the Combined Use of a Backpropagation Neural Network and a Genetic Algorithm , 1996 .

[31]  Lutgarde M. C. Buydens,et al.  The ineffectiveness of recombination in a genetic algorithm for the structure elucidation of a heptapeptide in torsion angle space. A comparison to simulated annealing , 1997 .

[32]  Juan C. Meza,et al.  On the Use of Direct Search Methods for the Molecular Conformation Problem , 1994 .

[33]  S. Wilson,et al.  Applications of simulated annealing to peptides , 1990, Biopolymers.

[34]  Michael K. Gilson,et al.  Tork: Conformational analysis method for molecules and complexes , 2003, J. Comput. Chem..

[35]  Kenneth A. De Jong,et al.  An Analysis of the Interacting Roles of Population Size and Crossover in Genetic Algorithms , 1990, PPSN.

[36]  Ovidiu Ivanciuc,et al.  Graph Theory in Chemistry , 2002 .

[37]  I. Kuntz Structure-Based Strategies for Drug Design and Discovery , 1992, Science.

[38]  Johann Gasteiger,et al.  Canonical Numbering and Constitutional Symmetry , 1977, J. Chem. Inf. Comput. Sci..

[39]  James Devillers,et al.  Designing Molecules with Specific Properties from Intercommunicating Hybrid Systems , 1996, J. Chem. Inf. Comput. Sci..