Computer‐aided synthesis design: 40 years on

The discipline of retrosynthetic analysis is now just over 40 years old. From the earliest days, attempts were made to incorporate this approach into computer programs to test the extent to which chemical perception and synthetic thinking could be formalized. Despite pioneering research efforts, computer‐aided synthetic analysis failed to achieve widespread routine use by chemists, which can be attributed in part to the difficulty of building the high‐quality retrosynthetic transform databases required for credible analyses. However, with the advent over the past 25 years of large, comprehensive reaction databases, work on automating the construction of reliable and comprehensive reaction rule databases promises to revitalize research in this field. This review compares and contrasts the diverse approaches taken by selected programs to the design and implementation of molecule feature perception and reaction rule representation, and reviews the concepts of synthetic strategy selection, representation, and execution. In particular, we discuss current work on automating the construction of reliable and comprehensive synthetic rule sets from available reaction databases in newer programs such as ARChem. We argue that the progress achieved in this area paves the way to a deeper exploration of computer approaches to applying strategy and control in the synthesis problem. © 2011 John Wiley & Sons, Ltd.
