Theory-Driven Discovery of Reaction Pathways in the MECHEM System

One goal of machine discovery is to automate creative tasks from human scientific practice. This paper describes a project to automate, in a general manner, the theory-driven discovery of reaction pathways in chemistry and biology. We have designed a system, called MECHEM, that proposes credible pathway hypotheses from data ordinarily available to the chemist. MECHEM has been applied to reactions drawn from the history of biochemistry, from recent industrial chemistry as reported in journals, and from organic chemistry textbooks. The paper first explains the chemical problem and discusses previous AI treatments. It then presents the architecture of the system, the key algorithmic ideas, and the heuristics used to explore the very large space of chemical pathways. The system's efficacy is demonstrated on a biochemical reaction studied earlier by Kulkarni and Simon in the KEKADA system, and on another reaction from industrial chemistry. Our project has also resulted in separate novel contributions to chemical knowledge, demonstrating that we have not simplified the task for our convenience but have addressed its full complexity.

[1] Raúl E. Valdés-Pérez, et al. Symbolic computing on reaction pathways, 1990.

[2] Joshua Lederberg, et al. Applications of Artificial Intelligence for Chemical Inference: The Dendral Project, 1980.

[3] Herbert A. Simon, et al. Scientific discovery: computational explorations of the creative process, 1987.

[4] Herbert A. Simon, et al. The Processes of Scientific Discovery: The Strategy of Experimentation, 1988, Cogn. Sci.

[5] H. Simon, et al. Scientific Discovery and the Psychology of Problem Solving, 1977.

[6] F. L. Holmes, et al. Hans Krebs and the discovery of the ornithine cycle, 1980.

[7] Jean Jourdan, et al. Constraint Logic Programming Applied to Hypothetical Reasoning in Chemistry, 1990, NACLP.

[8] B. K. Carpenter. Determination of Organic Reaction Mechanisms, 1984.

[9] Raúl E. Valdés-Pérez. On the concept of stoichiometry of reaction mechanisms, 1991.

[10] Raúl E. Valdés-Pérez, et al. A necessary condition for catalysis in reaction pathways, 1992.

[11] Pat Langley, et al. A Hill-Climbing Approach to Machine Discovery, 1988, ML.

[12] H. Lodish, et al. How receptors bring proteins and particles into cells, 1984, Scientific American.

[13] Dennis D. Murphy, et al. Book review: Computational Models of Scientific Discovery and Theory Formation, edited by Jeff Shrager & Pat Langley (Morgan Kaufmann, San Mateo, CA, 1990), 1992, SGAR.

[14] Raúl E. Valdés-Pérez, et al. Algorithm to generate reaction pathways for computer-assisted elucidation, 1992.

[15] Russ B. Altman, et al. PROTEAN: Deriving Protein Structure from Constraints, 1986, AAAI.

[16] Raúl E. Valdés-Pérez, et al. A canonical representation of multistep reactions, 1991, J. Chem. Inf. Comput. Sci.