Using inductive logic programming to discover knowledge hidden in chemical data

This paper demonstrates how general-purpose tools from the field of Inductive Logic Programming (ILP) can be applied to analytical chemistry. To the authors' knowledge, this is the first published work to describe the application of the ILP tool Golem to separation science. An outline of the theory of ILP is given, together with a description of Golem and of previous applications of ILP. The advantages of ILP over classical machine induction techniques, such as the Top-Down Induction of Decision Trees family, are explained. A case study is then presented in which Golem is used to induce rules that predict, with 82% accuracy, whether each of a series of attempted separations succeeds or fails. The separation data were obtained from published work on the attempted separation of a series of 3-substituted phthalide enantiomer pairs on (R)-N-(3,5-dinitrobenzoyl)-phenylglycine.
