A comparison of classification in artificial intelligence, induction versus a self-organising neural networks

Abstract Three methods of classification (machine learning) were used to produce a program to choose a detector for ion chromatography (IC). The selected classification systems were: C4.5, an induction method based on an information theory algorithm; INDUCT, which is based on a probability algorithm and a self-organising neural network developed specifically for this application. They differ both in the learning strategy employed to structure the knowledge, and the representation of knowledge acquired by the system, i.e., rules, decision trees and a neural network. A database of almost 4000 cases, that covered most IC experiments reported in the chemical literature in the period 1979 to 1989, comprised the basis for the development of the system. Generally, all three algorithms performed very well for this application. They managed to induce rules, or produce a network that had about a 70% success rate for the prediction of detectors reported in the publication and over 90% success for choosing a detector that could be used for the described method. This was considered acceptable due to the nature of the problem domain and that of the training set. Each method effectively handled the very high noise levels in the training set and was able to select the relevant attributes.

[1]  Donald R. Scott,et al.  Pattern recognition/expert system for mass spectra of volatile toxic and other organic compounds , 1992 .

[2]  D. B. Hibbert Genetic algorithms in chemistry , 1993 .

[3]  Brian R. Gaines An Ounce of Knowledge is Worth a Ton of Data: Quantitative studies of the Trade-Off between Expertise and Data Based On Statistically Well-Founded Empirical Induction , 1989, ML.

[4]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[5]  Donald R. Scott Improved method for estimating molecular weights of volatile organic compounds from low resolution mass spectra , 1991 .

[6]  John H. Holland,et al.  Induction: Processes of Inference, Learning, and Discovery , 1987, IEEE Expert.

[7]  Donald R. Scott Classification of binary mass spectra of toxic compounds with an inductive expert system and comparison with SIMCA class modeling , 1988 .

[8]  P. Hopke,et al.  Comparison of rule-building expert systems with pattern recognition for the classification of analytical data , 1987 .

[9]  H. Lohninger,et al.  Classification of mass spectra: A comparison of yes/no classification methods for the recognition of simple structural properties , 1994 .

[10]  George F. Luger,et al.  Artificial Intelligence and the Design of Expert Systems , 1990 .

[11]  Claude Sammut,et al.  Application of the C4.5 classifier to building an expert system for ion chromatography , 1995 .

[12]  A. F. Chalmers,et al.  What Is This Thing Called Science , 1976 .

[13]  Desire L. Massart,et al.  Evaluation of the expert system shells EX-TRAN and TIMM as rule-building tools for classification purposes , 1988 .