Improving Multiclass ILP by Combining Partial Rules with Winnow Algorithm: Results on Classification of Dopamine Antagonist Molecules

In this paper, we propose an approach which can improve Inductive Logic Programming in multiclass problems. This approach is based on the idea that if a whole rule cannot be applied to an example, some partial matches of the rule can be useful. The most suitable class should be the class whose important partial matches cover the example more than those from other classes. Hence, the partial matches of the rule, called partial rules, are first extracted from the original rules. Then, we utilize the idea of Winnow algorithm to weigh each partial rule. Finally, the partial rules and the weights are combined and used to classify new examples. The weights of partial rules show another aspect of the knowledge which can be discovered from the data set. In the experiments, we apply our approach to a multiclass real-world problem, classification of dopamine antagonist molecules. The experimental results show that the proposed method gives the improvement over the original rules and yields 88.58% accuracy by running 10-fold cross validation.

[1]  Ashwin Srinivasan,et al.  Feature construction with Inductive Logic Programming: A Study of Quantitative Predictions of Biological Activity Aided by Structural Attributes , 1999, Data Mining and Knowledge Discovery.

[2]  Thomas G. Dietterich,et al.  Solving Multiclass Learning Problems via Error-Correcting Output Codes , 1994, J. Artif. Intell. Res..

[3]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[4]  Hector J. Levesque,et al.  A New Method for Solving Hard Satisfiability Problems , 1992, AAAI.

[5]  Boonserm Kijsirikul,et al.  Approximate Match of Rules Using Backpropagation Neural Networks , 2001, Machine Learning.

[6]  Yoram Singer,et al.  Reducing Multiclass to Binary: A Unifying Approach for Margin Classifiers , 2000, J. Mach. Learn. Res..

[7]  Peter Clark,et al.  Rule Induction with CN2: Some Recent Improvements , 1991, EWSL.

[8]  Luc De Raedt,et al.  On Multi-class Problems and Discretization in Inductive Logic Programming , 1997, ISMIS.

[9]  Henrik Boström,et al.  Classifying Uncovered Examples by Rule Stretching , 2001, ILP.

[10]  Ashwin Srinivasan,et al.  Feature Construction with Inductive Logic Programming: A Study of Quantitative Predictions of Biological Activity by Structural Attributes , 1996, Inductive Logic Programming Workshop.

[11]  Johannes Fürnkranz,et al.  Round Robin Rule Learning , 2001, ICML.

[12]  Marie-Odile Cordier,et al.  Application of ILP to Cardiac Arrhythmia Characterization for Chronicle Recognition , 2001, ILP.

[13]  Ross D. King,et al.  Application of Inductive Logic Programming to Structure-Based Drug Design , 2003, PKDD.

[14]  N. Littlestone Learning Quickly When Irrelevant Attributes Abound: A New Linear-Threshold Algorithm , 1987, 28th Annual Symposium on Foundations of Computer Science (sfcs 1987).

[15]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .