First order random forests: Learning relational classifiers with complex aggregates

In relational learning, predictions for an individual are based not only on its own properties but also on the properties of a set of related individuals. Relational classifiers differ in how they handle these sets: some use properties of the set as a whole (through aggregation), while others refer to properties of specific individuals in the set. Most classifiers, however, do not combine both, which imposes an undesirable bias on these learners. This article describes a learning approach that avoids this bias, using first order random forests. Essentially, an ensemble of decision trees is constructed in which tests are first order logic queries. These queries may contain aggregate functions, the argument of which may again be a first order logic query. Introducing aggregate functions into first order logic, and upgrading the forest’s uniform feature sampling procedure to the space of first order logic, both raise a number of complications, for which we propose solutions. The resulting first order random forest induction algorithm has been implemented and integrated in the ACE-ilProlog system, and experimentally evaluated on a variety of datasets. The results indicate that first order random forests with complex aggregates are an efficient and effective approach to learning relational classifiers that involve aggregates over complex selections.
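To make the idea of an aggregate over a complex selection concrete, the following is a minimal Python sketch and not the ACE-ilProlog implementation. It assumes a toy representation in which each example is a list of related individuals stored as dictionaries. The names `AggregateTest`, `sample_tests`, `count`, `maximum`, and the molecule data are all hypothetical. It also assumes the candidate tests are already enumerated in a finite list, which sidesteps the sampling complications the abstract mentions.

```python
import random
from dataclasses import dataclass
from typing import Callable, Dict, List

# Hypothetical toy representation: one related individual (e.g. an atom).
Member = Dict[str, object]


@dataclass(frozen=True)
class AggregateTest:
    """Test of the form: aggregate({m[attribute] | m in set, selection(m)}) >= threshold."""
    name: str
    selection: Callable[[Member], bool]        # condition on individual set members
    aggregate: Callable[[List[float]], float]  # property of the selected set as a whole
    attribute: str
    threshold: float

    def holds(self, members: List[Member]) -> bool:
        # Select first, then aggregate: this combines both kinds of bias.
        values = [float(m[self.attribute]) for m in members if self.selection(m)]
        return self.aggregate(values) >= self.threshold


def count(values: List[float]) -> float:
    return float(len(values))


def maximum(values: List[float]) -> float:
    return max(values) if values else float("-inf")


def sample_tests(candidates: List[AggregateTest], k: int,
                 rng: random.Random) -> List[AggregateTest]:
    """Uniformly draw up to k candidate tests for one tree node.

    Assumption: candidates is a finite, pre-enumerated list.
    """
    return rng.sample(candidates, min(k, len(candidates)))


# Hypothetical example: a molecule described by its atoms.
molecule = [
    {"element": "c", "charge": 0.2},
    {"element": "c", "charge": -0.1},
    {"element": "o", "charge": -0.4},
]

def is_carbon(m: Member) -> bool:
    return m["element"] == "c"

candidates = [
    AggregateTest("count(carbon) >= 2", is_carbon, count, "charge", 2.0),
    AggregateTest("max(charge of carbon) >= 0.3", is_carbon, maximum, "charge", 0.3),
]

node_tests = sample_tests(candidates, k=1, rng=random.Random(0))
results = {t.name: t.holds(molecule) for t in candidates}
```

Here the first test holds for the molecule (it has two carbon atoms) and the second does not (the largest carbon charge is 0.2). In a forest, each node would evaluate only the tests returned by `sample_tests` and keep the best split among them.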
