Design and evaluation of the RISE 1.0 learning system

Author(s): Domingos, Pedro

Abstract: Current rule induction systems (e.g. CN2) typically rely on a "separate and conquer" strategy: they induce one rule at a time, removing the newly covered examples from the training set after each step. This results in a dwindling number of examples being available for learning successive rules, which in turn causes several problems that adversely affect the accuracy of the resulting rules. The research reported here investigates the alternative: learning all rules simultaneously using the entire training set for each. A viable approach using this strategy is proposed and implemented in the RISE 1 system. Empirical comparison of the new system with CN2 suggests that "conquering without separating" performs similarly to its counterpart in simple domains, but achieves increasingly substantial gains in accuracy as the domain difficulty grows, without sacrificing speed.
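
The "separate and conquer" loop described in the abstract can be illustrated with a minimal sketch. This is neither CN2 nor RISE: the one-condition rule search `best_single_test`, the function names, and the toy weather data are all assumptions introduced here, chosen only to show how the pool of training examples shrinks from one rule to the next.

```python
from collections import Counter


def best_single_test(examples, target):
    """Pick the attribute=value test with the highest precision for `target`,
    breaking ties by coverage. A hypothetical stand-in for a real rule search."""
    covered = Counter()  # (attr, value) -> number of examples matching the test
    correct = Counter()  # (attr, value) -> matching examples labelled `target`
    for features, label in examples:
        for test in features.items():
            covered[test] += 1
            if label == target:
                correct[test] += 1
    return max(covered, key=lambda t: (correct[t] / covered[t], covered[t]))


def separate_and_conquer(examples, target):
    """Induce rules one at a time, removing covered examples after each step."""
    remaining = list(examples)
    rules, sizes = [], []
    while any(label == target for _, label in remaining):
        sizes.append(len(remaining))  # examples available for this rule
        attr, value = best_single_test(remaining, target)
        rules.append((attr, value))
        # "Separate": drop every example the new rule covers.
        remaining = [(f, l) for f, l in remaining if f.get(attr) != value]
    return rules, sizes


# Toy data (assumed for illustration only).
data = [
    ({"sky": "sunny", "wind": "weak"}, "yes"),
    ({"sky": "sunny", "wind": "strong"}, "yes"),
    ({"sky": "sunny", "wind": "strong"}, "yes"),
    ({"sky": "rainy", "wind": "weak"}, "yes"),
    ({"sky": "rainy", "wind": "strong"}, "no"),
    ({"sky": "cloudy", "wind": "strong"}, "no"),
]

rules, sizes = separate_and_conquer(data, "yes")
print(rules)  # rules in the order they were induced
print(sizes)  # number of examples available when each rule was learned
```

On this toy data the second rule is learned from only 3 of the 6 examples, which is the dwindling-data effect the abstract identifies. The alternative investigated in the work would instead evaluate every rule against the entire training set.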