RuleKit: A Comprehensive Suite for Rule-Based Learning

Rule-based models are often used for data analysis as they combine interpretability with predictive power. We present RuleKit, a versatile tool for rule learning. Based on a sequential covering induction algorithm, it is suitable for classification, regression, and survival problems. The presence of a user-guided induction facilitates verifying hypotheses concerning data dependencies which are expected or of interest. The powerful and flexible experimental environment allows straightforward investigation of different induction schemes. The analysis can be performed in batch mode, through RapidMiner plug-in, or R package. A documented Java API is also provided for convenience. The software is publicly available at GitHub under GNU AGPL-3.0 license.

[1]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[2]  Antonio Gomariz,et al.  The SPMF Open-Source Data Mining Library Version 2 , 2016, ECML/PKDD.

[3]  Marek Sikora,et al.  GuideR: a guided separate-and-conquer rule learning in classification, regression, and survival settings , 2018, Knowl. Based Syst..

[4]  Peter Clark,et al.  The CN2 induction algorithm , 2004, Machine Learning.

[5]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[6]  Geoff Holmes,et al.  Generating Rule Sets from Model Trees , 1999, Australian Joint Conference on Artificial Intelligence.

[7]  Marek Sikora,et al.  Learning rule sets from survival data , 2017, BMC Bioinformatics.

[8]  Johannes Fürnkranz,et al.  Foundations of Rule Learning , 2012, Cognitive Technologies.

[9]  Hamido Fujita,et al.  Multi-Imbalance: An open-source software for multi-class imbalance learning , 2019, Knowl. Based Syst..

[10]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[11]  Martin Mozina,et al.  Orange: data mining toolbox in python , 2013, J. Mach. Learn. Res..

[12]  Marcin Michalak,et al.  Rule Quality Measures Settings in Classification, Regression and Survival Rule Induction - an Empirical Approach , 2016, Fundam. Informaticae.

[13]  Ryszard S. Michalski,et al.  On the Quasi-Minimal Solution of the General Covering Problem , 1969 .

[14]  Arkadiusz Wojna,et al.  Rseslib 3: Library of Rough Set and Machine Learning Methods with Extensible Architecture , 2019, Trans. Rough Sets.

[15]  Ayman Hajja,et al.  Hierarchical object-driven action rules , 2013, Journal of Intelligent Information Systems.

[16]  Marek Sikora,et al.  CHIRA - Convex Hull Based Iterative Algorithm of Rules Aggregation , 2013, Fundam. Informaticae.