An Ensemble Learning Approach Based on Missing-Valued Tables

In classification problems on rough sets, the effectiveness of ensemble learning approaches such as bagging, random forests, and attribute sampling ensemble has been reported. We focus on occurrences of deficiencies in columns on the original decision table in random forests and attribute sampling ensemble approaches. In this paper, we generalize such deficiencies of columns to deficiencies of cells and propose an ensemble learning approach based on missing-valued decision tables. We confirmed the effectiveness of the proposed method for the classification performance through numerical experiments and the two-tailed Wilcoxon signed-rank test. Furthermore, we consider the robustness of the method in absences of condition attribute values of unknown objects.

[1]  Alexis Tsoukiàs,et al.  Incomplete Information Tables and Rough Classification , 2001, Comput. Intell..

[2]  Jerzy W. Grzymala-Busse,et al.  Generalized probabilistic approximations of incomplete data , 2014, Int. J. Approx. Reason..

[3]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.

[4]  Jerzy Stefanowski,et al.  The Bagging and n2-Classifiers Based on Rules Induced by MODLEM , 2004, Rough Sets and Current Trends in Computing.

[5]  Roman Slowinski,et al.  Variable Consistency Bagging Ensembles , 2010, Trans. Rough Sets.

[6]  Rafal Latkowski On Decomposition for Incomplete Data , 2003, Fundam. Informaticae.

[7]  Andrzej Skowron,et al.  Boolean Reasoning for Decision Rules Generation , 1993, ISMIS.

[8]  Marzena Kryszkiewicz,et al.  Rough Set Approach to Incomplete Information Systems , 1998, Inf. Sci..

[9]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[10]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[11]  J. Grzymala-Busse,et al.  Rough Set Approaches to Rule Induction from Incomplete Data , 2004 .