Generation of rough sets reducts and constructs based on inter-class and intra-class information

The reduct, originating from the Classic Rough Set Approach (CRSA), is an inclusion minimal subset of attributes that provides discernibility between objects from different classes in at least the same degree as the set of all attributes. It can be thus referred to as being consistent and minimal, which are the two important characteristics of filter-based feature selection. These two characteristics have been also utilized to define reducts within the Dominance-based Rough Set Approach (DRSA). Further, the classic reduct, here referred to as an inter-class reduct, has evolved into what is known as intra-class reduct and construct in CRSA. The idea is that while inter-class reducts utilize only one part of information generated from all pairs of objects, intra-class reducts utilize the remaining part, while constructs utilize both. The paper delivers a final unification of inter-class reducts, intra-class reducts and constructs across CRSA and DRSA, showing how they can be both defined and computed uniformly, i.e. using basically the same concepts and algorithms. It also presents an exact algorithm, capable of generating all exact reduced subsets, but of considerable complexity, as well as a simple and fast heuristic, designed to generate a single reduced subset. Finally, it illustrates the computation process with examples and some experimental evaluation of CRSA constructs, which show how the use of both the inter-class and the intra-class information can assist the attribute reduction process and help obtaining useful insights into the analyzed data set.

[1]  Salvatore Greco,et al.  Dominance-Based Rough Set Approach on Pairwise Comparison Tables to Decision Involving Multiple Decision Makers , 2011, RSKT.

[2]  Robert Susmaga,et al.  Reducts Versus Constructs: an Experimental Evaluation , 2003, RSKD.

[3]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[4]  Milosz Kadzinski,et al.  Robust ordinal regression in preference learning and ranking , 2013, Machine Learning.

[5]  Salvatore Greco,et al.  Rough sets theory for multicriteria decision analysis , 2001, Eur. J. Oper. Res..

[6]  Dominik Slezak,et al.  Approximate Entropy Reducts , 2002, Fundam. Informaticae.

[7]  Robert Susmaga,et al.  Tree-Like Parallelization of Reduct and Construct Computation , 2004, Rough Sets and Current Trends in Computing.

[8]  Wojciech Kotlowski,et al.  Stochastic dominance-based rough set model for ordinal classification , 2008, Inf. Sci..

[9]  Robert Susmaga,et al.  Reducts and Constructs in Attribute Reduction , 2004, Fundam. Informaticae.

[10]  Dominik Slezak,et al.  Order Based Genetic Algorithms for the Search of Approximate Entropy Reducts , 2003, RSFDGrC.

[11]  Robert Susmaga,et al.  Reducts and constructs in classic and dominance-based rough sets approach , 2014, Inf. Sci..

[12]  Roman Słowiński,et al.  Generation of reducts and rules in multi-attribute and multi-criteria classification , 2000 .

[13]  Yuhua Qian,et al.  Test-cost-sensitive attribute reduction , 2011, Inf. Sci..

[14]  Wei-Zhi Wu,et al.  Approaches to knowledge reduction based on variable precision rough set model , 2004, Inf. Sci..

[15]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[16]  Maria E. Orlowska,et al.  Maintenance of Knowledge in Dynamic Information Systems , 1992, Intelligent Decision Support.

[17]  Stanislaw Romanski,et al.  Operations on Families of Sets for Exhaustive Search, Given a Monotonic Function , 1988, JCDKB.

[18]  Hideo Tanaka,et al.  Fuzzy Inference System Based on Rough Sets and Its Application to Medical Diagnosis , 1992, Intelligent Decision Support.

[19]  Huan Liu,et al.  Feature Selection for Classification , 1997, Intell. Data Anal..

[20]  Zdzisław Pawlak,et al.  Can Bayesian confirmation measures be useful for rough set decision rules? , 2004, Eng. Appl. Artif. Intell..

[21]  Z. Pawlak,et al.  Independence of attributes , 1988 .

[22]  Marzena Kryszkiewicz,et al.  Towards Scalable Algorithms for Discovering Rough Set Reducts , 2004, Trans. Rough Sets.

[23]  Ron Kohavi,et al.  Supervised and Unsupervised Discretization of Continuous Features , 1995, ICML.

[24]  Marc Boullé,et al.  Khiops: A Statistical Discretization Method of Continuous Attributes , 2004, Machine Learning.

[25]  Andrzej Skowron,et al.  Decision Algorithms: A Survey of Rough Set - Theoretic Methods , 1997, Fundam. Informaticae.

[26]  Ankica Babic,et al.  Comparison of the Rough Sets Approach and Probabilistic Data Analysis Techniques on a Common Set of Medical Data , 1992, Intelligent Decision Support.

[27]  S French,et al.  Multicriteria Methodology for Decision Aiding , 1996 .

[28]  Robert Susmaga New test for inclusion minimality in reduct generation , 2000 .

[29]  Degang Chen,et al.  A systematic study on attribute reduction with rough sets based on general binary relations , 2008, Inf. Sci..

[30]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.

[31]  Wojciech Ziarko,et al.  DATA‐BASED ACQUISITION AND INCREMENTAL MODIFICATION OF CLASSIFICATION RULES , 1995, Comput. Intell..

[32]  Andrzej Skowron,et al.  Modeling rough granular computing based on approximation spaces , 2012, Inf. Sci..

[33]  Salvatore Greco,et al.  Rough Sets in Decision Making , 2009, Encyclopedia of Complexity and Systems Science.

[34]  Henryk Rybinski,et al.  A New Approach to Distributed Algorithms for Reduct Calculation , 2008, Trans. Rough Sets.

[35]  S. Greco,et al.  Rough set and rule-based multicriteria decision aiding , 2012 .

[36]  Z. Pawlak Rough Sets: Theoretical Aspects of Reasoning about Data , 1991 .

[37]  Z. Pawlak,et al.  Rough set approach to multi-attribute decision analysis , 1994 .