Fuzzy Rough Discernibility Matrix Based Feature Subset Selection With MapReduce

Fuzzy-rough set theory (FRST) hybridizes fuzzy sets with rough sets and is applied to attribute reduction in hybrid decision systems. Existing reduct computation approaches in fuzzy-rough sets do not scale to large decision systems because of their high space complexity. The iterative MapReduce framework of Apache Spark facilitates the development of scalable, fault-tolerant distributed algorithms. This work introduces the MR_FRDM_SBE algorithm as one of the first attempts at scalable fuzzy-rough set based attribute reduction. MR_FRDM_SBE combines a novel incremental approach for constructing a distributed fuzzy-rough discernibility matrix with distributed fuzzy-rough attribute reduction over that matrix, driven by a Sequential Backward Elimination (SBE) control strategy. A comparative experimental study on large-scale benchmark hybrid decision systems demonstrated the relevance of the proposed approach to scalable attribute reduction and to the construction of better classification models.
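
To make the two ingredients named above concrete, the following is a minimal single-machine sketch of attribute reduction by backward elimination over a fuzzy discernibility matrix. It is not the distributed MR_FRDM_SBE algorithm and uses no Spark. The range-normalized difference used as the discernibility degree, the max aggregation across attributes, and all function names are assumptions chosen for illustration, not details taken from the paper.

```python
from itertools import combinations


def discernibility_matrix(rows, labels):
    """Build one entry per object pair whose decisions differ.

    Each entry is a list giving, per attribute, the degree in [0, 1] to
    which that attribute discerns the pair. Assumption: the degree is the
    absolute difference normalized by the attribute's value range.
    """
    n_attrs = len(rows[0])
    ranges = []
    for a in range(n_attrs):
        col = [r[a] for r in rows]
        ranges.append((max(col) - min(col)) or 1.0)  # avoid division by zero
    entries = []
    for i, j in combinations(range(len(rows)), 2):
        if labels[i] != labels[j]:
            entries.append(
                [abs(rows[i][a] - rows[j][a]) / ranges[a] for a in range(n_attrs)]
            )
    return entries


def satisfaction(entries, attrs):
    """Degree to which the attribute subset discerns each pair (max aggregation)."""
    attrs = list(attrs)
    return [max((e[a] for a in attrs), default=0.0) for e in entries]


def sbe_reduct(entries, n_attrs, tol=1e-9):
    """Backward elimination: start from all attributes and drop each one
    whose removal leaves every pair as discernible as under the full set."""
    target = satisfaction(entries, range(n_attrs))
    kept = list(range(n_attrs))
    for a in range(n_attrs):
        trial = [b for b in kept if b != a]
        trial_sat = satisfaction(entries, trial)
        if all(s >= t - tol for s, t in zip(trial_sat, target)):
            kept = trial
    return kept


# Toy data: attribute 0 separates the classes, attribute 1 is noisy,
# attribute 2 is constant.
rows = [
    [0.0, 0.1, 5.0],
    [0.0, 0.9, 5.0],
    [1.0, 0.2, 5.0],
    [1.0, 0.8, 5.0],
]
labels = [0, 0, 1, 1]
entries = discernibility_matrix(rows, labels)
reduct = sbe_reduct(entries, n_attrs=3)
print(reduct)  # → [0]
```

The matrix holds one entry per pair of objects with different decisions, so it grows quadratically with the number of objects. This is the space cost that motivates building the matrix in a distributed, incremental fashion.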
