Associated multi-label fuzzy-rough feature selection

Ahead of the process of selecting a subset of relevant features, the labels commonly need to be combined into a single one for multi-label feature selection. However the existing label combination methods assume that all labels are independent of each other and consequently suffer from high computation complexity. In this paper, association rules implied in the labels are explored to implement a fuzzy-rough feature selection method for multi-label datasets. Specifically, in order to reduce the scale of label and avoid the label overlapping phenomenon, the association rules between labels make the combination of labels collapse to a set of sub-labels. Then each set of sub-labels is regarded as a unique class during the following course of fuzzy-rough feature selection. Empirical results suggest that the quality of the selected features can be improved by the proposed approach compared to the alternative multi-label feature selection algorithms.

[1]  Qinghua Hu,et al.  Multi-label Feature Selection with Fuzzy Rough Sets , 2014, RSKT.

[2]  Hiroshi Motoda,et al.  Computational Methods of Feature Selection , 2022 .

[3]  Das Amrita,et al.  Mining Association Rules between Sets of Items in Large Databases , 2013 .

[4]  Yoram Singer,et al.  BoosTexter: A Boosting-based System for Text Categorization , 2000, Machine Learning.

[5]  Anna Maria Radzikowska,et al.  A comparative study of fuzzy rough sets , 2002, Fuzzy Sets Syst..

[6]  Zhi-Hua Zhou,et al.  Multilabel dimensionality reduction via dependence maximization , 2008, TKDD.

[7]  Qiang Shen,et al.  New Approaches to Fuzzy-Rough Feature Selection , 2009, IEEE Transactions on Fuzzy Systems.

[8]  Newton Spolaôr,et al.  A Comparison of Multi-label Feature Selection Methods using the Problem Transformation Approach , 2013, CLEI Selected Papers.

[9]  A. Nur Zincir-Heywood,et al.  Evaluation of Two Systems on Multi-class Multi-label Document Classification , 2005, ISMIS.

[10]  Grigorios Tsoumakas,et al.  Multi-Label Classification: An Overview , 2007, Int. J. Data Warehous. Min..

[11]  Yoram Singer,et al.  Improved Boosting Algorithms Using Confidence-rated Predictions , 1998, COLT' 98.

[12]  Ansheng Deng,et al.  Multi-label Fuzzy Similarity-Based Nearest-Neighbour Classification Using Association Rule , 2016, IDEAL.

[13]  Peter I. Cowling,et al.  MMAC: a new multi-class, multi-label associative classification approach , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[14]  Gregory Piatetsky-Shapiro,et al.  Discovery, Analysis, and Presentation of Strong Rules , 1991, Knowledge Discovery in Databases.

[15]  Dae-Won Kim,et al.  Feature selection for multi-label classification using multivariate mutual information , 2013, Pattern Recognit. Lett..

[16]  Víctor Robles,et al.  Feature selection for multi-label naive Bayes classification , 2009, Inf. Sci..

[17]  Li Shining,et al.  Multi-Label Feature Selection Algorithm Based on Information Entropy , 2013 .

[18]  Shenghuo Zhu,et al.  Empirical Studies on Multi-label Classification , 2006, 2006 18th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'06).

[19]  Sunita Sarawagi,et al.  Discriminative Methods for Multi-labeled Classification , 2004, PAKDD.

[20]  Jiawei Han,et al.  Correlated multi-label feature selection , 2011, CIKM '11.

[21]  Qiang Yang,et al.  Document Transformation for Multi-label Feature Selection in Text Categorization , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[22]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[23]  Geoff Holmes,et al.  Multi-label Classification Using Ensembles of Pruned Sets , 2008, 2008 Eighth IEEE International Conference on Data Mining.