Evaluating ReliefF-Based Multi-Label Feature Selection Algorithm

In multi-label learning, each instance is associated with multiple labels, which are often correlated. As other machine learning tasks, multi-label learning also suffers from the curse of dimensionality, which can be mitigated by feature selection. This work experimentally evaluates four multi-label feature selection algorithms that use the filter approach. Three of them are based on the ReliefF algorithm, which takes into account interacting features. The quality of the selected features is assessed by three different learning algorithms. Evaluating multi-label learning algorithms is a complicated task, as multiple evaluation measures, which might optimize different loss functions, should be considered. To this end, \(General_B\), a baseline algorithm which learns by only looking at the multi-labels of the dataset, is used as a reference. Results show that feature selection contributed to improve the performance of classifiers initially worse than \(General_B\) and highlight ReliefF-based algorithms in some experimental settings.

[1]  Sebastián Ventura,et al.  ReliefF-ML: An Extension of ReliefF Algorithm to Multi-label Learning , 2013, CIARP.

[2]  Marcos Aurélio Domingues,et al.  Three Current Issues In Music Autotagging , 2011, ISMIR.

[3]  Grigorios Tsoumakas,et al.  An Empirical Study of Lazy Multilabel Classification Algorithms , 2008, SETN.

[4]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[5]  Tore Dybå,et al.  Evidence-based software engineering , 2004, Proceedings. 26th International Conference on Software Engineering.

[6]  Hiroshi Motoda,et al.  Computational Methods of Feature Selection , 2022 .

[7]  Newton Spolaôr,et al.  A Comparison of Multi-label Feature Selection Methods using the Problem Transformation Approach , 2013, CLEI Selected Papers.

[8]  Saso Dzeroski,et al.  ReliefF for Hierarchical Multi-label Classification , 2013, NFMCP.

[9]  Janez Demsar,et al.  Algorithms for subsetting attribute values with Relief , 2010, Machine Learning.

[10]  George A. Vouros,et al.  Artificial Intelligence: Theories, Models and Applications, 5th Hellenic Conference on AI, SETN 2008, Syros, Greece, October 2-4, 2008. Proceedings , 2008, SETN.

[11]  Newton Spolaôr,et al.  ReliefF for Multi-label Feature Selection , 2013, 2013 Brazilian Conference on Intelligent Systems.

[12]  Lior Rokach,et al.  Data Mining And Knowledge Discovery Handbook , 2005 .

[13]  Min-Ling Zhang,et al.  A Review on Multi-Label Learning Algorithms , 2014, IEEE Transactions on Knowledge and Data Engineering.

[14]  Luis Alvarez,et al.  Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications , 2012, Lecture Notes in Computer Science.

[15]  Marko Robnik-Sikonja,et al.  Theoretical and Empirical Analysis of ReliefF and RReliefF , 2003, Machine Learning.

[16]  Chris H. Q. Ding,et al.  Multi-label ReliefF and F-statistic feature selections for image annotation , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Usama M. Fayyad,et al.  Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning , 1993, IJCAI.

[18]  Everton Alvares Cherman,et al.  On the Estimation of Predictive Evaluation Measure Baselines for Multi-label Learning , 2012, IBERAMIA.

[19]  M. C. Monard,et al.  A systematic review to identify feature selection publications in multi-labeled data , 2012 .

[20]  Everton Alvares Cherman,et al.  Incorporating label dependency into the binary relevance framework for multi-label classification , 2012, Expert Syst. Appl..

[21]  Grigorios Tsoumakas,et al.  Mining Multi-label Data , 2010, Data Mining and Knowledge Discovery Handbook.

[22]  Rubén Fuentes-Fernández,et al.  Advances in Artificial Intelligence – IBERAMIA 2012 , 2012, Lecture Notes in Computer Science.