Multi-label Learning by Hyperparameters Calibration for Treating Class Imbalance

Multi-label learning has been becoming an increasingly active area into the machine learning community due to a wide variety of real world problems. However, only over the past few years class balancing for these kind of problems became a topic of interest. In this paper, we present a novel method named hyperparameter calibration to treat class imbalance in a multi-label problem, to this aim we develop an extensive analysis over four real-world databases and two own synthetic databases exhibiting different ratios of imbalance. The empirical analysis shows that the proposed method is able to improve the classification performance when it is combined with three of the most widely used strategies for treating multi-label classification problems.

[1]  Vasile Palade,et al.  Class Imbalance Learning Methods for Support Vector Machines , 2013 .

[2]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[3]  Francisco Charte,et al.  MLSMOTE: Approaching imbalanced multilabel learning through synthetic instance generation , 2015, Knowl. Based Syst..

[4]  Eyke Hüllermeier,et al.  Multilabel classification via calibrated label ranking , 2008, Machine Learning.

[5]  Jiebo Luo,et al.  Learning multi-label scene classification , 2004, Pattern Recognit..

[6]  Michal Wozniak,et al.  CCR: A combined cleaning and resampling algorithm for imbalanced data classification , 2017, Int. J. Appl. Math. Comput. Sci..

[7]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[8]  Germán Castellanos-Domínguez,et al.  Evaluation of Example-Based Measures for Multi-label Classification Performance , 2015, IWBBIO.

[9]  Jason Weston,et al.  A kernel method for multi-labelled classification , 2001, NIPS.

[10]  Francisco Charte,et al.  Addressing imbalance in multilabel classification: Measures and random resampling algorithms , 2015, Neurocomputing.

[11]  Josef Kittler,et al.  Inverse random under sampling for class imbalance problem and its application to multi-label classification , 2012, Pattern Recognit..

[12]  Marti A. Hearst Trends & Controversies: Support Vector Machines , 1998, IEEE Intell. Syst..

[13]  Vasile Palade,et al.  FSVM-CIL: Fuzzy Support Vector Machines for Class Imbalance Learning , 2010, IEEE Transactions on Fuzzy Systems.

[14]  Gert R. G. Lanckriet,et al.  Semantic Annotation and Retrieval of Music and Sound Effects , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[15]  Grigorios Tsoumakas,et al.  Multi-Label Classification: An Overview , 2007, Int. J. Data Warehous. Min..

[16]  Newton Spolaôr,et al.  A Framework to Generate Synthetic Multi-label Datasets , 2014, CLEI Selected Papers.

[17]  Germán Castellanos-Domínguez,et al.  A comparison of multi-label techniques based on problem transformation for protein functional prediction , 2013, 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[18]  Geoff Holmes,et al.  Classifier chains for multi-label classification , 2009, Machine Learning.

[19]  Piotr Synak,et al.  Multi-Label Classification of Emotions in Music , 2006, Intelligent Information Systems.