A three-way selective ensemble model for multi-label classification

Abstract Label ambiguity and data complexity are widely recognized as major challenges in multi-label classification. Existing studies strive to find approximate representations concerning label semantics, however, most of them are predefined, neglecting the personality of instance-label pair. To circumvent this drawback, this paper proposes a three-way selective ensemble (TSEN) model. In this model, three-way decisions is responsible for minimizing uncertainty, whereas ensemble learning is in charge of optimizing label associations. Both label ambiguity and data complexity are firstly reduced, which is realized by a modified probabilistic rough set. For reductions with shared attributes, we further promote the prediction performance by an ensemble strategy. The components in base classifiers are label-specific, and the voting results of instance-based level are utilized for tri-partition. Positive and negative decisions are determined directly, whereas the deferment region is determined by label-specific reduction. Empirical studies on a collection of benchmarks demonstrate that TSEN achieves competitive performance against state-of-the-art multi-label classification algorithms.

[1]  Xiaonan Li,et al.  Three-way decisions approach to multiple attribute group decision making with linguistic information-based decision-theoretic rough fuzzy set , 2018, Int. J. Approx. Reason..

[2]  Hsuan-Tien Lin,et al.  Multilabel Classification with Principal Label Space Transformation , 2012, Neural Computation.

[3]  Thierry Denoeux,et al.  Representing uncertainty on set-valued variables using belief functions , 2010, Artif. Intell..

[4]  Fang Liu,et al.  A novel dynamic rough subspace based selective ensemble , 2015, Pattern Recognit..

[5]  Min-Ling Zhang,et al.  A Review on Multi-Label Learning Algorithms , 2014, IEEE Transactions on Knowledge and Data Engineering.

[6]  Guoyin Wang,et al.  Monotonic uncertainty measures for attribute reduction in probabilistic rough set model , 2015, Int. J. Approx. Reason..

[7]  Yiyu Yao,et al.  A Decision Theoretic Framework for Approximating Concepts , 1992, Int. J. Man Mach. Stud..

[8]  Geoff Holmes,et al.  MEKA: A Multi-label/Multi-target Extension to WEKA , 2016, J. Mach. Learn. Res..

[9]  Sebastián Ventura,et al.  A Tutorial on Multilabel Learning , 2015, ACM Comput. Surv..

[10]  Zhi-Hua Zhou,et al.  Multi-Label Learning by Exploiting Label Correlations Locally , 2012, AAAI.

[11]  Witold Pedrycz,et al.  Neighborhood rough sets based multi-label classification for automatic image annotation , 2013, Int. J. Approx. Reason..

[12]  Thierry Denoeux,et al.  Evidential Multi-label Classification Using the Random k-Label Sets Approach , 2012, Belief Functions.

[13]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[14]  Grigorios Tsoumakas,et al.  MULAN: A Java Library for Multi-Label Learning , 2011, J. Mach. Learn. Res..

[15]  Shou-De Lin,et al.  Generalized k-Labelsets Ensemble for Multi-Label and Cost-Sensitive Classification , 2014, IEEE Transactions on Knowledge and Data Engineering.

[16]  Hsuan-Tien Lin,et al.  Progressive random k-labelsets for cost-sensitive multi-label classification , 2017, Machine Learning.

[17]  Claudio Gentile,et al.  On multilabel classification and ranking with bandit feedback , 2014, J. Mach. Learn. Res..

[18]  Dae-Won Kim,et al.  SCLS: Multi-label feature selection based on scalable criterion for large label set , 2017, Pattern Recognit..

[19]  Nan Zhang,et al.  Attribute reduction for sequential three-way decisions under dynamic granulation , 2017, Int. J. Approx. Reason..

[20]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[21]  Lei Wu,et al.  Lift: Multi-Label Learning with Label-Specific Features , 2015, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Jing Zhang,et al.  A Variable Precision Attribute Reduction Approach in Multilabel Decision Tables , 2014, TheScientificWorldJournal.

[24]  Jianmin Wang,et al.  Multi-label Classification via Feature-aware Implicit Label Space Encoding , 2014, ICML.

[25]  Jie Duan,et al.  Multi-label feature selection based on neighborhood mutual information , 2016, Appl. Soft Comput..

[26]  Zhi-Hua Zhou,et al.  Multilabel dimensionality reduction via dependence maximization , 2008, TKDD.

[27]  刘景华,et al.  Multi-label feature selection based on max-dependency and min-redundancy , 2015 .

[28]  Xindong Wu,et al.  Learning Label-Specific Features and Class-Dependent Labels for Multi-Label Classification , 2016, IEEE Transactions on Knowledge and Data Engineering.

[29]  Wei Tang,et al.  Ensembling neural networks: Many could be better than all , 2002, Artif. Intell..

[30]  Yiyu Yao,et al.  On Reduct Construction Algorithms , 2006, RSKT.

[31]  Dae-Won Kim,et al.  Feature selection for multi-label classification using multivariate mutual information , 2013, Pattern Recognit. Lett..

[32]  Yiyu Yao,et al.  The superiority of three-way decisions in probabilistic rough set models , 2011, Inf. Sci..

[33]  Lei Wang,et al.  Sentiment analysis of text based on three-way decisions , 2017, J. Intell. Fuzzy Syst..

[34]  Janusz Zalewski,et al.  Rough sets: Theoretical aspects of reasoning about data , 1996 .

[35]  Sarah Vluymans,et al.  Multi-label classification using a fuzzy rough neighborhood consensus , 2018, Inf. Sci..

[36]  Jing-Yu Yang,et al.  Multi-label learning with label-specific feature reduction , 2016, Knowl. Based Syst..

[37]  Chong Ho Lee,et al.  Addressing class-imbalance in multi-label learning via two-stage multi-label hypernetwork , 2017, Neurocomputing.

[38]  Shunxiang Wu,et al.  Multi-label learning based on label-specific features and local pairwise label correlation , 2018, Neurocomputing.

[39]  Grigorios Tsoumakas,et al.  Random k -Labelsets: An Ensemble Method for Multilabel Classification , 2007, ECML.

[40]  Hua Li,et al.  A novel attribute reduction approach for multi-label data based on rough set theory , 2016, Inf. Sci..

[41]  Heung Wong,et al.  On two novel types of three-way decisions in three-way decision spaces , 2017, Int. J. Approx. Reason..

[42]  Lior Rokach,et al.  Ensemble methods for multi-label classification , 2013, Expert Syst. Appl..

[43]  Geoff Holmes,et al.  Classifier chains for multi-label classification , 2009, Machine Learning.

[44]  Theresa Beaubouef,et al.  Rough Sets , 2019, Lecture Notes in Computer Science.

[45]  Xin Geng,et al.  Label Distribution Learning , 2013, 2013 IEEE 13th International Conference on Data Mining Workshops.

[46]  Yoram Singer,et al.  Improved Boosting Algorithms Using Confidence-rated Predictions , 1998, COLT' 98.

[47]  Sunita Sarawagi,et al.  Discriminative Methods for Multi-labeled Classification , 2004, PAKDD.

[48]  Qingming Huang,et al.  Multi-label classification by exploiting local positive and negative pairwise label correlation , 2017, Neurocomputing.

[49]  Jianxin Wu,et al.  Deep Label Distribution Learning With Label Ambiguity , 2016, IEEE Transactions on Image Processing.

[50]  Saso Dzeroski,et al.  Two stage architecture for multi-label learning , 2012, Pattern Recognit..

[51]  Yoram Singer,et al.  BoosTexter: A Boosting-based System for Text Categorization , 2000, Machine Learning.

[52]  Michael K. Ng,et al.  ML-FOREST: A Multi-Label Tree Ensemble Method for Multi-Label Classification , 2016, IEEE Transactions on Knowledge and Data Engineering.

[53]  Grigorios Tsoumakas,et al.  Random K-labelsets for Multilabel Classification , 2022 .

[54]  Dae-Won Kim,et al.  Memetic feature selection algorithm for multi-label classification , 2015, Inf. Sci..

[55]  Duoqian Miao,et al.  Three-way attribute reducts , 2017, Int. J. Approx. Reason..

[56]  Yuan-Hai Shao,et al.  MLTSVM: A novel twin support vector machine to multi-label learning , 2016, Pattern Recognit..

[57]  Chen Lin,et al.  LibD3C: Ensemble classifiers with a clustering and dynamic selection strategy , 2014, Neurocomputing.

[58]  Lu Sun,et al.  Fast random k-labELsets for large-scale multi-label classification , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[59]  Yiyu Yao,et al.  Three-Way Decision: An Interpretation of Rules in Rough Set Theory , 2009, RSKT.

[60]  Witold Pedrycz,et al.  Granular multi-label feature selection based on mutual information , 2017, Pattern Recognit..

[61]  Eneldo Loza Mencía,et al.  Learning rules for multi-label classification: a stacking and a separate-and-conquer approach , 2016, Machine Learning.

[62]  Huangjian Yi,et al.  Generalized three-way decision models based on subset evaluation , 2017, Int. J. Approx. Reason..

[63]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..