Partial label learning based on label distributions and error-correcting output codes

Partial label learning (PLL) is a class of weakly supervised learning problems in which each training sample is associated with a set of candidate labels, exactly one of which is correct. In this paper, we propose PL-PIE, a new PLL algorithm that combines prior information about the label distribution with error-correcting output codes (ECOC). PL-PIE uses the ECOC framework to decompose the problem into multiple binary problems. Existing approaches split the classes into positive and negative groups at random, which makes their performance unstable; PL-PIE instead exploits prior information about the label distribution to generate the positive and negative classes, which yields stable performance. Extensive experimental results demonstrate that PL-PIE is highly competitive with state-of-the-art PLL algorithms.
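The ECOC decomposition described above can be illustrated with a minimal Python sketch. It is not the PL-PIE algorithm itself, whose details are not given in this abstract. The function names, the uniform-weight estimate of the label distribution, and the greedy balanced split are all assumptions made for illustration. The rule for building each binary training set (keep a sample only when its whole candidate set falls on one side of the dichotomy) is the convention commonly used in ECOC-based PLL.

```python
from typing import List, Sequence, Set, Tuple


def estimate_label_mass(candidate_sets: Sequence[Set[int]], num_classes: int) -> List[float]:
    """Hypothetical prior: each sample spreads unit weight evenly over its candidates."""
    mass = [0.0] * num_classes
    for cands in candidate_sets:
        for label in cands:
            mass[label] += 1.0 / len(cands)
    return mass


def balanced_dichotomy(mass: Sequence[float]) -> Tuple[Set[int], Set[int]]:
    """Assumed heuristic: greedily split classes into two groups of similar total mass.

    This is deterministic, unlike a random dichotomy, but it is only a stand-in
    for whatever prior-informed rule the paper actually uses.
    """
    positive: Set[int] = set()
    negative: Set[int] = set()
    pos_mass = neg_mass = 0.0
    # Heaviest classes first; ties broken by class index for reproducibility.
    for label in sorted(range(len(mass)), key=lambda c: (-mass[c], c)):
        if pos_mass <= neg_mass:
            positive.add(label)
            pos_mass += mass[label]
        else:
            negative.add(label)
            neg_mass += mass[label]
    return positive, negative


def binary_training_set(
    candidate_sets: Sequence[Set[int]], positive: Set[int]
) -> Tuple[List[int], List[int]]:
    """Build one binary problem from a dichotomy.

    A sample is positive if all its candidates are in the positive group,
    negative if none are, and skipped if its candidates straddle both groups.
    """
    indices: List[int] = []
    targets: List[int] = []
    for i, cands in enumerate(candidate_sets):
        if cands <= positive:
            indices.append(i)
            targets.append(+1)
        elif cands.isdisjoint(positive):
            indices.append(i)
            targets.append(-1)
    return indices, targets


# Toy data: seven samples, four classes, each sample has a candidate label set.
candidate_sets = [{0, 1}, {0}, {2}, {2, 3}, {1}, {3}, {0, 2}]
mass = estimate_label_mass(candidate_sets, num_classes=4)
positive, negative = balanced_dichotomy(mass)
indices, targets = binary_training_set(candidate_sets, positive)
print(positive, negative)  # {0, 1} {2, 3}
print(indices, targets)    # [0, 1, 2, 3, 4, 5] [1, 1, -1, -1, 1, -1]
```

In this toy run the last sample, with candidates {0, 2}, straddles the two groups and is left out of the binary problem. Repeating the split with different dichotomies would give the columns of a coding matrix, with one binary classifier trained per column.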
