Large Margin Partial Label Machine

Partial label learning (PLL) is a multi-class weakly supervised learning problem in which each training instance is associated with a set of candidate labels, only one of which is the ground truth. The main challenge in PLL is resolving this label ambiguity. Among the various disambiguation techniques, large margin (LM)-based algorithms have attracted considerable attention because of their strong discriminative performance. However, existing LM-based algorithms either neglect some candidate labels when constructing the margin or rely on an auxiliary estimate of class capacities, which is generally inaccurate. As a result, their generalization performance deteriorates. To address these drawbacks, and motivated by the optimistic superset loss, we propose the LM Partial LAbel machiNE (LM-PLANE), which extends multi-class support vector machines (SVMs) to PLL. Compared with existing LM-based disambiguation algorithms, LM-PLANE considers the margin of all candidate labels and requires no auxiliary estimation of class capacities. Furthermore, we develop an efficient cutting plane (CP) method to train LM-PLANE in the dual space, and we present theoretical insights into the effectiveness and convergence of this method. Extensive experiments on various PLL tasks demonstrate that LM-PLANE achieves higher classification accuracy than existing LM-based and other representative PLL algorithms.
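
To make the idea of an optimistic superset loss over a candidate set concrete, the following is a minimal sketch in plain Python. It is not the LM-PLANE objective itself, which the abstract does not spell out. It assumes a standard multi-class hinge loss of the Crammer-Singer form as the base loss, and it takes the optimistic superset loss to be the minimum of that base loss over the candidate labels. The function names, the score vector, and the candidate sets are hypothetical and chosen only for illustration.

```python
from typing import Sequence, Set


def multiclass_hinge(scores: Sequence[float], label: int) -> float:
    """Multi-class hinge loss for one instance (assumed base loss).

    Computes max(0, 1 + max_{k != label} scores[k] - scores[label]).
    """
    best_other = max(s for k, s in enumerate(scores) if k != label)
    return max(0.0, 1.0 + best_other - scores[label])


def optimistic_superset_loss(scores: Sequence[float], candidates: Set[int]) -> float:
    """Smallest base loss attainable by any label in the candidate set.

    Every candidate label is considered, so no candidate is dropped
    when the margin is evaluated.
    """
    if not candidates:
        raise ValueError("candidate set must contain at least one label")
    return min(multiclass_hinge(scores, y) for y in candidates)


# Hypothetical example: four classes, model scores for one instance.
scores = [2.0, 0.5, 1.8, -1.0]

# The instance is annotated with candidate labels {1, 2}; the loss is
# driven by the candidate that the current model fits best (label 2).
loss_ambiguous = optimistic_superset_loss(scores, {1, 2})

# With a single candidate the loss reduces to the ordinary hinge loss.
loss_supervised = optimistic_superset_loss(scores, {0})

print(loss_ambiguous, loss_supervised)
```

When the candidate set contains a single label, this quantity coincides with the ordinary supervised hinge loss, which is one way to see a candidate-set formulation as an extension of multi-class SVMs to PLL.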
