Ordinal Label Proportions

In Machine Learning, it is common to distinguish different degrees of supervision, ranging from fully supervised to completely unsupervised scenarios. However, lying in between those, the Learning from Label Proportions (LLP) setting [19] assumes the training data is provided in the form of bags, and the only supervision comes through the proportion of each class in each bag. In this paper, we present a novel version of the LLP paradigm where the relationship among the classes is ordinal. While this is a highly relevant scenario (e.g. customer surveys where the results can be divided into various degrees of satisfaction), it is as yet unexplored in the literature. We refer to this setting as Ordinal Label Proportions (OLP). We formally define the scenario and introduce an efficient algorithm to tackle it. We test our algorithm on synthetic and benchmark datasets. Additionally, we present a case study examining a dataset gathered from the Research Excellence Framework that assesses the quality of research in the United Kingdom.

[1]  Stefan R ping SVM Classifier Estimation from Group Probabilities , 2010, ICML 2010.

[2]  Alexander J. Smola,et al.  Estimating Labels from Label Proportions , 2009, J. Mach. Learn. Res..

[3]  Raúl Santos-Rodríguez,et al.  Cost-sensitive learning based on Bregman divergences , 2009, Machine Learning.

[4]  Ivor W. Tsang,et al.  Tighter and Convex Maximum Margin Clustering , 2009, AISTATS.

[5]  Pedro Antonio Gutiérrez,et al.  Ordinal Regression Methods: Survey and Experimental Study , 2016, IEEE Transactions on Knowledge and Data Engineering.

[6]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[7]  Dong Liu,et al.  $\propto$SVM for learning with label proportions , 2013, ICML 2013.

[8]  Eibe Frank,et al.  A Simple Approach to Ordinal Classification , 2001, ECML.

[9]  P. McCullagh Regression Models for Ordinal Data , 1980 .

[10]  Dale Schuurmans,et al.  Maximum Margin Clustering , 2004, NIPS.

[11]  Wei Chu,et al.  Support Vector Ordinal Regression , 2007, Neural Computation.

[12]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[13]  Eyke Hüllermeier,et al.  Is an ordinal class structure useful in classifier learning? , 2008, Int. J. Data Min. Model. Manag..

[14]  Shih-Fu Chang,et al.  On Learning with Label Proportions , 2014, ArXiv.

[15]  Xiaoming Zhang,et al.  Kernel Discriminant Learning for Ordinal Regression , 2010, IEEE Transactions on Knowledge and Data Engineering.

[16]  Klaus Obermayer,et al.  Support vector learning for ordinal regression , 1999 .

[17]  Oded Maron,et al.  Multiple-Instance Learning for Natural Scene Classification , 1998, ICML.

[18]  B. Scholkopf,et al.  Fisher discriminant analysis with kernels , 1999, Neural Networks for Signal Processing IX: Proceedings of the 1999 IEEE Signal Processing Society Workshop (Cat. No.98TH8468).

[19]  Thomas G. Dietterich,et al.  Solving the Multiple Instance Problem with Axis-Parallel Rectangles , 1997, Artif. Intell..

[20]  Nando de Freitas,et al.  Learning about Individuals from Group Statistics , 2005, UAI.