The Gaussian process prior formulation introduced by us in this paper learns a mapping for ordinal regression task using dual sets of latent functions. In this formulation one set of latent functions are associated with data items and the other set of latent functions are associated with entities. An entity is a term introduced by us in this work to refer to the object responsible for assigning ordinal labels to data items. For example in the collaborative filtering problem an entity corresponds to a user. In our work we assume that the entities cluster, and we use latent functions to having a Gaussian process prior to model these clusters. Similarly we also assume that the data items cluster and use latent functions having a Gaussian process prior to model these clusters. We learn the parameters of these Gaussian processes in a discriminative learning framework by minimizing a loss function using an alternating minimization procedure. The purpose of introducing dual sets of latent functions is to overcome the deficiency in the predictive nature of discriminative models for ordinal regression tasks while learning from less training data unlike generative models which have a good performance even while learning from less training data. Thus we evaluate the performance of our model on two problems, collaborative filtering and image annotation, by comparing with well known baseline methods using a generative model approach so as to understand the efficacy of our model on less training data.
[1]
Robert L. Mercer,et al.
The Mathematics of Statistical Machine Translation: Parameter Estimation
,
1993,
CL.
[2]
P. McCullagh.
Regression Models for Ordinal Data
,
1980
.
[3]
Hans-Peter Kriegel,et al.
Collaborative ordinal regression
,
2006,
ICML.
[4]
Thomas Hofmann,et al.
Latent Class Models for Collaborative Filtering
,
1999,
IJCAI.
[5]
David A. Forsyth,et al.
Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary
,
2002,
ECCV.
[6]
Wei Chu,et al.
Gaussian Processes for Ordinal Regression
,
2005,
J. Mach. Learn. Res..
[7]
David A. Forsyth,et al.
Matching Words and Pictures
,
2003,
J. Mach. Learn. Res..
[8]
Michael I. Jordan,et al.
On Discriminative vs. Generative Classifiers: A comparison of logistic regression and naive Bayes
,
2001,
NIPS.
[9]
Yoram Singer,et al.
Learning to Order Things
,
1997,
NIPS.
[10]
J. Cullum,et al.
Lanczos Algorithms for Large Symmetric Eigenvalue Computations Vol. I Theory
,
1984
.
[11]
Thomas Hofmann,et al.
Stochastic Relational Models for Discriminative Link Prediction
,
2007
.
[12]
Dimitri P. Bertsekas,et al.
Nonlinear Programming
,
1997
.