Democratic approximation of lexicographic preference models

Previous algorithms for learning lexicographic preference models (LPMs) produce a "best guess" LPM that is consistent with the observations. Our approach is more democratic: we do not commit to a single LPM. Instead, we approximate the target using the votes of a collection of consistent LPMs. We present two variations of this method---variable voting and model voting---and empirically show that these democratic algorithms outperform the existing methods. We also introduce an intuitive yet powerful learning bias to prune some of the possible LPMs. We demonstrate how this learning bias can be used with variable and model voting and show that the learning bias improves the learning curve significantly, especially when the number of observations is small.

[1]  G. Dantzig,et al.  Notes on Linear Programming: Part 1. The Generalized Simplex Method for Minimizing a Linear Form under Linear Inequality Restraints , 1954 .

[2]  A. Tversky Intransitivity of preferences. , 1969 .

[3]  Peter C. Fishburn,et al.  LEXICOGRAPHIC ORDERS, UTILITIES AND DECISION RULES: A SURVEY , 1974 .

[4]  R. Dawes Judgment under uncertainty: The robust beauty of improper linear models in decision making , 1979 .

[5]  J. Ford,et al.  Process tracing methods: Contributions, problems, and neglected research questions , 1989 .

[6]  John N. Tsitsiklis,et al.  Parallel and distributed computation , 1989 .

[7]  Mirjam R. M. Westenberg,et al.  Multi-attribute evaluation process: Methodological and conceptual issues , 1994 .

[8]  G Gigerenzer,et al.  Reasoning the fast and frugal way: models of bounded rationality. , 1996, Psychological review.

[9]  Yoram Singer,et al.  Learning to Order Things , 1997, NIPS.

[10]  Jonathan A. Stirk,et al.  Singleton bias and lexicographic preferences among equally valued alternatives , 1999 .

[11]  A. Quesada Negative results in the theory of games with lexicographic utilities , 2003 .

[12]  Eyke Hüllermeier,et al.  Pairwise Preference Learning and Ranking , 2003, ECML.

[13]  Ronen I. Brafman,et al.  CP-nets: A Tool for Representing and Reasoning withConditional Ceteris Paribus Preference Statements , 2011, J. Artif. Intell. Res..

[14]  R. Rivest Learning Decision Lists , 1987, Machine Learning.

[15]  Michael Schmitt,et al.  On the Complexity of Learning Lexicographic Strategies , 2006, J. Mach. Learn. Res..

[16]  József Dombi,et al.  Learning lexicographic orders , 2007, Eur. J. Oper. Res..

[17]  Jude W. Shavlik,et al.  Relational Macros for Transfer in Reinforcement Learning , 2007, ILP.

[18]  Marie desJardins,et al.  More-or-Less CP-Networks , 2007, UAI.