Democratic approximation of lexicographic preference models

Lexicographic preference models (LPMs) are an intuitive representation that corresponds to many real-world preferences exhibited by human decision makers. Previous algorithms for learning LPMs produce a ''best guess'' LPM that is consistent with the observations. Our approach is more democratic: we do not commit to a single LPM. Instead, we approximate the target using the votes of a collection of consistent LPMs. We present two variations of this method-variable voting and model voting-and empirically show that these democratic algorithms outperform the existing methods. Versions of these democratic algorithms are presented in both the case where the preferred values of attributes are known and the case where they are unknown. We also introduce an intuitive yet powerful form of background knowledge to prune some of the possible LPMs. We demonstrate how this background knowledge can be incorporated into variable and model voting and show that doing so improves performance significantly, especially when the number of observations is small.

[1]  Jonathan A. Stirk,et al.  Singleton bias and lexicographic preferences among equally valued alternatives , 1999 .

[2]  Ronen I. Brafman,et al.  CP-nets: A Tool for Representing and Reasoning withConditional Ceteris Paribus Preference Statements , 2011, J. Artif. Intell. Res..

[3]  Mirjam R. M. Westenberg,et al.  Multi-attribute evaluation process: Methodological and conceptual issues , 1994 .

[4]  Yoram Singer,et al.  Learning to Order Things , 1997, NIPS.

[5]  John N. Tsitsiklis,et al.  Parallel and distributed computation , 1989 .

[6]  A. Tversky Intransitivity of preferences. , 1969 .

[7]  József Dombi,et al.  Learning lexicographic orders , 2007, Eur. J. Oper. Res..

[8]  R. Dawes Judgment under uncertainty: The robust beauty of improper linear models in decision making , 1979 .

[9]  Michael Schmitt,et al.  On the Complexity of Learning Lexicographic Strategies , 2006, J. Mach. Learn. Res..

[10]  R. Rivest Learning Decision Lists , 1987, Machine Learning.

[11]  G Gigerenzer,et al.  Reasoning the fast and frugal way: models of bounded rationality. , 1996, Psychological review.

[12]  J. Ford,et al.  Process tracing methods: Contributions, problems, and neglected research questions , 1989 .

[13]  G. Dantzig,et al.  Notes on Linear Programming: Part 1. The Generalized Simplex Method for Minimizing a Linear Form under Linear Inequality Restraints , 1954 .

[14]  Marie desJardins,et al.  More-or-Less CP-Networks , 2007, UAI.

[15]  A. Quesada Negative results in the theory of games with lexicographic utilities , 2003 .

[16]  Antonio Quesada On the existence of self-enforcing equilibria , 2001 .

[17]  Eyke Hüllermeier,et al.  Pairwise Preference Learning and Ranking , 2003, ECML.

[18]  Peter C. Fishburn,et al.  LEXICOGRAPHIC ORDERS, UTILITIES AND DECISION RULES: A SURVEY , 1974 .

[19]  Jude W. Shavlik,et al.  Relational Macros for Transfer in Reinforcement Learning , 2007, ILP.