Real-time Multiattribute Bayesian Preference Elicitation with Pairwise Comparison Queries

Preference elicitation (PE) is an very important component of interactive decision support systems that aim to make optimal recommendations to users by actively querying their preferences In this paper, we present three principles important for PE in real-world problems: (1) multiattribute, (2) low cognitive load, and (3) robust to noise In light of three requirements, we introduce an approximate PE framework based on a variant of TrueSkill for performing efficient closed-form Bayesian updates and query selection for a multiattribute utility belief state — a novel PE approach that naturally facilitates the efficient evaluation of value of information (VOI) for use in query selection strategies Our VOI query strategy satisfies all three principles and performs on par with the most accurate algorithms on experiments with a synthetic data set.

[1]  Thomas Hofmann,et al.  TrueSkill™: A Bayesian Skill Rating System , 2007 .

[2]  Daphne Koller,et al.  Utilities as Random Variables: Density Estimation and Structure Discovery , 2000, UAI.

[3]  R. A. Bradley,et al.  RANK ANALYSIS OF INCOMPLETE BLOCK DESIGNS THE METHOD OF PAIRED COMPARISONS , 1952 .

[4]  Ralph L. Keeney,et al.  Decisions with multiple objectives: preferences and value tradeoffs , 1976 .

[5]  Vincent Conitzer Eliciting single-peaked preferences using comparison queries , 2007, AAMAS '07.

[6]  R. L. Keeney,et al.  Decisions with Multiple Objectives: Preferences and Value Trade-Offs , 1977, IEEE Transactions on Systems, Man, and Cybernetics.

[7]  Craig Boutilier,et al.  Context-Specific Independence in Bayesian Networks , 1996, UAI.

[8]  Daphne Koller,et al.  Making Rational Decisions Using Adaptive Utility Elicitation , 2000, AAAI/IAAI.

[9]  Ronald A. Howard,et al.  Information Value Theory , 1966, IEEE Trans. Syst. Sci. Cybern..

[10]  Nicholas Roy,et al.  The permutable POMDP: fast solutions to POMDPs for preference elicitation , 2008, AAMAS.

[11]  Craig Boutilier,et al.  Regret-based optimal recommendation sets in conversational recommender systems , 2009, RecSys '09.

[12]  Craig Boutilier,et al.  A POMDP formulation of preference elicitation problems , 2002, AAAI/IAAI.

[13]  Daphne Koller,et al.  Learning an Agent's Utility Function by Observing Behavior , 2001, ICML.

[14]  Tom Minka,et al.  TrueSkillTM: A Bayesian Skill Rating System , 2006, NIPS.

[15]  X. Jin Factor graphs and the Sum-Product Algorithm , 2002 .