Preference Learning in Automated Negotiation Using Gaussian Uncertainty Models

In this paper, we propose a general two-objective Markov Decision Process (MDP) modeling paradigm for automated negotiation with incomplete information, in which preference elicitation alternates with negotiation actions, with the objective to optimize negotiation outcomes. The key ingredient in our MDP framework is a stochastic utility model governed by a Gaussian law, formalizing the agent's belief (uncertainty) over the user's preferences. Our belief model is fairly general and can be updated in real time as new data becomes available, which makes it a fundamental modeling tool.

[1]  Sarit Kraus,et al.  Multiagent Negotiation under Time Constraints , 1995, Artif. Intell..

[2]  David C. Kingsley,et al.  Preference Uncertainty, Preference Learning, and Paired Comparison Experiments , 2010, Land Economics.

[3]  Daphne Koller,et al.  Making Rational Decisions Using Adaptive Utility Elicitation , 2000, AAAI/IAAI.

[4]  Craig Boutilier,et al.  A POMDP formulation of preference elicitation problems , 2002, AAAI/IAAI.

[5]  Wei Chu,et al.  Preference learning with Gaussian processes , 2005, ICML.

[6]  E. Gerding,et al.  High frequency trading strategies, market fragility and price spikes: an agent basedmodel perspective , 2018 .

[7]  Yakov Ben-Haim,et al.  Decision Making in Times of Knightian Uncertainty: An Info-Gap Perspective , 2016 .

[8]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[9]  Kwang Mong Sim,et al.  Agent-Based Cloud Computing , 2012, IEEE Transactions on Services Computing.

[10]  Wolfgang Ketter,et al.  Power TAC: A competitive economic simulation of the smart grid , 2013 .

[11]  Roie Zivan,et al.  POMDP based Negotiation Modeling , 2009 .

[12]  Sarvapali D. Ramchurn,et al.  Minimising intrusiveness in pervasive computing environments using multi-agent negotiation , 2004, The First Annual International Conference on Mobile and Ubiquitous Systems: Networking and Services, 2004. MOBIQUITOUS 2004..

[13]  Kuldeep Kumar,et al.  Agent-based negotiation and decision making for dynamic supply chain formation , 2009, Eng. Appl. Artif. Intell..

[14]  Yasser F. O. Mohammad,et al.  FastVOI: Efficient Utility Elicitation During Negotiations , 2018, PRIMA.

[15]  Ash Booth,et al.  High frequency trading strategies, market fragility and price spikes: an agent based model perspective , 2018, Annals of Operations Research.

[16]  Enrico Gerding,et al.  Optimal Incremental Preference Elicitation during Negotiation , 2015, IJCAI.

[17]  Wei Chu,et al.  Gaussian Processes for Ordinal Regression , 2005, J. Mach. Learn. Res..