This paper studies adaptive bilateral negotiation between software agents in e-commerce environments. Specifically, we assume that the agents are self-interested, the environment is dynamic, and both agents have deadlines. Such dynamism means that the agents' negotiation parameters (such as deadlines and reservation prices) are functions of both the state of the encounter and the environment. Given this, we develop an algorithm that the negotiating agents can use to adapt their strategies to changes in the environment in order to reach an agreement within their specific deadlines and before the resources available for negotiation are exhausted. In more detail, we formally define an adaptive negotiation model and cast it as a Markov decision process. Using a value iteration algorithm, we then indicate a novel solution technique for determining optimal policies for the negotiation problem without explicit knowledge of the dynamics of the system. We also solve a representative negotiation decision problem using this technique and show that it is a promising approach for analyzing negotiations in dynamic settings. Finally, through empirical evaluation, we show that the agents using our algorithm learn a negotiation strategy that adapts to the environment and enables them to reach agreements in a timely manner.
[1]
Nicholas R. Jennings,et al.
On Agent-Mediated Electronic Commerce
,
2003,
IEEE Trans. Knowl. Data Eng..
[2]
Michael P. Wellman,et al.
Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm
,
1998,
ICML.
[3]
Richard S. Sutton,et al.
Reinforcement Learning: An Introduction
,
1998,
IEEE Trans. Neural Networks.
[4]
Michael L. Littman,et al.
Markov Games as a Framework for Multi-Agent Reinforcement Learning
,
1994,
ICML.
[5]
A. Rubinstein.
Perfect Equilibrium in a Bargaining Model
,
1982
.
[6]
Nicholas R. Jennings,et al.
Negotiation decision functions for autonomous agents
,
1998,
Robotics Auton. Syst..
[7]
Ronald A. Howard,et al.
Dynamic Programming and Markov Processes
,
1960
.
[8]
N. R. Jennings,et al.
To appear in: Int Journal of Group Decision and Negotiation GDN2000 Keynote Paper Automated Negotiation: Prospects, Methods and Challenges
,
2022
.
[9]
Frédérick Garcia,et al.
A Learning Rate Analysis of Reinforcement Learning Algorithms in Finite-Horizon
,
1998,
ICML.