论文信息 - Dynamic pricing policies for interdependent perishable products or services using reinforcement learning

Dynamic pricing policies for interdependent perishable products or services using reinforcement learning

Dynamic prices maximize the expected revenue of interdependent products.Reinforcement learning optimizes the pricing of interdependent products.Interdependent pricing enhances learning. Many businesses offer multiple products or services that are interdependent, in which the demand for one is often affected by the prices of others. This article considers a revenue management problem of multiple interdependent products, in which dynamically adjusted over a finite sales horizon to maximize expected revenue, given an initial inventory for each product. The main contribution of this article is to use reinforcement learning to model the optimal pricing of perishable interdependent products when demand is stochastic and its functional form unknown. We show that reinforcement learning can be used to price interdependent products. Moreover, we analyze the performance of the Q-learning with eligibility traces algorithm under different conditions. We illustrate our analysis with the pricing of services.

Rupal Rana | Fernando S. Oliveira | F. Oliveira | R. Rana

[1] Dan Zhang,et al. Revenue Management for Parallel Flights with Customer-Choice Behavior , 2005, Oper. Res..

[2] Anton J. Kleywegt,et al. Models of the Spiral-Down Effect in Revenue Management , 2006, Oper. Res..

[3] Garrett J. van Ryzin,et al. A Multiproduct Dynamic Pricing Problem and Its Applications to Network Yield Management , 1997, Oper. Res..

[4] Susan H. Xu,et al. Joint Dynamic Pricing of Multiple Perishable Products Under Consumer Choice , 2010, Manag. Sci..

[5] Constantinos Maglaras,et al. Dynamic Pricing Strategies for Multi-Product Revenue Management Problems , 2009, Manuf. Serv. Oper. Manag..

[6] So Young Sohn,et al. Optimal pricing for mobile manufacturers in competitive market using genetic algorithm , 2009, Expert Syst. Appl..

[7] Rupal Rana,et al. Real-time dynamic pricing in a non-stationary environment using model-free reinforcement learning , 2014 .

[8] Yan Cheng. Dynamic Pricing for Multi-Products in E-Retailing , 2007, 2007 International Conference on Wireless Communications, Networking and Mobile Computing.

[9] Yan Cheng,et al. Dynamic packaging in e-retailing with stochastic demand over finite horizons: A Q-learning approach , 2009, Expert Syst. Appl..

[10] Chi-Bin Cheng,et al. Pricing and promotion strategies of an online shop based on customer segmentation and multiple objective decision making , 2011, Expert Syst. Appl..

[11] Shingo Mabu,et al. Adaptability analysis of genetic network programming with reinforcement learning in dynamically changing environments , 2012, Expert Syst. Appl..

[12] Sean B. Eom. A Survey of Operational Expert Systems in Business (1980–1993) , 1996 .

[13] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[14] Zhaohan Sheng,et al. Case-based reinforcement learning for dynamic inventory control in a multi-agent supply-chain system , 2009, Expert Syst. Appl..

[15] Habin Lee,et al. Agent based mobile negotiation for personalized pricing of last minute theatre tickets , 2012, Expert Syst. Appl..

[16] K. Talluri,et al. The Theory and Practice of Revenue Management , 2004 .

[17] Fernando S. Oliveira,et al. A Constraint Logic Programming Algorithm for Modeling Dynamic Pricing , 2008, INFORMS J. Comput..

[18] Bekir Karlik,et al. An artificial neural networks approach on automobile pricing , 2009, Expert Syst. Appl..

[19] Oscar Fontenla-Romero,et al. A comparative study of the scalability of a sensitivity-based learning algorithm for artificial neural networks , 2013, Expert Syst. Appl..

[20] Pinar Keskinocak,et al. Dynamic pricing in the presence of inventory considerations: research overview, current practices, and future directions , 2003, IEEE Engineering Management Review.