Real Time Demand Learning-Based Q-learning Approach for Dynamic Pricing in E-retailing Setting

Information technology has given e-retailers new capability of learning demand in real time. This paper investigates how to integrate this real time learning technology with Q-learning algorithm for the optimization of dynamic pricing in e-retailing setting. Especially, this paper studies the optimal dynamic pricing problem for seasonal and style products in e-retailing setting, and validate our approach in simulated test.