Dynamic Pricing and Demand Learning with Limited Price Experimentation

In a dynamic pricing problem where the demand function is not known a priori, price experimentation can be used as a demand learning tool. Existing literature usually assumes no constraint on price changes, but in practice, sellers often face business constraints that prevent them from conducting extensive experimentation. We consider a dynamic pricing model where the demand function is unknown but belongs to a known finite set. The seller is allowed to make at most m price changes during T periods. The objective is to minimize the worst-case regret-i.e., the expected total revenue loss compared with a clairvoyant who knows the demand distribution in advance. We demonstrate a pricing policy that incurs a regret of OlogmT, or m iterations of the logarithm. Furthermore, we describe an implementation of this pricing policy at Groupon, a large e-commerce marketplace for daily deals. The field study shows significant impact on revenue and bookings. The e-companion is available at https://doi.org/10.1287/opre.2017.1629 .

[1]  John N. Tsitsiklis,et al.  Linearly Parameterized Bandits , 2008, Math. Oper. Res..

[2]  Josef Broder,et al.  Dynamic Pricing Under a General Parametric Choice Model , 2012, Oper. Res..

[3]  Felipe Caro,et al.  Clearance Pricing Optimization for a Fast-Fashion Retailer , 2010, Oper. Res..

[4]  Özalp Özer,et al.  Information Acquisition for Capacity Planning via Pricing and Advance Selling: When to Stop and Act? , 2009, Oper. Res..

[5]  J.N. Tsitsiklis,et al.  A structured multiarmed bandit problem and the greedy policy , 2008, 2008 47th IEEE Conference on Decision and Control.

[6]  Gustavo J. Vulcano,et al.  Dynamic List Pricing , 2012 .

[7]  Susana V. Mondschein,et al.  Periodic Pricing of Seasonal Products in Retailing , 1997 .

[8]  Omar Besbes,et al.  On the (Surprising) Sufficiency of Linear Models for Dynamic Pricing with Demand Learning , 2014, Manag. Sci..

[9]  Serguei Netessine,et al.  Dynamic pricing of inventory/capacity with infrequent price changes , 2006, Eur. J. Oper. Res..

[10]  Zizhuo Wang,et al.  Close the Gaps: A Learning-While-Doing Algorithm for Single-Product Revenue Management Problems , 2014, Oper. Res..

[11]  J. Broder Online Algorithms For Revenue Management , 2011 .

[12]  G. Gallego,et al.  Optimal starting times for end-of-season sales and optimal stopping times for promotional fares , 1995 .

[13]  A. V. den Boer,et al.  Dynamic Pricing and Learning: Historical Origins, Current Research, and New Directions , 2013 .

[14]  Omar Besbes,et al.  Dynamic Pricing Without Knowing the Demand Function: Risk Bounds and Near-Optimal Algorithms , 2009, Oper. Res..

[15]  Frank Thomson Leighton,et al.  The value of knowing a demand curve: bounds on regret for online posted-price auctions , 2003, 44th Annual IEEE Symposium on Foundations of Computer Science, 2003. Proceedings..

[16]  Assaf J. Zeevi,et al.  Dynamic Pricing with an Unknown Demand Model: Asymptotically Optimal Semi-Myopic Policies , 2014, Oper. Res..

[17]  M. Rothschild A two-armed bandit theory of market pricing , 1974 .

[18]  Izak Duenyas,et al.  Real-Time Dynamic Pricing with Minimal and Flexible Price Adjustment , 2016, Manag. Sci..

[19]  Bert Zwart,et al.  Simultaneously Learning and Optimizing Using Controlled Variance Pricing , 2014, Manag. Sci..

[20]  Arnoud V. den Boer,et al.  Dynamic Pricing with Multiple Products and Partially Specified Demand Distribution , 2014, Math. Oper. Res..

[21]  J. Michael Harrison,et al.  Bayesian Dynamic Pricing Policies: Learning and Earning Under a Binary Prior Distribution , 2011, Manag. Sci..

[22]  Mark J. Zbaracki,et al.  Managerial and Customer Costs of Price Adjustment: Direct Evidence from Industrial Markets , 2004 .