Chasing Demand: Learning and Earning in a Changing Environment

We consider a dynamic pricing problem in which a seller faces an unknown demand model that can change over time. The amount of change over a time horizon of T periods is measured using a variation metric that allows for a broad spectrum of temporal behavior. Given a finite variation “budget,” we first derive a lower bound on the expected performance gap between any pricing policy and a clairvoyant who knows a priori the temporal evolution of the underlying demand model. We then design families of near-optimal pricing policies whose revenue performance asymptotically matches this lower bound. We also show that the seller can achieve substantially better revenue performance in demand environments that change in “bursts” than in those that change “smoothly,” which, among other things, quantifies the net effect of “volatility” in the demand environment on the seller’s revenue performance.
