Dynamic Pricing with Online Learning and Strategic Consumers: An Application of the Aggregating Algorithm

We study the problem faced by a monopolistic company that is dynamically pricing a perishable product or service and simultaneously learning the demand characteristics of its customers. In the learning procedure, the company observes the sales history over consecutive learning stages and predicts consumer demand by applying an aggregating algorithm (AA) to a pool of online stochastic predictors. Numerical implementation uses finite-sample distribution approximations that are periodically updated using the most recent sales data. These are subsequently altered with a random step characterizing the stochastic predictors. The company's pricing policy is optimized with a simulation-based procedure integrated with AA. The methodology of the paper is general and independent of specific distributional assumptions. We illustrate this procedure on a demand model for a market in which customers are aware that pricing is dynamic, may time their purchases strategically, and compete for a limited product supply. We derive the form of this demand model using a game-theoretic consumer choice model and study its structural properties. Numerical experiments demonstrate that the learning procedure is robust to deviations of the actual market from the model of the market used in learning.

[1]  Ronald J. Balvers,et al.  Actively Learning about Demand and the Dynamics of Price Adjustment , 1990 .

[2]  Samuel E. Bodily,et al.  A Taxonomy and Research Overview of Perishable-Asset Revenue Management: Yield Management, Overbooking, and Pricing , 1992, Oper. Res..

[3]  McGillJeff,et al.  Dynamic Pricing with Online Learning and Strategic Consumers , 2009 .

[4]  Yossi Aviv,et al.  Optimal Pricing of Seasonal Products in the Presence of Forward-Looking Consumers , 2008, Manuf. Serv. Oper. Manag..

[5]  Katya Scheinberg,et al.  Recent progress in unconstrained nonlinear optimization without derivatives , 1997, Math. Program..

[6]  Pinar Keskinocak,et al.  Designing Optimal Preannounced Markdowns in the Presence of Rational Customers with Multiunit Demands , 2008, Manuf. Serv. Oper. Manag..

[7]  N. Petruzzi,et al.  Dynamic pricing and inventory control with learning , 2002 .

[8]  Youyi Feng,et al.  A Continuous-Time Yield Management Model with Multiple Prices and Reversible Price Changes , 2000 .

[9]  Katya Scheinberg,et al.  On the convergence of derivative-free methods for unconstrained optimization , 1997 .

[10]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[11]  K. Talluri,et al.  The Theory and Practice of Revenue Management , 2004 .

[12]  Pinar Keskinocak,et al.  Dynamic pricing in the presence of inventory considerations: research overview, current practices, and future directions , 2003, IEEE Engineering Management Review.

[13]  Jeffrey I. McGill,et al.  Revenue Management: Research Overview and Prospects , 1999, Transp. Sci..

[14]  Vladimir Vovk,et al.  Derandomizing Stochastic Prediction Strategies , 1997, COLT '97.

[15]  G. Ryzin,et al.  Optimal dynamic pricing of inventories with stochastic demand over finite horizons , 1994 .

[16]  Tatsiana Levina,et al.  Online Methods for Portfolio Selection , 2006 .

[17]  A. Zeevi,et al.  Blind Nonparametric Revenue Management: Asymptotic Optimality of a Joint Learning and Pricing Method ∗ , 2006 .

[18]  Peter P. Belobaba,et al.  Survey Paper - Airline Yield Management An Overview of Seat Inventory Control , 1987, Transp. Sci..

[19]  C. Yano,et al.  Coordinated Pricing and Production/Procurement Decisions: A Review , 2005 .

[20]  T. Cover Universal Portfolios , 1996 .

[21]  Wenjiao Zhao,et al.  Optimal Dynamic Pricing for Perishable Assets with Nonhomogeneous Demand , 2000 .

[22]  D. Simchi-Levi,et al.  Coordination of Pricing and Inventory Decisions: A Survey and Classification , 2004 .

[23]  M. Puterman,et al.  Learning and pricing in an internet environment with binomial demands , 2005 .

[24]  Garrett J. van Ryzin,et al.  Strategic Capacity Rationing to Induce Early Purchases , 2008, Manag. Sci..

[25]  Xuanming Su,et al.  Intertemporal Pricing with Strategic Customer Behavior , 2007, Manag. Sci..

[26]  Kyle Y. Lin,et al.  Dynamic pricing with real-time demand learning , 2006, Eur. J. Oper. Res..

[27]  Yossi Aviv,et al.  Pricing of Short Life-Cycle Products through Active Learning∗ , 2002 .

[28]  R. Phillips,et al.  Pricing and Revenue Optimization , 2005 .

[29]  Vladimir Vovk,et al.  Aggregating strategies , 1990, COLT '90.

[30]  R. Meyer,et al.  The rational effect of price promotions on sales and consumption , 1993 .

[31]  Jeff McGill,et al.  Optimal Dynamic Pricing of Perishable Items by a Monopolist Facing Strategic Consumers , 2010 .

[32]  Moshe Shaked,et al.  Stochastic orders and their applications , 1994 .

[33]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[34]  Richard E. Chatwin,et al.  Optimal dynamic pricing of perishable products with stochastic demand and a finite set of prices , 2000, Eur. J. Oper. Res..

[35]  D. Bertsimas,et al.  Working Paper , 2022 .

[36]  Wayne L. Winston,et al.  Optimal price skimming by a monopolist facing rational consumers , 1990 .

[37]  Yossi Aviv,et al.  A Partially Observed Markov Decision Process for Dynamic Pricing , 2005, Manag. Sci..