Near-Optimal Primal-Dual Algorithms for Quantity-Based Network Revenue Management

We study the canonical quantity-based network revenue management (NRM) problem, in which the decision-maker must irrevocably accept or reject each arriving customer request with the goal of maximizing total revenue subject to limited resources. Solving the problem exactly by dynamic programming is computationally intractable because of the well-known curse of dimensionality. Existing work uses the solution to the deterministic linear program (DLP) to design asymptotically optimal algorithms, which rely on repeatedly solving DLPs to achieve near-optimal regret bounds. Repeatedly computing DLP solutions in real time is, however, time-consuming, especially in large-scale problems that may involve hundreds of millions of demand units. In this paper, we propose algorithms for the NRM problem that are easy to implement and do not require solving any DLPs. Our algorithm achieves a regret bound of $O(\log k)$, where $k$ is the system size. To the best of our knowledge, this is the first NRM algorithm that (i) has an $o(\sqrt{k})$ asymptotic regret bound and (ii) does not require solving any DLPs.
