Online Regularization for High-Dimensional Dynamic Pricing Algorithms

We propose a novel \textit{online regularization} scheme for revenue-maximization in high-dimensional dynamic pricing algorithms. The online regularization scheme equips the proposed optimistic online regularized maximum likelihood pricing (\texttt{OORMLP}) algorithm with three major advantages: encode market noise knowledge into pricing process optimism; empower online statistical learning with always-validity over all decision points; envelop prediction error process with time-uniform non-asymptotic oracle inequalities. This type of non-asymptotic inference results allows us to design safer and more robust dynamic pricing algorithms in practice. In theory, the proposed \texttt{OORMLP} algorithm exploits the sparsity structure of high-dimensional models and obtains a logarithmic regret in a decision horizon. These theoretical advances are made possible by proposing an optimistic online LASSO procedure that resolves dynamic pricing problems at the \textit{process} level, based on a novel use of non-asymptotic martingale concentration. In experiments, we evaluate \texttt{OORMLP} in different synthetic pricing problem settings and observe that \texttt{OORMLP} performs better than \texttt{RMLP} proposed in \cite{javanmard2019dynamic}.

[1]  Odalric-Ambrym Maillard Mathematics of Statistical Sequential Decision Making. (Mathématique de la prise de décision séquentielle statistique) , 2019 .

[2]  J. Wellner,et al.  Log-Concavity and Strong Log-Concavity: a review. , 2014, Statistics surveys.

[3]  Elad Hazan,et al.  Introduction to Online Convex Optimization , 2016, Found. Trends Optim..

[4]  Jean-Luc Ville Étude critique de la notion de collectif , 1939 .

[5]  Adel Javanmard,et al.  Multi-Product Dynamic Pricing in High-Dimensions with Heterogeneous Price Sensitivity , 2019, 2020 IEEE International Symposium on Information Theory (ISIT).

[6]  Yuancheng Zhu,et al.  Uncertainty Quantification for Online Learning and Stochastic Approximation via Hierarchical Incremental Gradient Descent , 2018, 1802.04876.

[7]  Zenglin Xu,et al.  Online Learning for Group Lasso , 2010, ICML.

[8]  N. B. Keskin,et al.  Personalized Dynamic Pricing with Machine Learning: High Dimensional Features and Heterogeneous Elasticity , 2020 .

[9]  Assaf J. Zeevi,et al.  Dynamic Pricing with an Unknown Demand Model: Asymptotically Optimal Semi-Myopic Policies , 2014, Oper. Res..

[10]  Yoram Singer,et al.  A primal-dual perspective of online learning algorithms , 2007, Machine Learning.

[11]  Jonas Mueller,et al.  Low-rank Bandit Methods for High-dimensional Dynamic Pricing , 2018, NeurIPS.

[12]  J. Harrison,et al.  Dynamic Pricing with an Unknown Linear Demand Model : Asymptotically Optimal Semi-myopic Policies , 2011 .

[13]  David Simchi-Levi,et al.  Dynamic Learning and Pricing with Model Misspecification , 2018, Manag. Sci..

[14]  Xi Chen,et al.  Uncertainty Quantification for Demand Prediction in Contextual Dynamic Pricing , 2020, Production and Operations Management.

[15]  Gábor Lugosi,et al.  Concentration Inequalities - A Nonasymptotic Theory of Independence , 2013, Concentration Inequalities.

[16]  Martin J. Wainwright,et al.  High-Dimensional Statistics , 2019 .

[17]  Adel Javanmard,et al.  Dynamic Pricing in High-Dimensions , 2016, J. Mach. Learn. Res..

[18]  Irina Gaynanova,et al.  Oracle inequalities for high-dimensional prediction , 2016, Bernoulli.

[19]  Shou-De Lin,et al.  Online Linear Optimization with Sparsity Constraints , 2019, ALT.

[20]  Roger B. Myerson,et al.  Optimal Auction Design , 1981, Math. Oper. Res..

[21]  Gábor Lugosi,et al.  Prediction, learning, and games , 2006 .

[22]  Tengyuan Liang,et al.  Adaptive Feature Selection: Computationally Efficient Online Sparse Linear Regression under RIP , 2017, ICML.

[23]  Johannes Lederer,et al.  Estimating the Lasso's Effective Noise , 2020, J. Mach. Learn. Res..

[24]  Virag Shah,et al.  Semi-parametric dynamic contextual pricing , 2019, NeurIPS.

[25]  Ambuj Tewari,et al.  Stochastic methods for l1 regularized loss minimization , 2009, ICML '09.

[26]  D. Simchi-Levi,et al.  A Statistical Learning Approach to Personalization in Revenue Management , 2015, Manag. Sci..

[27]  S. Bobkov,et al.  Concentration of the information in data with log-concave distributions , 2010, 1012.5457.

[28]  Martin J. Wainwright,et al.  A Practical Scheme and Fast Algorithm to Tune the Lasso With Optimality Guarantees , 2016, J. Mach. Learn. Res..

[29]  Jörg Becker,et al.  Enabling individualized recommendations and dynamic pricing of value-added services through willingness-to-pay data , 2010, Electron. Mark..

[30]  A. V. den Boer,et al.  Dynamic Pricing and Learning: Historical Origins, Current Research, and New Directions , 2013 .

[31]  Josef Broder,et al.  Dynamic Pricing Under a General Parametric Choice Model , 2012, Oper. Res..

[32]  M.D. Anderson,et al.  Dynamic pricing [of electricity] , 2000, IEEE Potentials.

[33]  Mee Young Park,et al.  L1‐regularization path algorithm for generalized linear models , 2007 .

[34]  John Langford,et al.  Sparse Online Learning via Truncated Gradient , 2008, NIPS.