论文信息 - Online Regularization for High-Dimensional Dynamic Pricing Algorithms

Online Regularization for High-Dimensional Dynamic Pricing Algorithms

We propose a novel \textit{online regularization} scheme for revenue-maximization in high-dimensional dynamic pricing algorithms. The online regularization scheme equips the proposed optimistic online regularized maximum likelihood pricing (\texttt{OORMLP}) algorithm with three major advantages: encode market noise knowledge into pricing process optimism; empower online statistical learning with always-validity over all decision points; envelop prediction error process with time-uniform non-asymptotic oracle inequalities. This type of non-asymptotic inference results allows us to design safer and more robust dynamic pricing algorithms in practice. In theory, the proposed \texttt{OORMLP} algorithm exploits the sparsity structure of high-dimensional models and obtains a logarithmic regret in a decision horizon. These theoretical advances are made possible by proposing an optimistic online LASSO procedure that resolves dynamic pricing problems at the \textit{process} level, based on a novel use of non-asymptotic martingale concentration. In experiments, we evaluate \texttt{OORMLP} in different synthetic pricing problem settings and observe that \texttt{OORMLP} performs better than \texttt{RMLP} proposed in \cite{javanmard2019dynamic}.

[1] Odalric-Ambrym Maillard. Mathematics of Statistical Sequential Decision Making. (Mathématique de la prise de décision séquentielle statistique) , 2019 .

[2] J. Wellner,et al. Log-Concavity and Strong Log-Concavity: a review. , 2014, Statistics surveys.

[3] Elad Hazan,et al. Introduction to Online Convex Optimization , 2016, Found. Trends Optim..

[4] Jean-Luc Ville. Étude critique de la notion de collectif , 1939 .

[5] Adel Javanmard,et al. Multi-Product Dynamic Pricing in High-Dimensions with Heterogeneous Price Sensitivity , 2019, 2020 IEEE International Symposium on Information Theory (ISIT).

[6] Yuancheng Zhu,et al. Uncertainty Quantification for Online Learning and Stochastic Approximation via Hierarchical Incremental Gradient Descent , 2018, 1802.04876.

[7] Zenglin Xu,et al. Online Learning for Group Lasso , 2010, ICML.

[8] N. B. Keskin,et al. Personalized Dynamic Pricing with Machine Learning: High Dimensional Features and Heterogeneous Elasticity , 2020 .

[9] Assaf J. Zeevi,et al. Dynamic Pricing with an Unknown Demand Model: Asymptotically Optimal Semi-Myopic Policies , 2014, Oper. Res..

[10] Yoram Singer,et al. A primal-dual perspective of online learning algorithms , 2007, Machine Learning.

[11] Jonas Mueller,et al. Low-rank Bandit Methods for High-dimensional Dynamic Pricing , 2018, NeurIPS.

[12] J. Harrison,et al. Dynamic Pricing with an Unknown Linear Demand Model : Asymptotically Optimal Semi-myopic Policies , 2011 .

[13] David Simchi-Levi,et al. Dynamic Learning and Pricing with Model Misspecification , 2018, Manag. Sci..

[14] Xi Chen,et al. Uncertainty Quantification for Demand Prediction in Contextual Dynamic Pricing , 2020, Production and Operations Management.

[15] Gábor Lugosi,et al. Concentration Inequalities - A Nonasymptotic Theory of Independence , 2013, Concentration Inequalities.

[16] Martin J. Wainwright,et al. High-Dimensional Statistics , 2019 .

[17] Adel Javanmard,et al. Dynamic Pricing in High-Dimensions , 2016, J. Mach. Learn. Res..

[18] Irina Gaynanova,et al. Oracle inequalities for high-dimensional prediction , 2016, Bernoulli.

[19] Shou-De Lin,et al. Online Linear Optimization with Sparsity Constraints , 2019, ALT.

[20] Roger B. Myerson,et al. Optimal Auction Design , 1981, Math. Oper. Res..

[21] Gábor Lugosi,et al. Prediction, learning, and games , 2006 .

[22] Tengyuan Liang,et al. Adaptive Feature Selection: Computationally Efficient Online Sparse Linear Regression under RIP , 2017, ICML.

[23] Johannes Lederer,et al. Estimating the Lasso's Effective Noise , 2020, J. Mach. Learn. Res..

[24] Virag Shah,et al. Semi-parametric dynamic contextual pricing , 2019, NeurIPS.

[25] Ambuj Tewari,et al. Stochastic methods for l1 regularized loss minimization , 2009, ICML '09.

[26] D. Simchi-Levi,et al. A Statistical Learning Approach to Personalization in Revenue Management , 2015, Manag. Sci..

[27] S. Bobkov,et al. Concentration of the information in data with log-concave distributions , 2010, 1012.5457.

[28] Martin J. Wainwright,et al. A Practical Scheme and Fast Algorithm to Tune the Lasso With Optimality Guarantees , 2016, J. Mach. Learn. Res..

[29] Jörg Becker,et al. Enabling individualized recommendations and dynamic pricing of value-added services through willingness-to-pay data , 2010, Electron. Mark..

[30] A. V. den Boer,et al. Dynamic Pricing and Learning: Historical Origins, Current Research, and New Directions , 2013 .

[31] Josef Broder,et al. Dynamic Pricing Under a General Parametric Choice Model , 2012, Oper. Res..

[32] M.D. Anderson,et al. Dynamic pricing [of electricity] , 2000, IEEE Potentials.

[33] Mee Young Park,et al. L1‐regularization path algorithm for generalized linear models , 2007 .

[34] John Langford,et al. Sparse Online Learning via Truncated Gradient , 2008, NIPS.